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Preface 


This 1s an introductory textbook for a first course in applied statistics and probability for un- 
dergraduate students in engineering and the physical or chemical sciences. These individuals 
play a significant role in designing and developing new products and manufacturing systems 
and processes, and they also improve existing systems. Statistical methods are an important 
tool in these activities because they provide the engineer with both descriptive and analytical 
methods for dealing with the variability in observed data. Although many of the methods we 
present are fundamental to statistical analysis in other disciplines, such as business and 
management, the life sciences, and the social sciences, we have elected to focus on an 
engineering-oriented audience. We believe that this approach will best serve students in 
engineering and the chemical/physical sciences and will allow them to concentrate on the 
many applications of statistics in these disciplines. We have worked hard to ensure that our ex- 
amples and exercises are engineering- and science-based, and in almost all cases we have used 
examples of real data—either taken from a published source or based on our consulting expe- 
riences. 

We believe that engineers in all disciplines should take at least one course in statistics. 
Unfortunately, because of other requirements, most engineers will only take one statistics 
course. This book can be used for a single course, although we have provided enough mate- 
rial for two courses in the hope that more students will see the important applications of sta- 
tistics in their everyday work and elect a second course. We believe that this book will also 
serve as a useful reference. 


ORGANIZATION OF THE BOOK 


We have retained the relatively modest mathematical level of the first two editions. We have 
found that engineering students who have completed one or two semesters of calculus should 
have no difficulty reading almost all of the text. It is our intent to give the reader an understand- 
ing of the methodology and how to apply it, not the mathematical theory. We have made many 
enhancements in this edition, including reorganizing and rewriting major portions of the book. 

Perhaps the most common criticism of engineering statistics texts 1s that they are too 
long. Both instructors and students complain that it is impossible to cover all of the topics in 
the book in one or even two terms. For authors, this 1s a serious issue because there is great va- 
riety in both the content and level of these courses, and the decisions about what material to 
delete without limiting the value of the text are not easy. After struggling with these issues, we 
decided to divide the text into two components; a set of core topics, many of which are most 
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likely to be covered in an engineering statistics course, and a set of supplementary topics, or 
topics that will be useful for some but not all courses. The core topics are in the printed book, 
and the complete text (both core and supplementary topics) is available on the CD that is 
included with the printed book. Decisions about topics to include in print and which to include 
only on the CD were made based on the results of a recent survey of instructors. 

The Interactive e-Text consists of the complete text and a wealth of additional material 
and features. The text and links on the CD are navigated using Adobe Acrobat!M, The links 
within the Interactive e-Text include the following: (1) from the Table of Contents to the se- 
lected eText sections, (2) from the Index to the selected topic within the e-7ext, (3) from refer- 
ence to a figure, table, or equation in one section to the actual figure, table, or equation in an- 
other section (all figures can be enlarged and printed), (4) from end-of-chapter Important 
Terms and Concepts to their definitions within the chapter, (5) from in-text boldfaced terms 
to their corresponding Glossary definitions and explanations, (6) from in-text references to the 
corresponding Appendix tables and charts, (7) from boxed-number end-of-chapter exercises 
(essentially most odd-numbered exercises) to their answers, (8) from some answers to the 
complete problem solution, and (9) from the opening splash screen to the textbook Web site. 

Chapter 1 is an introduction to the field of statistics and how engineers use statistical 
methodology as part of the engineering problem-solving process. This chapter also introduces 
the reader to some engineering applications of statistics, including building empirical models, 
designing engineering experiments, and monitoring manufacturing processes. These topics 
are discussed in more depth in subsequent chapters. 

Chapters 2, 3, 4, and 5 cover the basic concepts of probability, discrete and continuous 
random variables, probability distributions, expected values, joint probability distributions, 
and independence. We have given a reasonably complete treatment of these topics but have 
avoided many of the mathematical or more theoretical details. 

Chapter 6 begins the treatment of statistical methods with random sampling; data sum- 
mary and description techniques, including stem-and-leaf plots, histograms, box plots, and 
probability plotting; and several types of time series plots. Chapter 7 discusses point estimation 
of parameters. This chapter also introduces some of the important properties of estimators, the 
method of maximum likelihood, the method of moments, sampling distributions, and the cen- 
tral limit theorem. 

Chapter 8 discusses interval estimation for a single sample. Topics included are confi- 
dence intervals for means, variances or standard deviations, and proportions and prediction and 
tolerance intervals. Chapter 9 discusses hypothesis tests for a single sample. Chapter 10 pre- 
sents tests and confidence intervals for two samples. This material has been extensively rewrit- 
ten and reorganized. There is detailed information and examples of methods for determining 
appropriate sample sizes. We want the student to become familiar with how these techniques 
are used to solve real-world engineering problems and to get some understanding of the con- 
cepts behind them. We give a logical, heuristic development of the procedures, rather than a 
formal mathematical one. 

Chapters 11 and 12 present simple and multiple linear regression. We use matrix algebra 
throughout the multiple regression material (Chapter 12) because it 1s the only easy way to 
understand the concepts presented. Scalar arithmetic presentations of multiple regression are 
awkward at best, and we have found that undergraduate engineers are exposed to enough 
matrix algebra to understand the presentation of this material. 

Chapters 13 and 14 deal with single- and multifactor experiments, respectively. The no- 
tions of randomization, blocking, factorial designs, interactions, graphical data analysis, and 
fractional factorials are emphasized. Chapter 15 gives a brief introduction to the methods and 
applications of nonparametric statistics, and Chapter 16 introduces statistical quality control, 
emphasizing the control chart and the fundamentals of statistical process control. 
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Each chapter has an extensive collection of exercises, including end-of-section exercises 
that emphasize the material in that section, supplemental exercises at the end of the chapter 
that cover the scope of chapter topics, and mind-expanding exercises that often require the 
student to extend the text material somewhat or to apply it in a novel situation. As noted 
above, answers are provided to most odd-numbered exercises and the e-Text contains com- 
plete solutions to selected exercises. 


USING THE BOOK 


This is a very flexible textbook because instructors' ideas about what should be in a first 
course on statistics for engineers vary widely, as do the abilities of different groups of stu- 
dents. Therefore, we hesitate to give too much advice but will explain how we use the book. 

We believe that a first course in statistics for engineers should be primarily an applied sta- 
tistics course, not a probability course. In our one-semester course we cover all of Chapter 1 
(in one or two lectures); overview the material on probability, putting most of the emphasis on 
the normal distribution (six to eight lectures); discuss most of Chapters 6 though 10 on confi- 
dence intervals and tests (twelve to fourteen lectures); introduce regression models in 
Chapter 11 (four lectures); give an introduction to the design of experiments from Chapters 13 
and 14 (six lectures); and present the basic concepts of statistical process control, including 
the Shewhart control chart from Chapter 16 (four lectures). This leaves about three to four pe- 
riods for exams and review. Let us emphasize that the purpose of this course is to introduce 
engineers to how statistics can be used to solve real-world engineering problems, not to weed 
out the less mathematically gifted students. This course is not the “baby math-stat" course that 
is all too often given to engineers. 

If a second semester is available, it is possible to cover the entire book, including much 
of the e-Text material, if appropriate for the audience. It would also be possible to assign and 
work many of the homework problems in class to reinforce the understanding of the concepts. 
Obviously, multiple regression and more design of experiments would be major topics in a 
second course. 


USING THE COMPUTER 


In practice, engineers use computers to apply statistical methods to solve problems. Therefore, 
we strongly recommend that the computer be integrated into the class. Throughout the book we 
have presented output from Minitab as typical examples of what can be done with modern sta- 
tistical software. In teaching, we have used other software packages, including Statgraphics, 
JMP, and Statisticia. We did not clutter up the book with examples from many different packages 
because how the instructor integrates the software into the class is ultimately more important 
than which package is used. All text data is available in electronic form on the e-7ext CD. In 
some chapters, there are problems that we feel should be worked using computer software. We 
have marked these problems with a special icon in the margin. 

In our own classrooms, we use the computer in almost every lecture and demonstrate 
how the technique is implemented in software as soon as it is discussed in the lecture. 
Student versions of many statistical software packages are available at low cost, and students 
can either purchase their own copy or use the products available on the PC local area net- 
works. We have found that this greatly improves the pace of the course and student under- 
standing of the material. 
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USING THE WEB 


Additional resources for students and instructors can be found at www.wiley.com/college/ 
montgomery/. 
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CHAPTER OUTLINE 

1-1 THE ENGINEERING METHOD AND 1-2.5 A Factorial Experiment for the 
STATISTICAL THINKING Pull-off Force Problem (CD Only) 
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LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 

. Identify the role that statistics can play in the engineering problem-solving process 

. Discuss how variability affects the data collected and used for making engineering decisions 
. Explain the difference between enumerative and analytical studies 


. Discuss the different methods that engineers use to collect data 


Ui» W Ne 


. Identify the advantages that designed experiments have in comparison to other methods of col- 
lecting engineering data 


6. Explain the differences between mechanistic models and empirical models 


7. Discuss how probability and probability models are used in engineering and science 


CD MATERIAL 


8. Explain the factorial experimental design. 


9. Explain how factors can Interact. 


Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e- Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
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available for some of the text sections that appear on CD only. These exercises may be found within 
the e-Text immediately following the section they accompany. 


1-1 THE ENGINEERING METHOD AND STATISTICAL THINKING 


Figure 1-1 The 
engineering method. 


An engineer is someone who solves problems of interest to society by the efficient application 
of scientific principles. Engineers accomplish this by either refining an existing product or 
process or by designing a new product or process that meets customers’ needs. The engineering, 
or scientific, method is the approach to formulating and solving these problems. The steps in 
the engineering method are as follows: 


1. Develop a clear and concise description of the problem. 


2. Identify, at least tentatively, the important factors that affect this problem or that may 
play a role in its solution. 


3. Propose a model for the problem, using scientific or engineering knowledge of the 
phenomenon being studied. State any limitations or assumptions of the model. 


4. Conduct appropriate experiments and collect data to test or validate the tentative 
model or conclusions made in steps 2 and 3. 


Refine the model on the basis of the observed data. 
Manipulate the model to assist in developing a solution to the problem. 


Conduct an appropriate experiment to confirm that the proposed solution to the prob- 
lem is both effective and efficient. 


8. Draw conclusions or make recommendations based on the problem solution. 


The steps in the engineering method are shown in Fig. 1-1. Notice that the engineering method 
features a strong interplay between the problem, the factors that may influence its solution, a 
model of the phenomenon, and experimentation to verify the adequacy of the model and the 
proposed solution to the problem. Steps 2—4 in Fig. 1-1 are enclosed in a box, indicating that 
several cycles or iterations of these steps may be required to obtain the final solution. 
Consequently, engineers must know how to efficiently plan experiments, collect data, analyze 
and interpret the data, and understand how the observed data are related to the model they 
have proposed for the problem under study. 

The field of statistics deals with the collection, presentation, analysis, and use of data to 
make decisions, solve problems, and design products and processes. Because many aspects of 
engineering practice involve working with data, obviously some knowledge of statistics is 
important to any engineer. Specifically, statistical techniques can be a powerful aid in design- 
ing new products and systems, improving existing designs, and designing, developing, and 
improving production processes. 
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Statistical methods are used to help us describe and understand variability. By variability, 
we mean that successive observations of a system or phenomenon do not produce exactly the 
same result. We all encounter variability in our everyday lives, and statistical thinking can 
give us a useful way to incorporate this variability into our decision-making processes. For 
example, consider the gasoline mileage performance of your car. Do you always get exactly the 
same mileage performance on every tank of fuel? Of course not—1in fact, sometimes the mileage 
performance varies considerably. This observed variability in gasoline mileage depends on 
many factors, such as the type of driving that has occurred most recently (city versus highway), 
the changes in condition of the vehicle over time (which could include factors such as tire 
inflation, engine compression, or valve wear), the brand and/or octane number of the gasoline 
used, or possibly even the weather conditions that have been recently experienced. These factors 
represent potential sources of variability in the system. Statistics gives us a framework for 
describing this variability and for learning about which potential sources of variability are the 
most important or which have the greatest impact on the gasoline mileage performance. 

We also encounter variability in dealing with engineering problems. For example, sup- 
pose that an engineer is designing a nylon connector to be used in an automotive engine 
application. The engineer is considering establishing the design specification on wall thick- 
ness at 3/32 inch but is somewhat uncertain about the effect of this decision on the connector 
pull-off force. If the pull-off force 1s too low, the connector may fail when it is installed in an 
engine. Eight prototype units are produced and their pull-off forces measured, resulting in the 
following data (in pounds): 12.6, 12.9, 13.4, 12.3, 13.6, 13.5, 12.6, 13.1. As we anticipated, 
not all of the prototypes have the same pull-off force. We say that there 1s variability in the 
pull-off force measurements. Because the pull-off force measurements exhibit variability, we 
consider the pull-off force to be a random variable. A convenient way to think of a random 
variable, say X, that represents a measurement, is by using the model 


X=pte (1-1) 


where p. is a constant and e is a random disturbance. The constant remains the same with every 
measurement, but small changes in the environment, test equipment, differences in the indi- 
vidual parts themselves, and so forth change the value of e. If there were no disturbances, e 
would always equal zero and X would always be equal to the constant u. However, this never 
happens in the real world, so the actual measurements X exhibit variability. We often need to 
describe, quantify and ultimately reduce variability. 

Figure 1-2 presents a dot diagram of these data. The dot diagram is a very useful plot for 
displaying a small body of data—say, up to about 20 observations. This plot allows us to see eas- 
ily two features of the data; the location, or the middle, and the scatter or variability. When the 
number of observations is small, it is usually difficult to identify any specific patterns in the vari- 
ability, although the dot diagram is a convenient way to see any unusual data features. 

The need for statistical thinking arises often in the solution of engineering problems. 
Consider the engineer designing the connector. From testing the prototypes, he knows that the 
average pull-off force is 13.0 pounds. However, he thinks that this may be too low for the 
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Figure 1-2 Dot diagram of the pull-off force Figure 1-3 Dot diagram of pull-off force for two wall 
data when wall thickness is 3/32 inch. thicknesses. 
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intended application, so he decides to consider an alternative design with a greater wall 
thickness, 1/8 inch. Eight prototypes of this design are built, and the observed pull-off force 
measurements are 12.9, 13.7, 12.8, 13.9, 14.2, 13.2, 13.5, and 13.1. The average is 13.4. 
Results for both samples are plotted as dot diagrams in Fig. 1-3, page 3. This display gives 
the impression that increasing the wall thickness has led to an increase in pull-off force. 
However, there are some obvious questions to ask. For instance, how do we know that an- 
other sample of prototypes will not give different results? Is a sample of eight prototypes 
adequate to give reliable results? If we use the test results obtained so far to conclude that 
increasing the wall thickness increases the strength, what risks are associated with this de- 
cision? For example, is it possible that the apparent increase in pull-off force observed in 
the thicker prototypes is only due to the inherent variability in the system and that increas- 
ing the thickness of the part (and its cost) really has no effect on the pull-off force? 

Often, physical laws (such as Ohm's law and the ideal gas law) are applied to help design 
products and processes. We are familiar with this reasoning from general laws to specific 
cases. But it is also important to reason from a specific set of measurements to more general 
cases to answer the previous questions. This reasoning is from a sample (such as the eight con- 
nectors) to a population (such as the connectors that will be sold to customers). The reasoning 
is referred to as statistical inference. See Fig. 1-4. Historically, measurements were obtained 
from a sample of people and generalized to a population, and the terminology has remained. 
Clearly, reasoning based on measurements from some objects to measurements on all objects 
can result in errors (called sampling errors). However, if the sample is selected properly, these 
risks can be quantified and an appropriate sample size can be determined. 

In some cases, the sample is actually selected from a well-defined population. The sam- 
ple is a subset of the population. For example, in a study of resistivity a sample of three wafers 
might be selected from a production lot of wafers in semiconductor manufacturing. Based on 
the resistivity data collected on the three wafers in the sample, we want to draw a conclusion 
about the resistivity of all of the wafers in the lot. 

In other cases, the population is conceptual (such as with the connectors), but it might be 
thought of as future replicates of the objects in the sample. In this situation, the eight proto- 
type connectors must be representative, in some sense, of the ones that will be manufactured 
in the future. Clearly, this analysis requires some notion of stability as an additional assump- 
tion. For example, it might be assumed that the sources of variability in the manufacture of the 
prototypes (such as temperature, pressure, and curing time) are the same as those for the con- 
nectors that will be manufactured in the future and ultimately sold to customers. 


| | | | 
| Population | | | 
Physical Population | ! ? | | | 
laws | | | Future | 
| : | population | 
? 

TUNE | | | | 
Types oF Statistical inference | Sanne | S | | 
reasoning | | | | 
CL s \- MUN, 1 ERU [ 

Product Sample X1: Xares Xn Xir Xar». Xn 

Gesigns Enumerative Analytic 
study study 
Figure 1-4 Statistical inference 1s one type of Figure 1-5 Enumerative versus analytic study. 
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The wafers-from-lots example is called an enumerative study. A sample is used to make 
an inference to the population from which the sample is selected. The connector example is 
called an analytic study. A sample is used to make an inference to a conceptual (future) 
population. The statistical analyses are usually the same in both cases, but an analytic study 
clearly requires an assumption of stability. See Fig. 1-5, on page 4. 


1-2 COLLECTING ENGINEERING DATA 
1-2.1 Basic Principles 


In the previous section, we illustrated some simple methods for summarizing data. In the en- 
gineering environment, the data is almost always a sample that has been selected from some 
population. Three basic methods of collecting data are 


e A retrospective study using historical data 
e An observational study 


e A designed experiment 


An effective data collection procedure can greatly simplify the analysis and lead to improved 
understanding of the population or process that is being studied. We now consider some ex- 
amples of these data collection methods. 


1-2.2 Retrospective Study 


Montgomery, Peck, and Vining (2001) describe an acetone-butyl alcohol distillation 
column for which concentration of acetone in the distillate or output product stream is an 
important variable. Factors that may affect the distillate are the reboil temperature, the con- 
densate temperature, and the reflux rate. Production personnel obtain and archive the 
following records: 


e The concentration of acetone in an hourly test sample of output product 
e The reboil temperature log, which is a plot of the reboil temperature over time 
e The condenser temperature controller log 


e The nominal reflux rate each hour 


The reflux rate should be held constant for this process. Consequently, production personnel 
change this very infrequently. 

A retrospective study would use either all or a sample of the historical process data 
archived over some period of time. The study objective might be to discover the relationships 
among the two temperatures and the reflux rate on the acetone concentration in the output 
product stream. However, this type of study presents some problems: 


1. We may not be able to see the relationship between the reflux rate and acetone con- 
centration, because the reflux rate didn’t change much over the historical period. 


2. The archived data on the two temperatures (which are recorded almost continu- 
ously) do not correspond perfectly to the acetone concentration measurements 
(which are made hourly). It may not be obvious how to construct an approximate 
correspondence. 
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3. Production maintains the two temperatures as closely as possible to desired targets or 
set points. Because the temperatures change so little, it may be difficult to assess their 
real impact on acetone concentration. 


4. Within the narrow ranges that they do vary, the condensate temperature tends to in- 
crease with the reboil temperature. Consequently, the effects of these two process 
variables on acetone concentration may be difficult to separate. 


AS you can see, a retrospective study may involve a lot of data, but that data may contain 
relatively little useful information about the problem. Furthermore, some of the relevant 
data may be missing, there may be transcription or recording errors resulting in outliers 
(or unusual values), or data on other important factors may not have been collected and 
archived. In the distillation column, for example, the specific concentrations of butyl alco- 
hol and acetone in the input feed stream are a very important factor, but they are not 
archived because the concentrations are too hard to obtain on a routine basis. As a result of 
these types of issues, statistical analysis of historical data sometimes identify interesting 
phenomena, but solid and reliable explanations of these phenomena are often difficult to 
obtain. 


1-2.3 Observational Study 


In an observational study, the engineer observes the process or population, disturbing it as lit- 
tle as possible, and records the quantities of interest. Because these studies are usually con- 
ducted for a relatively short time period, sometimes variables that are not routinely measured 
can be included. In the distillation column, the engineer would design a form to record the two 
temperatures and the reflux rate when acetone concentration measurements are made. It may 
even be possible to measure the input feed stream concentrations so that the impact of this fac- 
tor could be studied. Generally, an observational study tends to solve problems 1 and 2 above 
and goes a long way toward obtaining accurate and reliable data. However, observational 
studies may not help resolve problems 3 and 4. 


1-2.4 Designed Experiments 


In a designed experiment the engineer makes deliberate or purposeful changes in the control- 
lable variables of the system or process, observes the resulting system output data, and then 
makes an inference or decision about which variables are responsible for the observed changes 
in output performance. The nylon connector example in Section 1-1 illustrates a designed ex- 
periment; that is, a deliberate change was made in the wall thickness of the connector with the 
objective of discovering whether or not a greater pull-off force could be obtained. Designed 
experiments play a very important role in engineering design and development and in the 
improvement of manufacturing processes. Generally, when products and processes are designed 
and developed with designed experiments, they enjoy better performance, higher reliability, and 
lower overall costs. Designed experiments also play a crucial role in reducing the lead time for 
engineering design and development activities. 

For example, consider the problem involving the choice of wall thickness for the 
nylon connector. This is a simple illustration of a designed experiment. The engineer chose 
two wall thicknesses for the connector and performed a series of tests to obtain pull-off 
force measurements at each wall thickness. In this simple comparative experiment, the 
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engineer is interested in determining if there is any difference between the 3/32- and 
1/8-inch designs. An approach that could be used in analyzing the data from this experi- 
ment is to compare the mean pull-off force for the 3/32-inch design to the mean pull-off 
force for the 1/8-inch design using statistical hypothesis testing, which is discussed in 
detail in Chapters 9 and 10. Generally, a hypothesis is a statement about some aspect of the 
system in which we are interested. For example, the engineer might want to know if the 
mean pull-off force of a 3/32-inch design exceeds the typical maximum load expected to 
be encountered in this application, say 12.75 pounds. Thus, we would be interested in test- 
ing the hypothesis that the mean strength exceeds 12.75 pounds. This is called a single- 
sample hypothesis testing problem. It is also an example of an analytic study. Chapter 9 
presents techniques for this type of problem. Alternatively, the engineer might be inter- 
ested in testing the hypothesis that increasing the wall thickness from 3/32- to 1/8-inch 
results in an increase in mean pull-off force. Clearly, this is an analytic study; it is also an 
example of a two-sample hypothesis testing problem. Two-sample hypothesis testing 
problems are discussed in Chapter 10. 

Designed experiments are a very powerful approach to studying complex systems, such 
as the distillation column. This process has three factors, the two temperatures and the reflux 
rate, and we want to investigate the effect of these three factors on output acetone concentra- 
tion. A good experimental design for this problem must ensure that we can separate the effects 
of all three factors on the acetone concentration. The specified values of the three factors used 
in the experiment are called factor levels. Typically, we use a small number of levels for each 
factor, such as two or three. For the distillation column problem, suppose we use a “high,” and 
“low,” level (denoted +1 and — 1, respectively) for each of the factors. We thus would use two 
levels for each of the three factors. A very reasonable experiment design strategy uses every 
possible combination of the factor levels to form a basic experiment with eight different set- 
tings for the process. This type of experiment is called a factorial experiment. Table 1-1 pres- 
ents this experimental design. 

Figure 1-6, on page 8, illustrates that this design forms a cube in terms of these high and 
low levels. With each setting of the process conditions, we allow the column to reach equilib- 
rium, take a sample of the product stream, and determine the acetone concentration. We then 
can draw specific inferences about the effect of these factors. Such an approach allows us to 
proactively study a population or process. Designed experiments play a very important role in 
engineering and science. Chapters 13 and 14 discuss many of the important principles and 
techniques of experimental design. 


Table 1-1 The Designed Experiment (Factorial Design) for the 
Distillation Column 


Reboil Temp. Condensate Temp. Reflux Rate 
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Figure 1-6 The fac- 
torial design for the 
distillation column. 
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Reboil temperature 


1-2.5 A Factorial Experiment for the Connector Pull-off 
Force Problem (CD Only) 


1-2.6 Observing Processes Over Time 


Figure 1-7 The dot 
diagram illustrates 
variation but does not 
identify the problem. 


Often data are collected over time. In this case, it is usually very helpful to plot the data ver- 
sus time in a time series plot. Phenomena that might affect the system or process often be- 
come more visible in a time-oriented plot and the concept of stability can be better judged. 

Figure 1-7 is a dot diagram of acetone concentration readings taken hourly from the 
distillation column described in Section 1-2.2. The large variation displayed on the dot 
diagram indicates a lot of variability in the concentration, but the chart does not help explain 
the reason for the variation. The time series plot is shown in Figure 1-8, on page 9. A shift 
in the process mean level is visible in the plot and an estimate of the time of the shift can be 
obtained. 

W. Edwards Deming, a very influential industrial statistician, stressed that it is important 
to understand the nature of variability in processes and systems over time. He conducted an 
experiment in which he attempted to drop marbles as close as possible to a target on a table. 
He used a funnel mounted on a ring stand and the marbles were dropped into the funnel. See 
Fig. 1-9. The funnel was aligned as closely as possible with the center of the target. He then 
used two different strategies to operate the process. (1) He never moved the funnel. He just 
dropped one marble after another and recorded the distance from the target. (2) He dropped 
the first marble and recorded its location relative to the target. He then moved the funnel an 
equal and opposite distance in an attempt to compensate for the error. He continued to make 
this type of adjustment after each marble was dropped. 

After both strategies were completed, he noticed that the variability of the distance 
from the target for strategy 2 was approximately 2 times larger than for strategy 1. The ad- 
justments to the funnel increased the deviations from the target. The explanation is that the 
error (the deviation of the marble's position from the target) for one marble provides no 
information about the error that will occur for the next marble. Consequently, adjustments 
to the funnel do not decrease future errors. Instead, they tend to move the funnel farther 
from the target. 

This interesting experiment points out that adjustments to a process based on random dis- 
turbances can actually increase the variation of the process. This is referred to as overcontrol 
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Figure 1-8 A time series plot of concentration provides Figure 1-9 Deming's funnel experiment. 


more information than the dot diagram. 


Figure 1-10 Adjust- 
ments applied to 
random disturbances 
overcontrol the process 
and increase the devia- 
tions from the target. 


or tampering. Adjustments should be applied only to compensate for a nonrandom shift in 
the process—then they can help. A computer simulation can be used to demonstrate the les- 
sons of the funnel experiment. Figure 1-10 displays a time plot of 100 measurements 
(denoted as y) from a process in which only random disturbances are present. The target 
value for the process is 10 units. The figure displays the data with and without adjustments 
that are applied to the process mean in an attempt to produce data closer to target. Each 
adjustment is equal and opposite to the deviation of the previous measurement from target. 
For example, when the measurement is 11 (one unit above target), the mean is reduced by 
one unit before the next measurement is generated. The overcontrol has increased the devia- 
tions from the target. 

Figure 1-11 displays the data without adjustment from Fig. 1-10, except that the measure- 
ments after observation number 50 are increased by two units to simulate the effect of a shift 
in the mean of the process. When there is a true shift in the mean of a process, an adjustment 
can be useful. Figure 1-11 also displays the data obtained when one adjustment (a decrease of 
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Figure 1-11 Process 
mean shift is detected 
at observation number 
57, and one adjustment 
(a decrease of two 
units) reduces the 
deviations from target. 


Figure 1-12 A 
control chart for the 
chemical process 
concentration data. 
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two units) is applied to the mean after the shift is detected (at observation number 57). Note 
that this adjustment decreases the deviations from target. 

The question of when to apply adjustments (and by what amounts) begins with an under- 
standing of the types of variation that affect a process. A control chart is an invaluable way 
to examine the variability in time-oriented data. Figure 1-12 presents a control chart for the 
concentration data from Fig. 1-8. The center line on the control chart is just the average of the 
concentration measurements for the first 20 samples (x = 91.5 g/1) when the process is sta- 
ble. The upper control limit and the lower control limit are a pair of statistically derived lim- 
its that reflect the inherent or natural variability in the process. These limits are located three 
standard deviations of the concentration values above and below the center line. If the process 
is operating as it should, without any external sources of variability present in the system, the 
concentration measurements should fluctuate randomly around the center line, and almost all 
of them should fall between the control limits. 

In the control chart of Fig. 1-12, the visual frame of reference provided by the center line 
and the control limits indicates that some upset or disturbance has affected the process around 
sample 20 because all of the following observations are below the center line and two of them 
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actually fall below the lower control limit. This is a very strong signal that corrective action is 
required in this process. If we can find and eliminate the underlying cause of this upset, we can 
improve process performance considerably. 

Control charts are a very important application of statistics for monitoring, controlling, 
and improving a process. The branch of statistics that makes use of control charts is called sta- 
tistical process control, or SPC. We will discuss SPC and control charts in Chapter 16. 


1-3 MECHANISTIC AND EMPIRICAL MODELS 


Models play an important role in the analysis of nearly all engineering problems. Much of the 
formal education of engineers involves learning about the models relevant to specific fields 
and the techniques for applying these models in problem formulation and solution. As a sim- 
ple example, suppose we are measuring the flow of current in a thin copper wire. Our model 
for this phenomenon might be Ohm's law: 


Current — voltage/resistance 
or 
I= E/R (1-2) 


We call this type of model a mechanistic model because it is built from our underlying 
knowledge of the basic physical mechanism that relates these variables. However, if we 
performed this measurement process more than once, perhaps at different times, or even on 
different days, the observed current could differ slightly because of small changes or varia- 
tions in factors that are not completely controlled, such as changes in ambient temperature, 
fluctuations in performance of the gauge, small impurities present at different locations in the 
wire, and drifts in the voltage source. Consequently, a more realistic model of the observed 
current might be 


I=E/R+e (1-3) 


where e is a term added to the model to account for the fact that the observed values of 
current flow do not perfectly conform to the mechanistic model. We can think of e as a 
term that includes the effects of all of the unmodeled sources of variability that affect this 
system. 

Sometimes engineers work with problems for which there is no simple or well- 
understood mechanistic model that explains the phenomenon. For instance, suppose we are 
interested in the number average molecular weight (M,) of a polymer. Now we know that M, 
is related to the viscosity ofthe material (V), and it also depends on the amount of catalyst (C ) 
and the temperature (T) in the polymerization reactor when the material is manufactured. The 
relationship between M, and these variables is 


say, where the form of the function f is unknown. Perhaps a working model could be devel- 
oped from a first-order Taylor series expansion, which would produce a model of the form 


M, = Bo + Biv + B2C + Bal (1-5) 
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where the Q's are unknown parameters. Now just as in Ohm's law, this model will not exactly 
describe the phenomenon, so we should account for the other sources of variability that may 
affect the molecular weight by adding another term to the model; therefore 


M, = Bo + BV + BoC + B37 +e (1-6) 


is the model that we will use to relate molecular weight to the other three variables. This type of 
model is called an empirical model; that is, it uses our engineering and scientific knowledge of 
the phenomenon, but it is not directly developed from our theoretical or first-principles under- 
standing of the underlying mechanism. 

To illustrate these ideas with a specific example, consider the data in Table 1-2. This table 
contains data on three variables that were collected in an observational study in a semicon- 
ductor manufacturing plant. In this plant, the finished semiconductor is wire bonded to a 
frame. The variables reported are pull strength (a measure of the amount of force required to 
break the bond), the wire length, and the height of the die. We would like to find a model 
relating pull strength to wire length and die height. Unfortunately, there is no physical mech- 
anism that we can easily apply here, so it doesn't seem likely that a mechanistic modeling 
approach will be successful. 


Table 1-2 Wire Bond Pull Strength Data 


Observation Pull Strength Wire Length Die Height 


Number y Xj X» 
1 9.95 2 50 
2 24.45 8 110 
3 31.75 11 120 
4 35.00 10 550 
5 25.02 8 295 
6 16.86 4 200 
7 14.38 2 375 
8 9.60 2 52 
9 24.35 9 100 

10 27.50 8 300 
11 17.08 4 412 
12 37.00 HH 400 
13 41.95 12 500 
14 11.66 2 360 
15 21.65 4 205 
16 17.89 4 400 
17 69.00 20 600 
18 10.30 j| 585 
19 34.93 10 540 
20 46.59 I3 250 
2] 44.88 15 290 
22 54.12 16 510 
23 56.63 17 590 
24 22.13 6 100 


23 21.15 5 400 


Figure 1-13  Three- 
dimensional plot of 
the wire and pull 
strength data. 


Figure 1-14 Plot of 
predicted values of 
pull strength from the 
empirical model. 
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Figure 1-13 presents a three-dimensional plot of all 25 observations on pull strength, wire 
length, and die height. From examination of this plot, we see that pull strength increases as both 
wire length and die height increase. Furthermore, it seems reasonable to think that a model such as 


Pull strength = Bo + Bi(wire length) + B,(die height) + € 


would be appropriate as an empirical model for this relationship. In general, this type of em- 
pirical model is called a regression model. In Chapters 11 and 12 we show how to build 
these models and test their adequacy as approximating functions. We will use a method for 
estimating the parameters in regression models, called the method of least squares, that 
traces its origins to work by Karl Gauss. Essentially, this method chooses the parameters in 
the empirical model (the B's) to minimize the sum of the squared distances between each 
data point and the plane represented by the model equation. Applying this technique to the 
data in Table 1-2 results in 


"un t . . . 
Pull strength = 2.26 + 2.74(wire length) + 0.0125(die height) (1-7) 
where the “hat,” or circumflex, over pull strength indicates that this is an estimated or pre- 
dicted quantity. 

Figure 1-14 is a plot of the predicted values of pull strength versus wire length and die 
height obtained from Equation 1-7. Notice that the predicted values lie on a plane above the 
wire length—die height space. From the plot of the data in Fig. 1-13, this model does not ap- 
pear unreasonable. The empirical model in Equation 1-7 could be used to predict values of 
pull strength for various combinations of wire length and die height that are of interest. 
Essentially, the empirical model could be used by an engineer in exactly the same way that 
a mechanistic model can be used. 
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1-4 PROBABILITY AND PROBABILITY MODELS 


In Section 1-1, it was mentioned that decisions often need to be based on measurements from 
only a subset of objects selected in a sample. This process of reasoning from a sample of 
objects to conclusions for a population of objects was referred to as statistical inference. A 
sample of three wafers selected from a larger production lot of wafers in semiconductor man- 
ufacturing was an example mentioned. To make good decisions, an analysis of how well a 
sample represents a population is clearly necessary. If the lot contains defective wafers, how 
well will the sample detect this? How can we quantify the criterion to “detect well"? Basically, 
how can we quantify the risks of decisions based on samples? Furthermore, how should sam- 
ples be selected to provide good decisions—ones with acceptable risks? Probability models 
help quantify the risks involved in statistical inference, that 1s, the risks involved in decisions 
made every day. 

More details are useful to describe the role of probability models. Suppose a production 
lot contains 25 wafers. If all the wafers are defective or all are good, clearly any sample will 
generate all defective or all good wafers, respectively. However, suppose only one wafer in 
the lot is defective. Then a sample might or might not detect (include) the wafer. A probabil- 
ity model, along with a method to select the sample, can be used to quantify the risks that the 
defective wafer is or is not detected. Based on this analysis, the size of the sample might be 
increased (or decreased). The risk here can be interpreted as follows. Suppose a series of lots, 
each with exactly one defective wafer, are sampled. The details of the method used to select 
the sample are postponed until randomness is discussed in the next chapter. Nevertheless, 
assume that the same size sample (such as three wafers) 1s selected in the same manner from 
each lot. The proportion of the lots in which the defective wafer 1s included in the sample or, 
more specifically, the limit of this proportion as the number of lots in the series tends to infin- 
ity, 1s interpreted as the probability that the defective wafer is detected. 

A probability model is used to calculate this proportion under reasonable assumptions for 
the manner in which the sample is selected. This is fortunate because we do not want to at- 
tempt to sample from an infinite series of lots. Problems of this type are worked in Chapters 2 
and 3. More importantly, this probability provides valuable, quantitative information regard- 
ing any decision about lot quality based on the sample. 

Recall from Section 1-1 that a population might be conceptual, as in an analytic study that 
applies statistical inference to future production based on the data from current production. 
When populations are extended in this manner, the role of statistical inference and the associ- 
ated probability models becomes even more important. 

In the previous example, each wafer in the sample was only classified as defective or not. 
Instead, a continuous measurement might be obtained from each wafer. In Section 1-2.6, con- 
centration measurements were taken at periodic intervals from a production process. Figure 1-7 
shows that variability is present in the measurements, and there might be concern that the 
process has moved from the target setting for concentration. Similar to the defective wafer, 
one might want to quantify our ability to detect a process change based on the sample data. 
Control limits were mentioned in Section 1-2.6 as decision rules for whether or not to adjust 
a process. The probability that a particular process change is detected can be calculated with 
a probability model for concentration measurements. Models for continous measurements are 
developed based on plausible assumptions for the data and a result known as the central limit 
theorem, and the associated normal distribution is a particularly valuable probability model 
for statistical inference. Of course, a check of assumptions is important. These types of prob- 
ability models are discussed in Chapter 4. The objective is still to quantify the risks inherent 
in the inference made from the sample data. 
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Throughout Chapters 6 through 15, decisions are based statistical inference from sample 
data. Continuous probability models, specifically the normal distribution, are used extensively 
to quantify the risks in these decisions and to evaluate ways to collect the data and how large 
a sample should be selected. 
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1-2.5 A Factorial Experiment for the Connector Pull-off Force Problem 


(CD only) 


Much of what we know in the engineering and physical-chemical sciences is developed 
through testing or experimentation. Often engineers work in problem areas in which no 
scientific or engineering theory is directly or completely applicable, so experimentation 
and observation of the resulting data constitute the only way that the problem can be 
solved. Even when there is a good underlying scientific theory that we may rely on to 
explain the phenomena of interest, it is almost always necessary to conduct tests or exper- 
iments to confirm that the theory is indeed operative in the situation or environment in 
which it is being applied. We have observed that statistical thinking and statistical methods 
play an important role in planning, conducting, and analyzing the data from engineering 
experiments. 

To further illustrate the factorial design concept introduced in Section 1-2.4, suppose that 
in the connector wall thickness example, there are two additional factors of interest, time and 
temperature. The cure times of interest are 1 and 24 hours and the temperature levels are 70°F 
and 100°F. Now since all three factors have two levels, a factorial experiment would consist 
of the eight test combinations shown at the corners of the cube in Fig. S1-1. Two trials, or 
replicates, would be performed at each corner, resulting in a 16-run factorial experiment. The 
observed values of pull-off force are shown in parentheses at the cube corners in Fig. SI-1. 
Notice that this experiment uses eight 3/32-inch prototypes and eight 1/8-inch prototypes, the 
same number used in the simple comparative study in Section 1-1, but we are now investigat- 
ing three factors. Generally, factorial experiments are the most efficient way to study the joint 
effects of several factors. 

Some very interesting tentative conclusions can be drawn from this experiment. First, 
compare the average pull-off force of the eight 3/32-inch prototypes with the average pull-off 
force of the eight 1/8-inch prototypes (these are the averages of the eight runs on the left face 
and right face of the cube in Fig. S1-1, respectively), or 14.1 — 13.45 = 0.65. Thus, increas- 
ing the wall thickness from 3/32 to 1/8-inch increases the average pull-off force by 0.65 
pounds. Next, to measure the effect of increasing the cure time, compare the average of the 
eight runs in the back face of the cube (where time — 24 hours) with the average of the eight 
runs in the front face (where time = 1 hour), or 14.275 — 13.275 = 1. The effect of increas- 
ing the cure time from 1 to 24 hours is to increase the average pull-off force by 1 pound; that 
is, cure time apparently has an effect that is larger than the effect of increasing the wall 
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Figure S1-2 The two-factor interaction between cure time and cure temperature. 


thickness. The cure temperature effect can be evaluated by comparing the average of the eight 
runs in the top of the cube (where temperature = 100°F) with the average of the eight runs in 
the bottom (where temperature = 70°F), or 14.125 — 13.425 = 0.7. Thus, the effect of in- 
creasing the cure temperature is to increase the average pull-off force by 0.7 pounds. Thus, if 
the engineer's objective 1s to design a connector with high pull-off force, there are apparently 
several alternatives, such as increasing the wall thickness and using the “standard” curing 
conditions of 1 hour and 70°F or using the original 3/32-inch wall thickness but specifying a 
longer cure time and higher cure temperature. 

There is an interesting relationship between cure time and cure temperature that can be 
seen by examination of the graph in Fig. S1-2. This graph was constructed by calculating the 
average pull-off force at the four different combinations of time and temperature, plotting 
these averages versus time and then connecting the points representing the two temperature 
levels with straight lines. The slope of each of these straight lines represents the effect of cure 
time on pull-off force. Notice that the slopes of these two lines do not appear to be the same, 
indicating that the cure time effect 1s different at the two values of cure temperature. This is an 
example of an interaction between two factors. The interpretation of this interaction is very 
straightforward; if the standard cure time (1 hour) is used, cure temperature has little effect, 
but if the longer cure time (24 hours) is used, increasing the cure temperature has a large effect 
on average pull-off force. Interactions occur often in physical and chemical systems, and 
factorial experiments are the only way to investigate their effects. In fact, if interactions are 
present and the factorial experimental strategy 1s not used, incorrect or misleading results may 
be obtained. 

We can easily extend the factorial strategy to more factors. Suppose that the engineer 
wants to consider a fourth factor, type of adhesive. There are two types: the standard 
adhesive and a new competitor. Figure $1-3 illustrates how all four factors, wall thickness, 
cure time, cure temperature, and type of adhesive, could be investigated in a factorial 
design. Since all four factors are still at two levels, the experimental design can still be 
represented geometrically as a cube (actually, it’s a hypercube). Notice that as in any fac- 
torial design, all possible combinations of the four factors are tested. The experiment re- 
quires 16 trials. 
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Figure S1-3 A four-factorial experiment for the connector wall thick- 
ness problem. 


Generally, if there are k factors and they each have two levels, a factorial experimental 
design will require 2* runs. For example, with k = 4, the 2^ design in Fig. S1-3 requires 16 
tests. Clearly, as the number of factors increases, the number of trials required in a factorial 
experiment increases rapidly; for instance, eight factors each at two levels would require 
256 trials. This quickly becomes unfeasible from the viewpoint of time and other resources. 
Fortunately, when there are four to five or more factors, it 1s usually unnecessary to test all 
possible combinations of factor levels. A fractional factorial experiment is a variation of 
the basic factorial arrangement in which only a subset of the factor combinations are actu- 
ally tested. Figure S1-4 shows a fractional factorial experimental design for the four-factor 
version of the connector experiment. The circled test combinations in this figure are the 
only test combinations that need to be run. This experimental design requires only 8 runs in- 
stead of the original 16; consequently it would be called a one-half fraction. This is an ex- 
cellent experimental design in which to study all four factors. It will provide good informa- 
tion about the individual effects of the four factors and some information about how these 
factors interact. 

Factorial and fractional factorial experiments are used extensively by engineers and sci- 
entists in industrial research and development, where new technology, products, and 
processes are designed and developed and where existing products and processes are im- 
proved. Since so much engineering work involves testing and experimentation, it is essential 
that all engineers understand the basic principles of planning efficient and effective 
experiments. We discuss these principles in Chapter 13. Chapter 14 concentrates on the facto- 
rial and fractional factorials that we have introduced here. 
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Figure S1-4 A fractional factorial experiment for the connector wall 
thickness problem. 
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Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e- Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e-Text immediately following the section they accompany. 


2-1 SAMPLE SPACES AND EVENTS 
2-1.1 Random Experiments 


If we measure the current in a thin copper wire, we are conducting an experiment. However, 
in day-to-day repetitions of the measurement the results can differ slightly because of small 
variations in variables that are not controlled in our experiment, including changes in ambient 
temperatures, slight variations in gauge and small impurities in the chemical composition of 
the wire if different locations are selected, and current source drifts. Consequently, this exper- 
iment (as well as many we conduct) is said to have a random component. In some cases, 
the random variations, are small enough, relative to our experimental goals, that they can be 
ignored. However, no matter how carefully our experiment is designed and conducted, the 
variation is almost always present, and its magnitude can be large enough that the important 
conclusions from our experiment are not obvious. In these cases, the methods presented in this 
book for modeling and analyzing experimental results are quite valuable. 

Our goal is to understand, quantify, and model the type of variations that we often 
encounter. When we incorporate the variation into our thinking and analyses, we can make 
informed judgments from our results that are not invalidated by the variation. 

Models and analyses that include variation are not different from models used in other areas 
of engineering and science. Figure 2-1 displays the important components. A mathematical 
model (or abstraction) of the physical system is developed. It need not be a perfect abstraction. 
For example, Newton's laws are not perfect descriptions of our physical universe. Still, they are 
useful models that can be studied and analyzed to approximately quantify the performance of a 
wide range of engineered products. Given a mathematical abstraction that is validated with 
measurements from our system, we can use the model to understand, describe, and quantify 
important aspects of the physical system and predict the response of the system to inputs. 

Throughout this text, we discuss models that allow for variations in the outputs of a sys- 
tem, even though the variables that we control are not purposely changed during our study. 
Figure 2-2 graphically displays a model that incorporates uncontrollable inputs (noise) that 
combine with the controllable inputs to produce the output of our system. Because of the 
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identifies deviations from the model. 


uncontrollable inputs, the same settings for the controllable inputs do not result in identical 
outputs every time the system is measured. 


Definition 
An experiment that can result in different outcomes, even though it is repeated in the 


same manner every time, is called a random experiment. 





For the example of measuring current in a copper wire, our model for the system might 
simply be Ohm's law. Because of uncontrollable inputs, variations in measurements of current 
are expected. Ohm's law might be a suitable approximation. However, if the variations are 
large relative to the intended use of the device under study, we might need to extend our model 
to include the variation. See Fig. 2-3. 

As another example, in the design of a communication system, such as a computer or 
voice communication network, the information capacity available to service individuals using 
the network is an important design consideration. For voice communication, sufficient 
external lines need to be purchased from the phone company to meet the requirements of a 
business. Assuming each line can carry only a single conversation, how many lines should be 
purchased? If too few lines are purchased, calls can be delayed or lost. The purchase of too 
many lines increases costs. Increasingly, design and product development is required to meet 
customer requirements at a competitive cost. 

In the design of the voice communication system, a model is needed for the number of calls 
and the duration of calls. Even knowing that on average, calls occur every five minutes and that 
they last five minutes is not sufficient. If calls arrived precisely at five-minute intervals and lasted 
for precisely five minutes, one phone line would be sufficient. However, the slightest variation in 
call number or duration would result in some calls being blocked by others. See Fig. 2-4. A system 
designed without considering variation will be woefully inadequate for practical use. Our model 
for the number and duration of calls needs to include variation as an integral component. An 
analysis of models including variation is important for the design of the phone system. 


2-1.2 Sample Spaces 


To model and analyze a random experiment, we must understand the set of possible out- 
comes from the experiment. In this introduction to probability, we make use of the basic 


Definition 


EXAMPLE 2-1 


Definition 
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concepts of sets and operations on sets. It is assumed that the reader is familiar with these 
topics. 


The set of all possible outcomes of a random experiment is called the sample space 


of the experiment. The sample space is denoted as S. 





A sample space is often defined based on the objectives of the analysis. 
Consider an experiment in which you select a molded plastic part, such as a connector, and 
measure its thickness. The possible values for thickness depend on the resolution of the meas- 
uring instrument, and they also depend on upper and lower bounds for thickness. However, it 
might be convenient to define the sample space as simply the positive real line 
See |e> 0} 

because a negative value for thickness cannot occur. 

If it is known that all connectors will be between 10 and 11 millimeters thick, the sample 
space could be 


S = {x]10<x< 11} 


If the objective of the analysis 1s to consider only whether a particular part is low, medium, 
or high for thickness, the sample space might be taken to be the set of three outcomes: 


S = {low, medium, high} 

If the objective of the analysis is to consider only whether or not a particular part con- 
forms to the manufacturing specifications, the sample space might be simplified to the set of 
two outcomes 

S = {yes, no} 


that indicate whether or not the part conforms. 


It is useful to distinguish between two types of sample spaces. 


A sample space is discrete if it consists of a finite or countable infinite set of outcomes. 


A sample space is continuous if it contains an interval (either finite or infinite) of 
real numbers. 





In Example 2-1, the choice S = R* is an example of a continuous sample space, whereas 
S — yes, no} is a discrete sample space. As mentioned, the best choice of a sample space 
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EXAMPLE 2-2 


depends on the objectives of the study. As specific questions occur later in the book, appro- 
priate sample spaces are discussed. 


If two connectors are selected and measured, the extension of the positive real line R is to take 
the sample space to be the positive quadrant of the plane: 


S= xc 


If the objective of the analysis 1s to consider only whether or not the parts conform to the 
manufacturing specifications, either part may or may not conform. We abbreviate yes and no 
as y and n. If the ordered pair yn indicates that the first connector conforms and the second 
does not, the sample space can be represented by the four outcomes: 


S = {yy, yn, ny, nn} 


If we are only interested in the number of conforming parts in the sample, we might sum- 
marize the sample space as 


S = {0, 1,2} 


As another example, consider an experiment in which the thickness is measured until a 
connector fails to meet the specifications. The sample space can be represented as 


S = ín, yn, yyn, yyyn, yyyyn, and so forth} 


In random experiments in which items are selected from a batch, we will indicate whether 
or not a selected item is replaced before the next one is selected. For example, if the batch 
consists of three items (a, b, c} and our experiment is to select two items without replace- 
ment, the sample space can be represented as 


Swithout = (ab, ac, ba, bc, ca, cb} 


This description of the sample space maintains the order of the items selected so that the out- 
come ab and ba are separate elements in the sample space. A sample space with less detail 
only describes the two items selected {{a, b}, (a, c}, (5, c}}. This sample space is the possi- 
ble subsets of two items. Sometimes the ordered outcomes are needed, but in other cases the 
simpler, unordered sample space is sufficient. 

If items are replaced before the next one 1s selected, the sampling is referred to as with 
replacement. Then the possible ordered outcomes are 


Swith = aa, ab, ac, ba, bb, bc, ca, cb, cc} 


The unordered description of the sample space is 1 (a, a}, (a, b}, {a,c}, {b,b}, {b,c}, 1e, ch}. 
Sampling without replacement is more common for industrial applications. 

Sometimes it is not necessary to specify the exact item selected, but only a property of the 
item. For example, suppose that there are 5 defective parts and 95 good parts in a batch. To 
study the quality of the batch, two are selected without replacement. Let g denote a good part 
and d denote a defective part. It might be sufficient to describe the sample space (ordered) in 
terms of quality of each part selected as 


S = (eg, gd, dg, dd} 


Figure 2-5 Tree 
diagram for three 
messages. 
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One must be cautious with this description of the sample space because there are many more 
pairs of items in which both are good than pairs in which both are defective. These differences 
must be accounted for when probabilities are computed later in this chapter. Still, this sum- 
mary of the sample space will be convenient when conditional probabilities are used later in 
this chapter. Also, if there were only one defective part in the batch, there would be fewer 
possible outcomes 


S = {gg, gd, dg} 


because dd would be impossible. For sampling questions, sometimes the most important part 
of the solution is an appropriate description of the sample space. 

Sample spaces can also be described graphically with tree diagrams. When a sample 
space can be constructed in several steps or stages, we can represent each of the n, ways of 
completing the first step as a branch of a tree. Each of the ways of completing the second step 
can be represented as n, branches starting from the ends of the original branches, and so forth. 


Each message in a digital communication system is classified as to whether it is received 
within the time specified by the system design. If three messages are classified, use a tree 
diagram to represent the sample space of possible outcomes. 

Each message can either be received on time or late. The possible results for three mes- 
sages can be displayed by eight branches in the tree diagram shown in Fig. 2-5. 


An automobile manufacturer provides vehicles equipped with selected options. Each vehicle 
is ordered 


With or without an automatic transmis- With one of three choices of a stereo 
sion system 
With or without air-conditioning With one of four exterior colors 


If the sample space consists of the set of all possible vehicle types, what 1s the number of 
outcomes in the sample space? The sample space contains 48 outcomes. The tree diagram for 
the different types of vehicles is displayed in Fig. 2-6. 


Consider an extension of the automobile manufacturer illustration in the previous example in 
which another vehicle option is the interior color. There are four choices of interior color: red, 
black, blue, or brown. However, 

With a red exterior, only a black or red interior can be chosen. 


With a white exterior, any interior color can be chosen. 
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Figure 2-6 Tree diagram for different types of vehicles. 


With a blue exterior, only a black, red, or blue interior can be chosen. 


With a brown exterior, only a brown interior can be chosen. 


In Fig. 2-6, there are 12 vehicle types with each exterior color, but the number of interior 
color choices depends on the exterior color. As shown in Fig. 2-7, the tree diagram can be ex- 
tended to show that there are 120 different vehicle types in the sample space. 


2-1.3 Events 


Often we are interested in a collection of related outcomes from a random experiment. 


Definition 


An event is a subset of the sample space of a random experiment. 





We can also be interested in describing new events from combinations of existing events. 
Because events are subsets, we can use basic set operations such as unions, intersections, and 
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complements to form other events of interest. Some of the basic set operations are summa- 
rized below in terms of events: 


e The union of two events is the event that consists of all outcomes that are contained 
in either of the two events. We denote the union as E, U Æ>. 


e The intersection of two events is the event that consists of all outcomes that are 
contained in both of the two events. We denote the intersection as E, N E». 


e The complement of an event in a sample space is the set of outcomes in the sample 
space that are not in the event. We denote the component of the event E as E’. 


Consider the sample space S = {yy, yn, ny, nn} in Example 2-2. Suppose that the set of all out- 
comes for which at least one part conforms is denoted as £,. Then, 


E, = by, yn, ny] 
The event in which both parts do not conform, denoted as Æ», contains only the single out- 
come, E, = {nn}. Other examples of events are E, = ©, the null set, and E, = S, the sample 


space. If E; = (yn, ny, nn}, 


EUE;—-S E QE; = {ynny} ££; = {nn} 


Measurements of the time needed to complete a chemical reaction might be modeled with the 
sample space S = R”, the set of positive real numbers. Let 


E, = {x| 1x < 10} and £,={x|3<x< 118} 
Then, 
By, Uo He | 1 =x, LS and E ME, = {x|3<x< 10} 
Also, 


Ej = {x |x = 10} and FY OE,={x|10=x< 118} 


Samples of polycarbonate plastic are analyzed for scratch and shock resistance. The results 
from 50 samples are summarized as follows: 


shock resistance 


high low 
scratch resistance high 40 4 
low l 5 


Let A denote the event that a sample has high shock resistance, and let B denote the event that a 
sample has high scratch resistance. Determine the number of samples in A N B, A', and A U B. 

The event A f| B consists of the 40 samples for which scratch and shock resistances 
are high. The event A’ consists of the 9 samples in which the shock resistance is low. The 
event A U B consists of the 45 samples in which the shock resistance, scratch resistance, 
or both are high. 
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(a) (b) (c) 
Sample space S with events A and B 


(d) (e) 
Figure 2-8 Venn diagrams. 


Diagrams are often used to portray relationships between sets, and these diagrams are 
also used to describe relationships between events. We can use Venn diagrams to represent a 
sample space and events in a sample space. For example, in Fig. 2-8(a) the sample space of 
the random experiment is represented as the points in the rectangle S. The events A and B are 
the subsets of points in the indicated regions. Figure 2-8(b) illustrates two events with no com- 


mon outcomes; Figs. 2-8(c) to 2-8(e) illustrate additional joint events. 
Two events with no outcomes in common have an important relationship. 


Definition 
Two events, denoted as E, and £5, such that 


E,(1 E, =Ø 


are said to be mutually exclusive. 





The two events in Fig. 2-8(b) are mutually exclusive, whereas the two events in Fig. 2-8(a) 


are not. 


Additional results involving events are summarized below. The definition of the comple- 


ment of an event implies that 
(EY =E 


The distributive law for set operations implies that 


(AUB)ANC=(ANC)U(BNO, and  (40nB)UC - (AU C) (BU C) 


DeMorgan’s laws imply that 


(AU B) =A'NB' and 


Also, remember that 


A(Y1B — B(Y1A and 


2-1.4 Counting Techniques (CD Only) 


2-] SAMPLE SPACES AND EVENTS 25 


(4B) = A' UB" 


AUB=BUA 


As sample spaces become larger, complete enumeration 1s difficult. Instead, counts of 
the number outcomes in the sample space and in various events are often used to analyze the 
random experiment. These methods are referred to as counting techniques and described on 


the CD. 


EXERCISES FOR SECTION 2-1 


Provide a reasonable description of the sample space for each 
of the random experiments in Exercises 2-1 to 2-18. There can 
be more than one acceptable interpretation of each experi- 
ment. Describe any assumptions you make. 


2-1. Each of three machined parts is classified as either 
above or below the target specification for the part. 


2-2. Each of four transmitted bits is classified as either in 
error or not in error. 


2-3. In the final inspection of electronic power supplies, 
three types of nonconformities might occur: functional, minor, 
or cosmetic. Power supplies that are defective are further clas- 
sified as to type of nonconformity. 


2-4. In the manufacturing of digital recording tape, elec- 
tronic testing is used to record the number of bits in error in a 
350-foot reel. 

2-5. Inthe manufacturing of digital recording tape, each of 
24 tracks is classified as containing or not containing one or 
more bits in error. 

2-6. An ammeter that displays three digits is used to meas- 
ure current in milliamperes. 

2-7. A scale that displays two decimal places is used to 
measure material feeds in a chemical plant in tons. 

2-8. The following two questions appear on an employee 
survey questionnaire. Each answer is chosen from the five- 
point scale 1 (never), 2, 3, 4, 5 (always). 


Is the corporation willing to listen to and fairly evaluate 
new ideas? 
How often are my coworkers important in my overall job 
performance? 
2-9. The concentration of ozone to the nearest part per billion. 


2-10. The time until a tranaction service 1s requested of a 
computer to the nearest millisecond. 


2-11. The pH reading of a water sample to the nearest tenth 
of a unit. 
2-12. The voids in a ferrite slab are classified as small, 


medium, or large. The number of voids in each category 1s 
measured by an optical inspection of a sample. 


2-13. The time of a chemical reaction is recorded to the 
nearest millisecond. 


2-14. An order for an automobile can specify either an 
automatic or a standard transmission, either with or without 
air-conditioning, and any one of the four colors red, blue, 
black or white. Describe the set of possible orders for this 
experiment. 


2-15. A sampled injection-molded part could have been 
produced in either one of two presses and in any one of the 
eight cavities in each press. 


2-16. An order for a computer system can specify memory 
of 4, 8, or 12 gigabytes, and disk storage of 200, 300, or 400 
gigabytes. Describe the set of possible orders. 


2-17. Calls are repeatedly placed to a busy phone line until 
a connect is achieved. 


2-18. Ina magnetic storage device, three attempts are made 
to read data before an error recovery procedure that reposi- 
tions the magnetic head is used. The error recovery procedure 
attempts three repositionings before an “abort” message is 
sent to the operator. Let 


s denote the success of a read operation 

f denote the failure of a read operation 

F denote the failure of an error recovery procedure 
S denote the success of an error recovery procedure 
A denote an abort message sent to the operator. 


Describe the sample space of this experiment with a tree 
diagram. 
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2-19. Three events are shown on the Venn diagram in the 
following figure: 


Reproduce the figure and shade the region that corresponds to 
each of the following events. 

(a) A' b) ANB 

(c) (ANB)UC (d (BU C)' 

(e) (4N B) UC 


2-20. Three events are shown on the Venn diagram in the 
following figure: 


Reproduce the figure and shade the region that corresponds to 
each of the following events. 

(a) A' (b) (AN B) U (A (a B') 

(c) (AQ B)UC (d) (BU C) 

(e) (4N B) UC 

2-21. A digital scale is used that provides weights to the 
nearest gram. 

(a) What is the sample space for this experiment? 

Let A denote the event that a weight exceeds 11 grams, let 5 
denote the event that a weight 1s less than or equal to 15 
grams, and let C denote the event that a weight is greater than 
or equal to 8 grams and less than 12 grams. 

Describe the following events. 

(b AUB (c) ANB 

(d) A' (e) AUBUC 


(f (AUC) Q A4nBnc 
h B'f1C (i) AU(BNC) 
2-22. In an injection-molding operation, the length and 


width, denoted as X and Y, respectively, of each molded part 
are evaluated. Let 


A denote the event of 48 < X < 52 centimeters 
B denote the event of 9 < Y < 11 centimeters 


C denote the event that a critical length meets customer 
requirements. 


Construct a Venn diagram that includes these events. Shade 

the areas that represent the following: 

(a) A (b ANB 

(c) A4 UB (d AUB 

(e) If these events were mutually exclusive, how successful 
would this production operation be? Would the process 
produce parts with X — 50 centimeters and Y — 10 
centimeters? 


2-25. Four bits are transmitted over a digital communica- 

tions channel. Each bit is either distorted or received without 

distortion. Let A; denote the event that the ith bit is distorted, 

i=1,...,4. 

(a) Describe the sample space for this experiment. 

(b) Are the A;'s mutually exclusive? 

Describe the outcomes in each of the following events: 

(c) Ay (d) A,’ 

(e) 41145043 44. (£) Ga 0 45) U (43 N 44) 

2-24. A sample of three calculators is selected from a manu- 

facturing line, and each calculator is classified as either defective 

or acceptable. Let 4, B, and C denote the events that the first, 

second, and third calculators respectively, are defective. 

(a) Describe the sample space for this experiment with a tree 
diagram. 

Use the tree diagram to describe each of the following 

events: 

(b) A (c) B 

(d ANB (e BUC 

2-25. A wireless garage door opener has a code determined 

by the up or down setting of 12 switches. How many out- 

comes are in the sample space of possible codes? 


2-26. Disks of polycarbonate plastic from a supplier are an- 
alyzed for scratch and shock resistance. The results from 100 
disks are summarized below: 


shock resistance 


high low 
scratch high 70 9 
resistance low 16 5 


Let A denote the event that a disk has high shock resistance, 
and let B denote the event that a disk has high scratch 


resistance. Determine the number of disks in A N B, A’, and 
A U B. 

2-27. Samples ofa cast aluminum part are classified on the 
basis of surface finish (in microinches) and edge finish. The 
results of 100 parts are summarized as follows: 


edge finish 
excellent good 
surface excellent 80 2 
finish good 10 8 


(a) Let A denote the event that a sample has excellent surface 
finish, and let B denote the event that a sample has excel- 
lent edge finish. Determine the number of samples in 
A' N B, B', and A UB. 

(b) Assume that each of two samples is to be classified on the 
basis of surface finish, either excellent or good, edge finish, 
either excellent or good. Use a tree diagram to represent the 
possible outcomes of this experiment. 

2-28. Samples of emissions from three suppliers are classi- 

fied for conformance to air-quality specifications. The results 

from 100 samples are summarized as follows: 


conforms 
yes no 
22 8 
supplier 2 29 J 
3 30 10 


Let A denote the event that a sample is from supplier 1, and let 
B denote the event that a sample conforms to specifications. 
Determine the number of samples in A' N B, B', and A U B. 


2-29. The rise time of a reactor is measured in minutes (and 
fractions of minutes). Let the sample space be positive, real 
numbers. Define the events 4 and B as follows: 

A= |x = 725} 


and 


B edxx2952.51 


2-2 INTERPRETATIONS OF PROBABILITY 


2-2.1 Introduction 
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Describe each of the following events: 

(a) A’ (b) B' 

(c) AONB (A AUB 

2-30. A sample of two items is selected without replace- 
ment from a batch. Describe the (ordered) sample space for 
each of the following batches: 

(a) The batch contains the items 1a, b, c, d]. 

(b) The batch contains the items (a, b, c, d, e, f; g}. 

(c) The batch contains 4 defective items and 20 good items. 
(d) The batch contains 1 defective item and 20 good items. 


2-31. A sample of two printed circuit boards is selected 
without replacement from a batch. Describe the (ordered) 
sample space for each of the following batches: 

(a) The batch contains 90 boards that are not defective, 8 
boards with minor defects, and 2 boards with major 
defects. 

(b) The batch contains 90 boards that are not defective, 8 
boards with minor defects, and 1 board with major 
defects. 


2-32. Counts of the Web pages provided by each of two 

computer servers in a selected hour of the day are recorded. 

Let A denote the event that at least 10 pages are provided by 

server 1 and let B denote the event that at least 20 pages are 

provided by server 2. 

(a) Describe the sample space for the numbers of pages for 
two servers graphically. 


Show each of the following events on the sample space graph: 


(b) A (c) B 
(d) ANB (e) AUB 
2-33. The rise time of a reactor is measured in minutes 


(and fractions of minutes). Let the sample space for the rise 
time of each batch be positive, real numbers. Consider 
the rise times of two batches. Let A denote the event that the 
rise time of batch 1 is less than 72.5 minutes, and let 5 
denote the event that the rise time of batch 2 is greater than 
52.5 minutes. 

Describe the sample space for the rise time of two batches 
graphically and show each of the following events on a two- 
dimensional plot: 

(a) A (b) B' 
(c) ANB (d AUB 


In this chapter, we introduce probability for discrete sample spaces—those with only a finite 
(or countably infinite) set of outcomes. The restriction to these sample spaces enables us to 
simplify the concepts and the presentation without excessive mathematics. 
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Figure 2-9 Relative 
frequency of corrupted 
pulses sent over a com- 
munication channel. 


Corrupted pulse 


Time 


Voltage 


Relative frequency of corrupted pulse = = 


Probability is used to quantify the likelihood, or chance, that an outcome of a random 
experiment will occur. “The chance of rain today is 30%” is a statement that quantifies our 
feeling about the possibility of rain. The likelihood of an outcome is quantified by assigning a 
number from the interval [0, 1] to the outcome (or a percentage from 0 to 100%). Higher num- 
bers indicate that the outcome is more likely than lower numbers. A 0 indicates an outcome 
will not occur. A probability of 1 indicates an outcome will occur with certainty. 

The probability of an outcome can be interpreted as our subjective probability, or degree 
of belief, that the outcome will occur. Different individuals will no doubt assign different 
probabilities to the same outcomes. Another interpretation of probability is based on the con- 
ceptual model of repeated replications of the random experiment. The probability of an 
outcome is interpreted as the limiting value of the proportion of times the outcome occurs in 
n repetitions of the random experiment as n increases beyond all bounds. For example, if we 
assign probability 0.2 to the outcome that there is a corrupted pulse in a digital signal, we 
might interpret this assignment as implying that, if we analyze many pulses, approximately 
20% of them will be corrupted. This example provides a relative frequency interpretation of 
probability. The proportion, or relative frequency, of replications of the experiment that result 
in the outcome is 0.2. Probabilities are chosen so that the sum of the probabilities of all out- 
comes in an experiment add up to 1. This convention facilitates the relative frequency inter- 
pretation of probability. Figure 2-9 illustrates the concept of relative frequency. 

Probabilities for a random experiment are often assigned on the basis of a reasonable 
model of the system under study. One approach is to base probability assignments on the sim- 
ple concept of equally likely outcomes. 

For example, suppose that we will select one laser diode randomly from a batch of 100. 
The sample space is the set of 100 diodes. Randomly implies that it is reasonable to assume 
that each diode in the batch has an equal chance of being selected. Because the sum of the 
probabilities must equal 1, the probability model for this experiment assigns probability of 
0.01 to each of the 100 outcomes. We can interpret the probability by imagining many repli- 
cations of the experiment. Each time we start with all 100 diodes and select one at random. 
The probability 0.01 assigned to a particular diode represents the proportion of replicates in 
which a particular diode is selected. 

When the model of equally likely outcomes is assumed, the probabilities are chosen to 
be equal. 


Whenever a sample space consists of N possible outcomes that are equally likely, the 


probability of each outcome is 1/N. 





Figure 2-10 
Probability of the 
event E is the sum of 
the probabilities of the 
outcomes in Æ. 


EXAMPLE 2-9 


Definition 


EXAMPLE 2-10 
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Diodes 





P(E) = 30(0.01) = 0.30 


It is frequently necessary to assign probabilities to events that are composed of several 
outcomes from the sample space. This is straightforward for a discrete sample space. 


Assume that 30% of the laser diodes in a batch of 100 meet the minimum power requirements 
of a specific customer. If a laser diode is selected randomly, that is, each laser diode is equally 
likely to be selected, our intuitive feeling is that the probability of meeting the customer’s 
requirements is 0.30. 

Let E denote the subset of 30 diodes that meet the customer’s requirements. Because 
E contains 30 outcomes and each outcome has probability 0.01, we conclude that the prob- 
ability of E is 0.3. The conclusion matches our intuition. Figure 2-10 illustrates this 
example. 


For a discrete sample space, the probability of an event can be defined by the reasoning 
used in the example above. 


For a discrete sample space, the probability of an event E, denoted as P(E), equals the 


sum of the probabilities of the outcomes in £. 





A random experiment can result in one of the outcomes (a, b, c, d} with probabilities 0.1, 0.3, 
0.5, and 0.1, respectively. Let A denote the event (a, b}, B the event (5, c, d}, and C the event 
{d}.Then, 

P(A) = 0.1 + 0.3 = 0.4 

P(B) = 0.3 + 0.5 + 0.1 = 0.9 

P(C) = 0.1 
Also, P(A’) = 0.6, P(B’) = 0.1, and P(C') = 0.9. Furthermore, because 4 N B = {b}, 


P(A N B) = 0.3. Because AUB = {a, b,c, d}, P(A U B) = 0.1 + 0.3 + 0.5 + 0.1 = I. 
Because A N C is the null set, P(A N C) = 0. 
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EXAMPLE 2-11 


EXAMPLE 2-12 


A visual inspection of a location on wafers from a semiconductor manufacturing process re- 
sulted in the following table: 


Number of 
Contamination 
Particles Proportion of Wafers 
0 0.40 
| 0.20 
2 0.15 
3 0.10 
4 0.05 
5 or more 0.10 


If one wafer is selected randomly from this process and the location is inspected, what is the 
probability that it contains no particles? If information were available for each wafer, we could 
define the sample space as the set of all wafers inspected and proceed as in the example with 
diodes. However, this level of detail is not needed in this case. We can consider the sample space 
to consist of the six categories that summarize the number of contamination particles on a wafer. 
Then, the event that there is no particle in the inspected location on the wafer, denoted as E, can 
be considered to be comprised of the single outcome, namely, E = {0}. Therefore, 


What is the probability that a wafer contains three or more particles in the inspected 
location? Let E denote the event that a wafer contains three or more particles in the inspected 
location. Then, E consists of the three outcomes 13, 4, 5 or more}. Therefore, 


P(E) = 0.10 + 0.05 + 0.10 = 0.25 


Suppose that a batch contains six parts with part numbers {a, b, c, d, e, f}. Suppose that two 
parts are selected without replacement. Let E denote the event that the part number of the first 
part selected is a. Then E can be written as E = (ab, ac, ad, ae, af}. The sample space can be 
enumerated. It has 30 outcomes. If each outcome is equally likely, P(E) = 5/30 = 1/6. 

Also, if E, denotes the event that the second part selected is a, E, = iba, ca, da, ea, fa} 
and with equally likely outcomes, P(E;) = 5/30 = 1/6. 


2-2.2 Axioms of Probability 


Now that the probability of an event has been defined, we can collect the assumptions that we 
have made concerning probabilities into a set of axioms that the probabilities in any random 
experiment must satisfy. The axioms ensure that the probabilities assigned in an experiment 
can be interpreted as relative frequencies and that the assignments are consistent with our 
intuitive understanding of relationships between relative frequencies. For example, if event A 
is contained in event B, we should have P(A) - P(B). The axioms do not determine 
probabilities; the probabilities are assigned based on our knowledge of the system under 
study. However, the axioms enable us to easily calculate the probabilities of some events from 
knowledge of the probabilities of other events. 


Axioms of 
Probability 
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Probability is a number that is assigned to each member of a collection of events 


from a random experiment that satisfies the following properties: 
If S is the sample space and £ is any event in a random experiment, 


(D eni 
Qe SERE SU 


(3) For two events E, and E, with E, 1 E, = © 


P(E, U Ey) = P(E) + P(E) 





The property that 0 = P(E) < 1 is equivalent to the requirement that a relative frequency 
must be between 0 and 1. The property that P(S) = 1 is a consequence of the fact that an 
outcome from the sample space occurs on every trial of an experiment. Consequently, the rel- 
ative frequency of S is 1. Property 3 implies that 1f the events E, and E, have no outcomes in 
common, the relative frequency of outcomes in E, U E, is the sum of the relative frequencies 
of the outcomes in £ and £,. 

These axioms imply the following results. The derivations are left as exercises at the end 
of this section. Now, 


and for any event E, 


For example, if the probability of the event E is 0.4, our interpretation of relative 
frequency implies that the probability of E' is 0.6. Furthermore, if the event E, is contained 


in the event EF), 


EXERCISES FOR SECTION 2-2 


2-34. Each of the possible five outcomes of a random ex- 
periment is equally likely. The sample space is (a, b, c, d, e}. 
Let A denote the event (a, b}, and let B denote the event 
1c, d, e}. Determine the following: 


(a) P(A) (b) P(B) 

(c) P(4') (d) P(4U B) 

(e) P(A N B) 

2-35. The sample space of a random experiment is (a, b, c, 


d, e} with probabilities 0.1, 0.1, 0.2, 0.4, and 0.2, respectively. 
Let A denote the event (a, b, c}, and let B denote the event 
1c, d, e}. Determine the following: 

(a) P(A) (b) P(B) 

(c) P(A’) (d) P(A UB) 

(e) P(A MB) 


P(Ei) = P(E) 


2-36. A part selected for testing is equally likely to have 

been produced on any one of six cutting tools. 

(a) What is the sample space? 

(b) What is the probability that the part is from tool 1? 

(c) What is the probability that the part is from tool 3 or 
tool 5? 

(d) What is the probability that the part is not from tool 4? 


2-37. An injection-molded part is equally likely to be ob- 

tained from any one of the eight cavities on a mold. 

(a) What is the sample space? 

(b) What is the probability a part is from cavity 1 or 2? 

(c) What is the probability that a part is neither from cavity 3 
nor 4? 
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2-38. A sample space contains 20 equally likely outcomes. 
If the probability of event A is 0.3, how many outcomes are in 
event A? 

2-39. Orders for a computer are summarized by the op- 
tional features that are requested as follows: 


proportion of orders 


no optional features 0.3 
one optional feature 0.5 
more than one optional feature 0.2 


(a) What is the probability that an order requests at least one 
optional feature? 

(b) What is the probability that an order does not request 
more than one optional feature? 


2-40. Ifthe last digit of a weight measurement is equally 
likely to be any of the digits 0 through 9, 


(a) What is the probability that the last digit is 0? 


(b) What is the probability that the last digit is greater than or 
equal to 5? 


2-41. A sample preparation for a chemical measurement is 

completed correctly by 2596 of the lab technicians, completed 

with a minor error by 70%, and completed with a major error 

by 596. 

(a) If a technician is selected randomly to complete the prepa- 
ration, what is the probability it is completed without 
error? 

(b) What is the probability that it is completed with either a 
minor or a major error? 


2-42. A credit card contains 16 digits between 0 and 9. 
However, only 100 million numbers are valid. If a number is 
entered randomly, what is the probability that it is a valid 
number? 


2-45. Suppose your vehicle is licensed in a state that issues 
license plates that consist of three digits (between 0 and 9) fol- 
lowed by three letters (between A and Z). If a license number 
is selected randomly, what is the probability that yours is the 
one selected? 


2-44. A message can follow different paths through 
servers on a network. The senders message can go to one of 
five servers for the first step, each of them can send to five 
servers at the second step, each of which can send to four 
servers at the third step, and then the message goes to the re- 
cipients server. 

(a) How many paths are possible? 

(b) If all paths are equally likely, what is the probability that a 
message passes through the first of four servers at the 
third step? 

2-45. Disks of polycarbonate plastic from a supplier are an- 

alyzed for scratch and shock resistance. The results from 100 

disks are summarized as follows: 


shock resistance 


high low 
scratch high 70 9 
resistance low 16 5 


Let A denote the event that a disk has high shock resistance, 
and let B denote the event that a disk has high scratch resist- 
ance. If a disk 1s selected at random, determine the following 
probabilities: 


(a) P(A) (b) P(B) 
() P(4) (d) P(A) 
(e) P(AU B) (f) P(A’ UB) 


2-46. Samples of a cast aluminum part are classified on the 
basis of surface finish (in microinches) and edge finish. The 
results of 100 parts are summarized as follows: 


edge finish 
excellent good 
surface excellent 80 2 
finish good 10 8 


Let A denote the event that a sample has excellent surface fin- 
ish, and let B denote the event that a sample has excellent 
length. If a part is selected at random, determine the following 
probabilities: 


(a) P(A) (b) P(B) 

(c) P(4) ^ (d) P(A) 

(e) P(AU B) (f) P(A’ UB) 

2-41. Samples of emissions from three suppliers are classi- 


fied for conformance to air-quality specifications. The results 
from 100 samples are summarized as follows: 


conforms 
yes no 
l 22 8 
supplier 2 25 5 
3 30 10 


Let A denote the event that a sample is from supplier 1, and let 
B denote the event that a sample conforms to specifications. 
If a sample is selected at random, determine the following 
probabilities: 


(a) P(A) (b) P(B) 
(c) P(4) | (d) P(A) 
(e) P(AU B) (f) P(A’ UB) 


2-48. Use the axioms of probability to show the following: 
(a) For any event E, P(E’) = 1 — P(E). 

(b) P((2) = 0 

(c) If A is contained in B, then P(A) = P(B) 
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2-3 ADDITION RULES 


EXAMPLE 2-13 


EXAMPLE 2-14 


Joint events are generated by applying basic set operations to individual events. Unions of events, 
such as A U B; intersections of events, such as A N B; and complements of events, such as A’, 
are commonly of interest. The probability of a joint event can often be determined from the prob- 
abilities of the individual events that comprise it. Basic set operations are also sometimes helpful 
in determining the probability of a joint event. In this section the focus is on unions of events. 


Table 2-1 lists the history of 940 wafers in a semiconductor manufacturing process. Suppose 
one wafer is selected at random. Let H denote the event that the wafer contains high levels of 
contamination. Then, P(H) = 358/940. 

Let C denote the event that the wafer is in the center of a sputtering tool. Then, 
P(C) = 626/940. Also, P(H M C) is the probability that the wafer is from the center of the sput- 
tering tool and contains high levels of contamination. Therefore, 


P(H N C) = 112/940 


The event H U C is the event that a wafer is from the center of the sputtering tool or 
contains high levels of contamination (or both). From the table, P(H U C) — 872/940. An 
alternative calculation of P(H U C) can be obtained as follows. The 112 wafers that comprise 
the event H f| C are included once in the calculation of P(H) and again in the calculation of 
P(C). Therefore, P(H U C) can be found to be 


P(H UC) = P(A) + P(C) — P(H N C) 
= 358/940 + 626/940 — 112/940 = 872/940 


The preceding example illustrates that the probability of A or B is interpreted as P(A U B) 
and that the following general addition rule applies. 





The wafers such as those described in Example 2-13 were further classified as either in the 
“center” or at the “edge” of the sputtering tool that was used in manufacturing, and by the 
degree of contamination. Table 2-2 shows the proportion of wafers in each category. What is 


Table 2-1 Wafers in Semiconductor Manufacturing Classified 
by Contamination and Location 


Location in Sputtering Tool 


Contamination Center Edge Total 
Low 514 68 582 
High 112 246 358 


Total 626 314 
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Table 2-2 Wafers Classified by Contamination and Location 


Number of 
Contamination 
Particles Center Edge Totals 
0 0.30 0.10 0.40 
] 0.15 0.05 0.20 
2 0.10 0.05 0.15 
3 0.06 0.04 0.10 
4 0.04 0.01 0.05 
5 or more 0.07 0.03 0.10 
Totals 0.72 0.28 1.00 


the probability that a wafer was either at the edge or that it contains four or more particles? Let 
E, denote the event that a wafer contains four or more particles, and let E; denote the event 
that a wafer 1s at the edge. 

The requested probability is P(E, U E2). Now, P(E;) = 0.15 and P(E;) = 0.28. Also, 
from the table, P(E; N E;) = 0.04 . Therefore, using Equation 2-1, we find that 


P(E, U Ey) = 0.15 + 0.28 — 0.04 = 0.39 


What 1s the probability that a wafer contains less than two particles or that it is both at the 
edge and contains more than four particles? Let EF, denote the event that a wafer contains less 
than two particles, and let E; denote the event that a wafer is both from the edge and contains 
more than four particles. The requested probability is P(E, U E>). Now, P(E;) = 0.60 and 
P(E,) = 0.03. Also, E, and E, are mutually exclusive. Consequently, there are no wafers in 
the intersection and P(E,  E,) = 0. Therefore, 


P(E, U E,) = 0.60 + 0.03 = 0.63 
Recall that two events A and B are said to be mutually exclusive if A N B = ©. Then, 


P(A N B) = 0, and the general result for the probability of A U B simplifies to the third ax- 
iom of probability. 


If A and B are mutually exclusive events, 


P(A U B) = P(A) + P(B) 





Three or More Events 
More complicated probabilities, such as P(A U B U C), can be determined by repeated use 
of Equation 2-1 and by using some basic set operations. For example, 


P(AUBUC) = PĪ(4 U B) UC] = P(A UB) + P(C) - P[(AUB) N C] 


Figure 2-11  Venn 
diagram of four mutu- 
ally exclusive events. 
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Upon expanding P(A U B) by Equation 2-1 and using the distributed rule for set opera- 
tions to simplify P|(4 U B) N C), we obtain 


P(AU BU C) = P(A) + P(B) — P(4 N B) + P(C) - P(AQ O U (B C) 
= P(A) + P(B) — P(A N B) + P(C) 
- [P(A C) + RB C) - An B CJ] 
B) — P(ANC) 
C) + P(APQ B C) 


We have developed a formula for the probability of the union of three events. Formulas can be 
developed for the probability of the union of any number of events, although the formulas 
become very complex. As a summary, for the case of three events 


P(AU BU C) = P(A) + P(B) + P(C) — P(A N B) 


— ECC) FeO T BMA BAC) Cs) 





Results for three or more events simplify considerably if the events are mutually exclu- 
sive. In general, a collection of events, E4, E>, ..., Ej, is said to be mutually exclusive if there 
is no overlap among any of them. 

The Venn diagram for several mutually exclusive events is shown in Fig. 2-11. By gener- 
alizing the reasoning for the union of two events, the following result can be obtained: 


A collection of events, FE), E>, ... , Ep 1s said to be mutually exclusive if for all pairs, 
Be ie = 


For a collection of mutually exclusive events, 


P(E, U EU ... U Ej) = P(Ej) + P(E>) +... P(E;) 





A simple example of mutually exclusive events will be used quite frequently. Let X denote the 
pH of a sample. Consider the event that X is greater than 6.5 but less than or equal to 7.8. This 
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probability is the sum of any collection of mutually exclusive events with union equal to the 


same range for X. One example is 


P(6.5 < X = 7.8) = P(6.5 < X = 7.0) + P(7.0 < X S 7.5) + P(T.5 < X x 7.8) 


Another example is 


P(6.5 < X < 7.8) = P(65 < X « 6.6) + P(6.6 < X < 7.1) 


+ P(7.1 < X = 7.4) + P(71.4 < X = 7.8) 


The best choice depends on the particular probabilities available. 


EXERCISES FOR SECTION 2-3 


2-49. If P(A) = 0.3, P(B) = 02, and P(AMB) = 0.1, 
determine the following probabilities: 

(a) P(A’) (b) P(A UB) 

(c) P(A’ N B) (d) P(A AN B') 

(e) P(AU BJ] (f) P(A’ UB) 

2-50. If A, B, and C are mutually exclusive events with 


P(A) = 0.2, P(B) = 0.3, and P(C) = 04, determine the fol- 
lowing probabilities: 


(a) PAUBUC) | (b) P490 Bh C) 


(c) P(A NB) (d) P(AUB)NC] 
(e) P(A' ( B' NC’) 
2-51. If A, B, and C are mutually exclusive events, is it pos- 


sible for P(A) = 0.3, P(B) = 0.4, and P(C) = 0.5? Why or 
why not? 

2-52. Disks of polycarbonate plastic from a supplier are an- 
alyzed for scratch and shock resistance. The results from 100 
disks are summarized as follows: 


shock resistance 


high low 
scratch high 70 9 
resistance low 16 5 


(a) If a disk is selected at random, what is the probability that 
its scratch resistance is high and its shock resistance is 
high? 

(b) If a disk is selected at random, what is the probability 
that its scratch resistance 1s high or its shock resistance 
is high? 

(c) Consider the event that a disk has high scratch resistance 
and the event that a disk has high shock resistance. Are 
these two events mutually exclusive? 


2-53. The analysis of shafts for a compressor is summarized 
by conformance to specifications. 


roundness conforms 


yes no 
surface finish yes 345 5 
conforms no 12 8 


(a) If a shaft is selected at random, what is the probability that 
the shaft conforms to surface finish requirements? 

(b) What is the probability that the selected shaft conforms 
to surface finish requirements or to roundness require- 
ments? 

(c) What is the probability that the selected shaft either con- 
forms to surface finish requirements or does not conform 
to roundness requirements? 

(d) What is the probability that the selected shaft conforms to 
both surface finish and roundness requirements? 


2-54. Cooking oil is produced in two main varieties: mono- 
and polyunsaturated. Two common sources of cooking oil are 
corn and canola. The following table shows the number of 
bottles of these oils at a supermarket: 


type of oil 
canola corn 
type of mono 7 15 
unsaturation poly 93 TI 


(a) If a bottle of oil is selected at random, what is the proba- 
bility that it belongs to the polyunsaturated category? 

(b) What is the probability that the chosen bottle is monoun- 
saturated canola oil? 

2-55. A manufacturer of front lights for automobiles tests 

lamps under a high humidity, high temperature environment 

using intensity and useful life as the responses of interest. The 

following table shows the performance of 130 lamps: 


useful life 
satisfactory unsatisfactory 
intensity satisfactory 117 3 
unsatisfactory 8 2 
(a) Find the probability that a randomly selected lamp will 
yield unsatisfactory results under any criteria. 


(b) The customers for these lamps demand 95% satisfactory 
results. Can the lamp manufacturer meet this demand? 


2-56. The shafts in Exercise 2-53 are further classified in terms 
of the machine tool that was used for manufacturing the shaft. 


Tool 1 


surface finish 


conforms 


Tool 2 


surface finish 


conforms 


yes 200 
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(a) If a shaft is selected at random, what is the probability that 
roundness conforms the shaft conforms to surface finish requirements or to 
ea roundness requirements or is from Tool 1? 

(b) Ifa shaft is selected at random, what is the probability that 
the shaft conforms to surface finish requirements or does 

H9 4 2 not conform to roundness requirements or is from Tool 2? 

(c) Ifa shaft is selected at random, what is the probability that 
the shaft conforms to both surface finish and roundness 
requirements or the shaft is from Tool 2? 


yes no 


roundness conforms 


ye ng (d) Ifa shaft is selected at random, what is the probability that 
yes 145 4 the shaft conforms to surface finish requirements or the 
no 8 6 shaft is from Tool 2? 
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Figure 2-12 
Conditional probabili- 
ties for parts with 
surface flaws. 


A digital communication channel has an error rate of one bit per every thousand transmitted. 
Errors are rare, but when they occur, they tend to occur in bursts that affect many consecutive 
bits. If a single bit is transmitted, we might model the probability of an error as 1/1000. 
However, if the previous bit was in error, because of the bursts, we might believe that the 
probability that the next bit is in error is greater than 1/1000. 

In a thin film manufacturing process, the proportion of parts that are not acceptable is 2%. 
However, the process is sensitive to contamination problems that can increase the rate of parts 
that are not acceptable. If we knew that during a particular shift there were problems with the 
filters used to control contamination, we would assess the probability of a part being unac- 
ceptable as higher than 2%. 

In a manufacturing process, 10% of the parts contain visible surface flaws and 25% of the 
parts with surface flaws are (functionally) defective parts. However, only 5% of parts without 
surface flaws are defective parts. The probability of a defective part depends on our knowl- 
edge of the presence or absence of a surface flaw. 

These examples illustrate that probabilities need to be reevaluated as additional informa- 
tion becomes available. The notation and details are further illustrated for this example. 

Let D denote the event that a part is defective and let F denote the event that a part has a 
surface flaw. Then, we denote the probability of D given, or assuming, that a part has a sur- 
face flaw as P(D|F). This notation is read as the conditional probability of D given F and it 
is interpreted as the probability that a part 1s defective, given that the part has a surface flaw. 
Because 25% of the parts with surface flaws are defective, our conclusion can be stated as 
P(D|F) = 0.25. Furthermore, because F’ denotes the event that a part does not have a surface 
flaw and because 5% of the parts without surface flaws are defective, we have that 
P(D|F') = 0.05. These results are shown graphically in Fig. 2-12. 





P(D|F) = 0.25 
2596 > 5% defective 
defective { P(D|F') = 0.05 


US —_ 
F = parts with F’ = parts without 
Surface flaws Surface flaws 
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EXAMPLE 2-16 


Definition 


Table 2-3 Parts Classified 


Surface Flaws 


Yes (event F) No Total 
Defective Yes (event D) 10 18 38 
No 30 342 362 


Total 40 360 400 


Table 2-3 provides an example of 400 parts classified by surface flaws and as (functionally) 
defective. For this table the conditional probabilities match those discussed previously in this 
section. For example, of the parts with surface flaws (40 parts) the number defective is 10. 
Therefore, 


P(D|F) = 10/40 = 0.25 
and of the parts without surface flaws (360 parts) the number defective is 18. Therefore, 
P(D|F') = 18/360 = 0.05 


In Example 2-16 conditional probabilities were calculated directly. These probabilities 
can also be determined from the formal definition of conditional probability. 


The conditional probability of an event B given an event A, denoted as P(B |A), is 


JUD = EAA (2-5) 


for P(A) > 0. 





This definition can be understood in a special case in which all outcomes of a random exper- 
iment are equally likely. If there are n total outcomes, 


P(A) = (number of outcomes in A)/n 
Also, 
P(A B) = (number of outcomes in A N B)/n 
Consequently, 


P(A P ByP(A) = number of outcomes in A N B 
number of outcomes in A 


Figure 2-13 Tree 
diagram for parts 
classified 
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Surface flaw 


360 40 
4oo N° Yes 700 
Defective — o oie SSS M "ON 
342 18 30 10 
360 nf Yes 260 0 No Yes 40 
e o e e 


Therefore, P(B | 4) can be interpreted as the relative frequency of event B among the trials that 
produce an outcome in event A. 


Again consider the 400 parts in Table 2-3. From this table 


10 /40 10 
POE) OL) UE pee eni 


Note that in this example all four of the following probabilities are different: 


P(F) — 40/400 
P(D) — 28/400 


P(F|D) = 10/28 
P(D|F) — 10/40 


Here, P(D) and P(D|F) are probabilities of the same event, but they are computed under two 
different states of knowledge. Similarly, P(F) and P(F|D) are computed under two different 
states of knowledge. 

The tree diagram in Fig. 2-13 can also be used to display conditional probabilities. The 
first branch is on surface flaw. Of the 40 parts with surface flaws, 10 are functionally defec- 
tive and 30 are not. Therefore, 


P(D|F)- 10/40 and P(D'|F)- 30/40 


Of the 360 parts without surface flaws, 18 are functionally defective and 342 are not. Therefore, 


P(D|F’) = 342/360 and . P(D'|F')- 18/360 

Random Samples from a Batch 

Recall that to select one item randomly from a batch implies that each item is equally likely. 
If more than one item is selected, randomly implies that each element of the sample space is 
equally likely. For example, when sample spaces were presented earlier in this chapter, sam- 
pling with and without replacement were defined and illustrated for the simple case of a batch 
with three items (a, b, cj. If two items are selected randomly from this batch without replace- 
ment, each of the six outcomes in the ordered sample space 


Swithout = (ab, ac, ba, bc, ca, cb} 


has probability 1/6. If the unordered sample space is used, each of the three outcomes in 
{{a, b}, (a, c), (5, c}} has probability 1/3. 


40 
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What is the conditional probability that b is selected second given that a is selected 
first? Because this question considers the results of each pick, the ordered sample space is 
used. The definition of conditional probability is applied as follows. Let E, denote the 
event that the first item selected is a and let E; denote the event that the second item se- 
lected is b. Then, 


E, = (ab, ac} and ŒE, = {ab, cb} and ŒE NE, = {ab} 
and from the definition of conditional probability 


1/6 
P(E,|Ej) = P(E, 0 E>)/P(E,) = B zm 1/2 


When the sample space is larger, an alternative calculation is usually more convenient. 
For example, suppose that a batch contains 10 parts from tool 1 and 40 parts from tool 2. If 
two parts are selected randomly, without replacement, what 1s the conditional probability that 
a part from tool 2 is selected second given that a part from tool 1 1s selected first? There are 
50 possible parts to select first and 49 to select second. Therefore, the (ordered) sample space 
has 50 X 49 = 2450 outcomes. Let E, denote the event that the first part is from tool 1 and £, 
denote the event that the second part is from tool 2. As above, a count of the number of out- 
comes in £; and the intersection is needed. 

Although the answer can be determined from this start, this type of question can be 
answered more easily with the following result. 


To select randomly implies that at each step of the sample, the items that remain in 


the batch are equally likely to be selected. 





If a part from tool 1 were selected with the first pick, 49 items would remain, 9 from tool 1 and 
40 from tool 2, and they would be equally likely to be picked. Therefore, the probability that 
a part from tool 2 would be selected with the second pick given this first pick 1s 


P(E,|E,) = 40/49. 


In this manner, other probabilities can also be simplified. For example, let the event £ 
consist of the outcomes with the first selected part from tool 1 and the second part from tool 2. 
To determine the probability of E, consider each step. The probability that a part from tool 1 
is selected with the first pick is P(E) = 10/50. The conditional probability that a part from 
tool 2 is selected with the second pick, given that a part from tool 1 is selected first is 
P(E;|E,) = 40/49. Therefore, 


40 10 


P(E) = P(E,| E,)P(E;) = 49° 50 


= 0.163 


EXAMPLE 2-18 


EXAMPLE 2-19 
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Sometimes a partition of the question into successive picks is an easier method to solve the 
problem. 


A. day's production of 850 manufactured parts contains 50 parts that do not meet customer 
requirements. Two parts are selected randomly without replacement from the batch. What is 
the probability that the second part is defective given that the first part 1s defective? 

Let A denote the event that the first part selected 1s defective, and let B denote the event 
that the second part selected is defective. The probability needed can be expressed as 
P(B |A). If the first part is defective, prior to selecting the second part, the batch contains 849 
parts, of which 49 are defective, therefore 


P(B|A) = 49/849 


Continuing the previous example, if three parts are selected at random, what is the probability 
that the first two are defective and the third is not defective? This event can be described in 
shorthand notation as simply P(ddn). We have 


50 49 800 
dde a ee sd d 
(ddn) = $50 849 ` 848 


The third term is obtained as follows. After the first two parts are selected, there are 848 
remaining. Of the remaining parts, 800 are not defective. In this example, it is easy to obtain 
the solution with a conditional probability for each selection. 


EXERCISES FOR SECTION 2-4 


2-57. Disks of polycarbonate plastic from a supplier are an- 


alyzed for scratch and shock resistance. The results from 100 
disks are summarized as follows: 


shock resistance 


high low 
scratch high 70 9 
resistance low 16 5 


Let A denote the event that a disk has high shock resistance, 
and let B denote the event that a disk has high scratch resist- 
ance. Determine the following probabilities: 


(a) P(A) (b) P(B) 
(c) P(A|B) (d) P(B|A) 
2-58. Samples of a cast aluminum part are classified 


on the basis of surface finish (in microinches) and length 
measurements. The results of 100 parts are summarized as 
follows: 


length 
excellent good 
surface excellent 80 2 
finish good 10 8 


Let A denote the event that a sample has excellent surface fin- 

ish, and let B denote the event that a sample has excellent 

length. Determine: 

(a) P(4) ^ (b) P(B) 

(c) P(A|B) (d) P(B|A) 

(e) Ifthe selected part has excellent surface finish, what is the 
probability that the length is excellent? 

(f) If the selected part has good length, what is the probability 
that the surface finish 1s excellent? 


2-59. The analysis of shafts for a compressor is summarized 
by conformance to specifications: 


roundness conforms 


yes no 
surface finish yes 345 5 
conforms no 12 8 


(a) If we know that a shaft conforms to roundness require- 
ments, what is the probability that it conforms to surface 
finish requirements? 

(b) If we know that a shaft does not conform to roundness 
requirements, what is the probability that it conforms to 
surface finish requirements? 
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2-60. The following table summarizes the analysis of samples 
of galvanized steel for coating weight and surface roughness: 


coating weight 


high low 
surface high 12 16 
roughness low 88 34 


(a) If the coating weight of a sample is high, what is the prob- 
ability that the surface roughness is high? 

(b) If the surface roughness of a sample is high, what is the 
probability that the coating weight is high? 

(c) If the surface roughness of a sample is low, what is the 
probability that the coating weight is low? 


2-61. Consider the data on wafer contamination and loca- 
tion in the sputtering tool shown in Table 2-2. Assume that one 
wafer is selected at random from this set. Let A denote the 
event that a wafer contains four or more particles, and let 5 
denote the event that a wafer is from the center of the sputter- 
ing tool. Determine: 


(a) P(A) (b) P(4|B) 
(c) P(B) (d) P(B|A) 
(e) PAN B) (f) P(AU B) 


2-62. A lot of 100 semiconductor chips contains 20 that are 

defective. Two are selected randomly, without replacement, 

from the lot. 

(a) What is the probability that the first one selected is defec- 
tive? 

(b) What is the probability that the second one selected is 
defective given that the first one was defective? 

(c) What is the probability that both are defective? 

(d) How does the answer to part (b) change if chips selected 
were replaced prior to the next selection? 


2-63. A lot contains 15 castings from a local supplier and 25 
castings from a supplier in the next state. Two castings are 
selected randomly, without replacement, from the lot of 40. 
Let A be the event that the first casting selected is from the 
local supplier, and let B denote the event that the second cast- 
ing is selected from the local supplier. Determine: 

(a) P(A) (b) P(B|A) 

(c) P(ANB) (d) P(AU B) 

2-64. Continuation of Exercise 2-63. Suppose three cast- 
ings are selected at random, without replacement, from the lot 


of 40. In addition to the definitions of events 4A and B, let C 

denote the event that the third casting selected is from the 

local supplier. Determine: 

(a P(ACQBf c) 

(b) PANBNC') 

2-65. <A batch of 500 containers for frozen orange juice con- 

tains 5 that are defective. Two are selected, at random, without 

replacement from the batch. 

(a) What 1s the probability that the second one selected 1s 
defective given that the first one was defective? 

(b) What is the probability that both are defective? 

(c) What is the probability that both are acceptable? 


2-66. Continuation of Exercise 2-65. Three containers are 

selected, at random, without replacement, from the batch. 

(a) What is the probability that the third one selected is defec- 
tive given that the first and second one selected were 
defective? 

(b) What is the probability that the third one selected is 
defective given that the first one selected was defective 
and the second one selected was okay? 

(c) What is the probability that all three are defective? 


2-67. A maintenance firm has gathered the following infor- 
mation regarding the failure mechanisms for air conditioning 
systems: 


evidence of gas leaks 


yes no 
evidence of yes 33 17 
electrical failure no 32 3 


The units without evidence of gas leaks or electrical failure 

showed other types of failure. If this is a representative sample 

of AC failure, find the probability 

(a) That failure involves a gas leak 

(b) That there is evidence of electrical failure given that there 
was a gas leak 

(c) That there is evidence of a gas leak given that there is 
evidence of electrical failure 

2-68. IfP(4|B) = 1, must A = B? Draw a Venn diagram to 

explain your answer. 

2-69. Suppose A and B are mutually exclusive events. 


Construct a Venn diagram that contains the three events A, B, 
and C such that P(4| C) = 1 and P(B|C) = 0? 


2-5 MULTIPLICATION AND TOTAL PROBABILITY RULES 


2-5.1 Multiplication Rule 


The definition of conditional probability in Equation 2-5 can be rewritten to provide a general 
expression for the probability of the intersection of two events. This formula is referred to as 
a multiplication rule for probabilities. 


Multiplication Rule 


EXAMPLE 2-20 
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P(A (à B) = P(B|A)P(A) = P(A|B)P(B) (2-6) 


The last expression in Equation 2-6 is obtained by interchanging A and B. 


The probability that an automobile battery subject to high engine compartment temperature 
suffers low charging current is 0.7. The probability that a battery is subject to high engine 
compartment temperature is 0.05. 

Let C denote the event that a battery suffers low charging current, and let T denote the 
event that a battery 1s subject to high engine compartment temperature. The probability that a 
battery is subject to low charging current and high engine compartment temperature is 


P(COT) = P(C|T)P(T) = 0.7 X 0.05 = 0.035 


2-5.2 Total Probability Rule 


The multiplication rule is useful for determining the probability of an event that depends on 
other events. For example, suppose that in semiconductor manufacturing the probability is 
0.10 that a chip that is subjected to high levels of contamination during manufacturing causes 
a product failure. The probability is 0.005 that a chip that is not subjected to high contamina- 
tion levels during manufacturing causes a product failure. In a particular production run, 20% 
of the chips are subject to high levels of contamination. What is the probability that a product 
using one of these chips fails? 

Clearly, the requested probability depends on whether or not the chip was exposed to high 
levels of contamination. We can solve this problem by the following reasoning. For any event 
B, we can write B as the union of the part of B in A and the part of B in A’. That is, 


B — (A B)U (4' NB) 


This result is shown in the Venn diagram in Fig. 2-14. Because A and A’ are mutually exclu- 
sive, A N B and A’ N B are mutually exclusive. Therefore, from the probability of the union 
of mutually exclusive events in Equation 2-2 and the Multiplication Rule in Equation 2-6, the 
following total probability rule is obtained. 


Figure 2-14 Partitioning B = (B ^ E}) o (BO Ed) o (BO E3) o (BO Ej) 
an event into two mutually 


Figure 2-15 Partitioning an event into 
exclusive subsets. 


several mutually exclusive subsets. 
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Total Probability 
Rule (two events) 


EXAMPLE 2-21 


Total Probability 
Rule (multiple 
events) 


EXAMPLE 2-22 


For any events A and B, 


I E = Je EAD Ge A) = PUP Aya) R n VR) (QUY; 





Consider the contamination discussion at the start of this section. Let F denote the event 
that the product fails, and let H denote the event that the chip is exposed to high levels of 
contamination. The requested probability is P(F), and the information provided can be rep- 
resented as 


P(F|H) - 0.10 and P(F|A’) = 0.005 
P(H) = 0.20 and P(H') = 0.80 
From Equation 2-7, 
P(F) = 0.10(0.20) + 0.005(0.80) = 0.0235 
which can be interpreted as just the weighted average of the two probabilities of failure. 

The reasoning used to develop Equation 2-7 can be applied more generally. In the devel- 
opment of Equation 2-7, we only used the two mutually exclusive A and A’. However, the fact 
that A U A’ = S, the entire sample space, was important. In general, a collection of sets 
Ei, E2, ... , Ej, such that E, U E U ... UE, = S is said to be exhaustive. A graphical dis- 


play of partitioning an event B among a collection of mutually exclusive and exhaustive 
events is shown in Fig. 2-15 on page 43. 


Assume £j, E>, ..., E, are k mutually exclusive and exhaustive sets. Then 


P(B) = P(B'Y E) + (BNE) +--+ P(B ES 
= P(B|E)P(E:) + P(BLES)P(E;) + --- + P(BIEP(E, (2-8) 





Continuing with the semiconductor manufacturing example, assume the following probabili- 
ties for product failure subject to levels of contamination in manufacturing: 


Probability of Failure Level of Contamination 
0.10 High 
0.01 Medium 


0.001 Low 
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0.20 


Contamination 
e 


0.30 


Medium Low 


LIN, LN, 


P(Fail| High) P(Not Fail| High) P(Fail|Medium) P(Not Fail| Medium)  P(Fail|Low) = P(Not Fail| Low) 
= 0.10 = 0.90 = 0.01 = 0.99 = 0.001 = 0.999 
e e \ e e. 
0.10(0.20) 0.90(0.20) | 0.01(0.30) 0.99(0.30) | 0.001(0.50) 0.999(0.50) 
= 0.02 20.18 - 0.003 20,297 - 0.0005 - 0.4995 


Figure 2-16 Tree 
diagram for 
Example 2-22. 


P(Fail) = 0.02 + 0.003 + 0.0005 = 0.0235 


In a particular production run, 20% of the chips are subjected to high levels of contami- 
nation, 30% to medium levels of contamination, and 50% to low levels of contamination. 
What is the probability that a product using one of these chips fails? Let 


H denote the event that a chip is exposed to high levels of contamination 


M denote the event that a chip is exposed to medium levels of contamination 


L denote the event that a chip is exposed to low levels of contamination 


Then, 


P(F) = P(F| H)P(H) + P(F|M)P(M) + P(F|L)P(L) 
= 0.10(0.20) + 0.01(0.30) + 0.001(0.50) = 0.0235 


This problem is also conveniently solved using the tree diagram in Fig. 2-16. 


EXERCISES FOR SECTION 2-5 


2-70. Suppose that P(A|B)=0.4 and P(B) = 0.5. 
Determine the following: 

(a) PAM B) 

(b) P(A’ MB) 


2-71. Suppose that P(A|B) = 0.2, P(A|B') = 0.3, and 
P(B) = 0.8. What is P(A)? 

2-72. The probability is 1% that an electrical connector that 
is kept dry fails during the warranty period of a portable com- 
puter. If the connector is ever wet, the probability of a failure 
during the warranty period is 5%. If 90% of the connectors are 
kept dry and 10% are wet, what proportion of connectors fail 
during the warranty period? 


2-73. Suppose 2% of cotton fabric rolls and 3% of nylon 
fabric rolls contain flaws. Of the rolls used by a manufacturer, 
70% are cotton and 30% are nylon. What is the probability 
that a randomly selected roll used by the manufacturer con- 
tains flaws? 


2-74. In the manufacturing of a chemical adhesive, 3% of 
all batches have raw materials from two different lots. This 
occurs when holding tanks are replenished and the remaining 
portion of a lot is insufficient to fill the tanks. 

Only 5% of batches with material from a single lot require 
reprocessing. However, the viscosity of batches consisting of 
two or more lots of material is more difficult to control, and 
40% of such batches require additional processing to achieve 
the required viscosity. 

Let A denote the event that a batch is formed from 
two different lots, and let B denote the event that a lot 
requires additional processing. Determine the following 
probabilities: 


(a) P(A) (b) P(A’) 

(c) P(B|A) (d) P(B|A") 
(e) (ANB) (f) P(ANB’) 
(g) P(B) 
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2-75. The edge roughness of slit paper products increases as 
knife blades wear. Only 1% of products slit with new blades 
have rough edges, 3% of products slit with blades of average 
sharpness exhibit roughness, and 5% of products slit with 
worn blades exhibit roughness. If 25% of the blades in manu- 
facturing are new, 60% are of average sharpness, and 15% are 
worn, what is the proportion of products that exhibit edge 
roughness? 


2-76. Samples of laboratory glass are in small, light pack- 
aging or heavy, large packaging. Suppose that 2 and 1% of 
the sample shipped in small and large packages, respec- 
tively, break during transit. If 60% of the samples are 
shipped in large packages and 40% are shipped in small 
packages, what proportion of samples break during 
shipment? 
2-77. Incoming calls to a customer service center are classi- 
fied as complaints (75% of call) or requests for information 
(25% of calls). Of the complaints, 40% deal with computer 
equipment that does not respond and 57% deal with 
incomplete software installation; and in the remaining 3% of 
complaints the user has improperly followed the installation 
instructions. The requests for information are evenly divided 
on technical questions (50%) and requests to purchase more 
products (50%). 
(a) What is the probability that an incoming call to the cus- 
tomer service center will be from a customer who has not 
followed installation instructions properly? 
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(b) Find the probability that an incoming call is a request for 
purchasing more products. 


2-78. Computer keyboard failures are due to faulty electri- 
cal connects (12%) or mechanical defects (88%). Mechanical 
defects are related to loose keys (27%) or improper assembly 
(73%). Electrical connect defects are caused by defective 
wires (35%), improper connections (13%), or poorly welded 
wires (52%). 

(a) Find the probability that a failure is due to loose keys. 

(b) Find the probability that a failure is due to improperly 

connected or poorly welded wires. 


2-79. A batch of 25 injection-molded parts contains 5 that 

have suffered excessive shrinkage. 

(a) If two parts are selected at random, and without replace- 
ment, what is the probability that the second part selected 
is one with excessive shrinkage? 

(b) If three parts are selected at random, and without replace- 
ment, what is the probability that the third part selected 1s 
one with excessive shrinkage? 

2-80. A lot of 100 semiconductor chips contains 20 that are 

defective. 

(a) Two are selected, at random, without replacement, from 
the lot. Determine the probability that the second chip se- 
lected is defective. 

(b) Three are selected, at random, without replacement, 
from the lot. Determine the probability that all are 
defective. 


In some cases, the conditional probability of P(B|A) might equal P(B). In this special case, 
knowledge that the outcome of the experiment is in event A does not affect the probability that 


the outcome is in event B. 


EXAMPLE 2-23 


Suppose a day's production of 850 manufactured parts contains 50 parts that do not meet 


customer requirements. Suppose two parts are selected from the batch, but the first part is 
replaced before the second part is selected. What is the probability that the second part is 
defective (denoted as B) given that the first part is defective (denoted as A)? The probability 


needed can be expressed as P(B |A). 


Because the first part 1s replaced prior to selecting the second part, the batch still contains 
850 parts, of which 50 are defective. Therefore, the probability of B does not depend on 
whether or not the first part was defective. That is, 


P(B|A) = 50/850 


Also, the probability that both parts are defective is 


P(A B) = P(B|A)P(A) = ( = . ( = = 0.0035 


850/ \850 
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Table 2-4 Parts Classified 


Surface Flaws 


Yes (event F) No Total 
Defective Yes (event D) 2 18 20 
No 38 342 380 
Total 40 360 400 


EXAMPLE 2-24 The information in Table 2-3 related surface flaws to functionally defective parts. In that case, 
we determined that P(D|F) = 10/40 = 0.25 and P(D) = 28/400 = 0.07. Suppose that the 
situation 1s different and follows Table 2-4. Then, 

P(D|F) = 2/40 = 0.05 and P(D) = 20/400 = 0.05 


That is, the probability that the part is defective does not depend on whether it has surface 
flaws. Also, 


P(F|D) = 2/20 = 0.10 and  P(F) = 40/400 = 0.10 


so the probability of a surface flaw does not depend on whether the part is defective. 
Furthermore, the definition of conditional probability implies that 


P(F A D) = P(D|F)P(F) 
but in the special case of this problem 


2 2 l 
P(F (1D) = P(D)P(F) ~ 40 20 +200 


The preceding example illustrates the following conclusions. In the special case that 
P(B|A) = P(B), we obtain 


P(A ( B) = P(B|A)P(A) = P(B)P(A) 
and 


AN A 
P(A|B) = s = i — P(A) 


These conclusions lead to an important definition. 


Definition 
Two events are independent if any one of the following equivalent statements is true: 


(1) P(A|B) = P(A) 


(2) P(B|A) = P(B) 
(3) P(AMB) = P(A)P(B) 
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EXAMPLE 2-25 


Definition 


EXAMPLE 2-26 





It is left as a mind-expanding exercise to show that independence implies related results 
such as 


P(A' 1B’) = P(A')P(B’). 


The concept of independence is an important relationship between events and is used 
throughout this text. A mutually exclusive relationship between two events is based only on 
the outcomes that comprise the events. However, an independence relationship depends on the 
probability model used for the random experiment. Often, independence is assumed to be part 
of the random experiment that describes the physical system under study. 


A day’s production of 850 manufactured parts contains 50 parts that do not meet customer 
requirements. Two parts are selected at random, without replacement, from the batch. Let A 
denote the event that the first part 1s defective, and let B denote the event that the second part 
is defective. 

We suspect that these two events are not independent because knowledge that the first 
part is defective suggests that it is less likely that the second part selected is defective. Indeed, 
P(B|A) — 49/849. Now, what is P(B)? Finding the unconditional P(B) is somewhat difficult 
because the possible values of the first selection need to be considered: 


P(B) = P(B|A)P(A) + P(B|A')P(4') 
= (49/849)(50/850) + (50/849)(800/850) 
= 50/850 


Interestingly, P(B), the unconditional probability that the second part selected is defec- 
tive, without any knowledge of the first part, is the same as the probability that the first part 
selected is defective. Yet, our goal is to assess independence. Because P(B|A) does not equal 
P(B), the two events are not independent, as we suspected. 


When considering three or more events, we can extend the definition of independence 
with the following general result. 


The events Ej, E;,..., E, are independent if and only if for any subset of these 
events E; , E; 


15? eee 


(2-10) 


This definition is typically used to calculate the probability that several events occur assuming 
that they are independent and the individual event probabilities are known. The knowledge 
that the events are independent usually comes from a fundamental understanding of the ran- 
dom experiment. 


Assume that the probability that a wafer contains a large particle of contamination is 0.01 and 
that the wafers are independent; that is, the probability that a wafer contains a large particle is 


EXAMPLE 2-27 


EXAMPLE 2-28 
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not dependent on the characteristics of any of the other wafers. If 15 wafers are analyzed, what 
is the probability that no large particles are found? 

Let E; denote the event that the ith wafer contains no large particles, i = 1,2, ..., 15. 
Then, P(E;) = 0.99. The probability requested can be represented as P(E, N E; (Y +++ (Y Ej5). 
From the independence assumption and Equation 2-10, 


P(E OQ E, +++ N ES) = P(E) X P(E) X = X P(E;5) = 0.999? = 0.86 
The following circuit operates only if there is a path of functional devices from left to right. 


The probability that each device functions is shown on the graph. Assume that devices fail 
independently. What is the probability that the circuit operates? 





Let T and B denote the events that the top and bottom devices operate, respectively. There 
is a path if at least one device operates. The probability that the circuit operates is 


P(Tor B) = 1 — Pl(Tor B) | = 1 — P(T' and B") 


a simple formula for the solution can be derived from the complements 7” and B’. From the 
independence assumption, 


P(T' and B’) = P(T')P(B') = (1 — 0.95* = 0.057 
SO 
P(T or B) = 1 — 0.05* = 0.9975 
The following circuit operates only if there is a path of functional devices from left to right. 


The probability that each device functions 1s shown on the graph. Assume that devices fail 
independently. What is the probability that the circuit operates? 
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The solution can be obtained from a partition of the graph into three columns. 
The probability that there is a path of functional devices only through the three units on the 
left can be determined from the independence in a manner similar to the previous example. 


It is 


[esq 


Similarly, the probability that there is a path of functional devices only through the two units 


in the middle is 


1 — 0.057 


The probability that there is a path of functional devices only through the one unit on the right 
is simply the probability that the device functions, namely, 0.99. Therefore, with the inde- 
pendence assumption used again, the solution is 


(1 — 0.1°)(1 — 0.057)(0.99) = 0.987 


EXERCISES FOR SECTION 2-6 


2-81. If P(A|B) = 0.4, P(B) = 0.8, and P(A) = 0.5, are 
the events A and B independent? 

2-82. If P(A|B) = 0.3, P(B) = 0.8, and P(A) = 0.3, are 
the events B and the complement of A independent? 


2-83. Disks of polycarbonate plastic from a supplier are an- 
alyzed for scratch and shock resistance. The results from 100 
disks are summarized as follows: 


shock resistance 


high low 
scratch high 70 9 
resistance low 16 3 


Let A denote the event that a disk has high shock resistance, 
and let B denote the event that a disk has high scratch resist- 
ance. Are events A and B independent? 

2-84. Samples of a cast aluminum part are classified on the 
basis of surface finish (in microinches) and length measure- 
ments. The results of 100 parts are summarized as follows: 


length 
excellent good 
surface excellent 80 2 
finish good 10 8 


Let A denote the event that a sample has excellent surface fin- 
ish, and let B denote the event that a sample has excellent 
length. Are events A and B independent? 

2-85. Samples of emissions from three suppliers are classi- 
fied for conformance to air-quality specifications. The results 
from 100 samples are summarized as follows: 


conforms 
yes no 
22 8 
supplier 2 ps 5 
3 30 10 


Let A denote the event that a sample is from supplier 1, and let 
B denote the event that a sample conforms to specifications. 
(a) Are events A and B independent? 

(b) Determine P(B|A). 

2-86. If P(A) = 0.2, P(B) = 0.2, and A and B are mutually 
exclusive, are they independent? 


2-87. The probability that a lab specimen contains high lev- 

els of contamination 1s 0.10. Five samples are checked, and 

the samples are independent. 

(a) What is the probability that none contains high levels of 
contamination? 

(b) What is the probability that exactly one contains high 
levels of contamination? 

(c) What 1s the probability that at least one contains high 
levels of contamination? 


2-88. Inatest of a printed circuit board using a random test 

pattern, an array of 10 bits is equally likely to be 0 or 1. 

Assume the bits are independent. 

(a) What is the probability that all bits are 1s? 

(b) What is the probability that all bits are Os? 

(c) What is the probability that exactly five bits are 1s and five 
bits are 0s? 


2-89. Eight cavities in an injection-molding tool produce 
plastic connectors that fall into a common stream. A sample is 


chosen every several minutes. Assume that the samples are 

independent. 

(a) What 1s the probability that five successive samples were 
all produced in cavity one of the mold? 

(b) What is the probability that five successive samples were 
all produced in the same cavity of the mold? 

(c) What is the probability that four out of five successive 
samples were produced in cavity one of the mold? 


2-90. The following circuit operates if and only if there 1s a 
path of functional devices from left to right. The probability 
that each device functions is as shown. Assume that the prob- 
ability that a device is functional does not depend on whether 
or not other devices are functional. What is the probability that 
the circuit operates? 





2-91. The following circuit operates if and only if there 1s a 
path of functional devices from left to right. The probability 
each device functions is as shown. Assume that the probabil- 
ity that a device functions does not depend on whether or not 
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other devices are functional. What is the probability that the 
circuit operates? 





2-92. An optical storage device uses an error recovery proce- 
dure that requires an immediate satisfactory readback of any 
written data. If the readback 1s not successful after three writing 
operations, that sector of the disk 1s eliminated as unacceptable 
for data storage. On an acceptable portion of the disk, the proba- 
bility of a satisfactory readback is 0.98. Assume the readbacks 
are independent. What is the probability that an acceptable por- 
tion of the disk is eliminated as unacceptable for data storage? 


2-93. A batch of 500 containers for frozen orange juice con- 
tains 5 that are defective. Two are selected, at random, without 
replacement, from the batch. Let A and B denote the events 
that the first and second container selected is defective, re- 
spectively. 

(a) Are A and B independent events? 

(b) If the sampling were done with replacement, would 4 and 

B be independent? 


In some examples, we do not have a complete table of information such as the parts in Table 
2-3. We might know one conditional probability but would like to calculate a different one. In 
the semiconductor contamination problem in Example 2-22, we might ask the following: If 
the semiconductor chip in the product fails, what is the probability that the chip was exposed 


to high levels of contamination? 


From the definition of conditional probability, 


P(A N B) = P(A|B)P(B) = P(B A A) = P(B|A)P(A) 


Now considering the second and last terms in the expression above, we can write 








This is a useful result that enables us to solve for P(A |B) in terms of P(B| A). 
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EXAMPLE 2-29 


Bayes’ 
Theorem 


EXAMPLE 2-30 


We can answer the question posed at the start of this section as follows: The probability 
requested can be expressed as P(H | F). Then, 


P(F|H)P(H)  0.10(0.20) 
P(H|F) = (OP) — = 00235 — 0.85 


The value of P(F) in the denominator of our solution was found in Example 2-20. 
In general, if P(B) in the denominator of Equation 2-11 is written using the Total 


Probability Rule in Equation 2-8, we obtain the following general result, which is known as 
Bayes' Theorem. 


If Ej, E», ..., E, are k mutually exclusive and exhaustive events and B is any 
event, 


P(B|E)P(E) 


P(E |B) = SIEPE) + FBIE)PUS) * — + FBIESPUES 


for P(B) > 0 





Because a new medical procedure has been shown to be effective in the early detection of an 
illness, a medical screening of the population is proposed. The probability that the test cor- 
rectly identifies someone with the illness as positive is 0.99, and the probability that the test 
correctly identifies someone without the illness as negative is 0.95. The incidence of the 
illness in the general population is 0.0001. You take the test, and the result is positive. What is 
the probability that you have the illness? 

Let D denote the event that you have the illness, and let S denote the event that the test 
signals positive. The probability requested can be denoted as P(D |S). The probability that the 
test correctly signals someone without the ilIness as negative is 0.95. Consequently, the prob- 
ability of a positive test without the illness is 


P(S|D') — 0.05 
From Bayes' Theorem, 


P(D|S) = P(S|D)P(D)/[P(S|D)P(D) + P(S| D')P(D")] 
= 0.99(0.0001)/[0.99(0.0001) + 0.05(1 — 0.0001) ] 
= 1/506 = 0.002 


Surprisingly, even though the test is effective, in the sense that P(S|D) is high and 
P(S|D') is low, because the incidence of the illness in the general population is low, the 
chances are quite small that you actually have the disease even if the test is positive. 


EXERCISES FOR SECTION 2-7 


2-94. Suppose that P(A|B) — 0.7, P(A)= 0.5, and 
P(B) = 0.2. Determine P(B| A). 

2-95. Software to detect fraud in consumer phone cards 
tracks the number of metropolitan areas where calls origi- 
nate each day. It is found that 1% of the legitimate users 
originate calls from two or more metropolitan areas in a 
single day. However, 3096 of fraudulent users originate 
calls from two or more metropolitan areas in a single day. 
The proportion of fraudulent users is 0.01%. If the 
same user originates calls from two or more metropolitan 
areas in a single day, what is the probability that the user is 
fraudulent? 


2-96. Semiconductor lasers used in optical storage products 
require higher power levels for write operations than for read 
operations. High-power-level operations lower the useful life 
of the laser. 

Lasers in products used for backup of higher speed mag- 
netic disks primarily write, and the probability that the useful 
life exceeds five years is 0.95. Lasers that are in products that 
are used for main storage spend approximately an equal 
amount of time reading and writing, and the probability that 
the useful life exceeds five years is 0.995. Now, 25% of the 
products from a manufacturer are used for backup and 75% of 
the products are used for main storage. 

Let A denote the event that a laser's useful life exceeds five 
years, and let B denote the event that a laser is in a product that 
is used for backup. 


Use a tree diagram to determine the following: 


(a) P(B) (b) P(A|B) 
(c) P(A|B’) (d) P(A (B) 
(e) PANE) ®© PCA) 


(g) What is the probability that the useful life of a laser 
exceeds five years? 

(h) What is the probability that a laser that failed before five 
years came from a product used for backup? 

2-97. Customers are used to evaluate preliminary product 

designs. In the past, 95% of highly successful products 

received good reviews, 60% of moderately successful prod- 
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ucts received good reviews, and 1096 of poor products 

received good reviews. In addition, 4096 of products have 

been highly successful, 35% have been moderately 

successful, and 25% have been poor products. 

(a) What is the probability that a product attains a good 
review? 

(b) If a new design attains a good review, what is the proba- 
bility that it will be a highly successful product? 

(c) If a product does not attain a good review, what is the 
probability that it will be a highly successful product? 


2-98. An inspector working for a manufacturing company 

has a 99% chance of correctly identifying defective items and 

a 0.5% chance of incorrectly classifying a good item as defec- 

tive. The company has evidence that its line produces 0.9% of 

nonconforming items. 

(a) What is the probability that an item selected for inspection 
is classified as defective? 

(b) If an item selected at random is classified as nondefective, 
what is the probability that it is indeed good? 


2-99. A new analytical method to detect pollutants in water 
is being tested. This new method of chemical analysis is im- 
portant because, if adopted, it could be used to detect three dif- 
ferent contaminants—organic pollutants, volatile solvents, 
and chlorinated compounds—instead of having to use a single 
test for each pollutant. The makers of the test claim that it can 
detect high levels of organic pollutants with 99.7% accuracy, 
volatile solvents with 99.95% accuracy, and chlorinated com- 
pounds with 89.7% accuracy. If a pollutant is not present, the 
test does not signal. Samples are prepared for the calibration 
of the test and 60% of them are contaminated with organic 
pollutants, 27% with volatile solvents, and 13% with traces of 
chlorinated compounds. 
A test sample is selected randomly. 
(a) What is the probability that the test will signal? 
(b) If the test signals, what is the probability that chlori- 
nated compounds are present? 


We often summarize the outcome from a random experiment by a simple number. In many 
of the examples of random experiments that we have considered, the sample space has 
been a description of possible outcomes. In some cases, descriptions of outcomes are suf- 
ficient, but in other cases, it 1s useful to associate a number with each outcome in the sam- 
ple space. Because the particular outcome of the experiment is not known in advance, the 
resulting value of our variable is not known in advance. For this reason, the variable that 
associates a number with the outcome of a random experiment is referred to as a random 


variable. 
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Definition 
A random variable is a function that assigns a real number to each outcome in the 
sample space of a random experiment. 

A random variable is denoted by an uppercase letter such as X. After an experi- 


ment is conducted, the measured value of the random variable is denoted by a low- 


ercase letter such as x = 70 milliamperes. 





Sometimes a measurement (such as current in a copper wire or length of a machined part) 
can assume any value in an interval of real numbers (at least theoretically). Then arbitrary pre- 
cision in the measurement is possible. Of course, in practice, we might round off to the nearest 
tenth or hundredth of a unit. The random variable that represents this measurement is said to 
be a continuous random variable. The range of the random variable includes all values in an 
interval of real numbers; that is, the range can be thought of as a continuum. 

In other experiments, we might record a count such as the number of transmitted bits that 
are received in error. Then the measurement is limited to integers. Or we might record that a 
proportion such as 0.0042 of the 10,000 transmitted bits were received in error. Then the 
measurement is fractional, but it is still limited to discrete points on the real line. Whenever 
the measurement is limited to discrete points on the real line, the random variable is said to be 
a discrete random variable. 


Definition 


A discrete random variable is a random variable with a finite (or countably infinite) 
range. 


A continuous random variable is a random variable with an interval (either finite or 
infinite) of real numbers for its range. 





In some cases, the random variable X is actually discrete but, because the range of possible 
values is so large, it might be more convenient to analyze X as a continuous random variable. For 
example, suppose that current measurements are read from a digital instrument that displays the 
current to the nearest one-hundredth of a milliampere. Because the possible measurements are 
limited, the random variable is discrete. However, it might be a more convenient, simple approx- 
imation to assume that the current measurements are values of a continuous random variable. 


Examples of 


saan Examples of continuous random variables: 
Variables electrical current, length, pressure, temperature, time, voltage, weight 


Examples of discrete random variables: 
number of scratches on a surface, proportion of defective parts among 1000 
tested, number of transmitted bits received in error. 





EXERCISES FOR SECTION 2-8 


2-100. Decide whether a discrete or continuous random (b) The number of times a transistor in a computer memory 


variable is the best model for each of the following vari- changes state in one operation. 
ables: (c) The volume of gasoline that is lost to evaporation during 


(a) The time until a projectile returns to earth. the filling of a gas tank. 


(d) The outside diameter of a machined shaft. 

(e) The number of cracks exceeding one-half inch in 10 miles 
of an interstate highway. 

(f) The weight of an injection-molded plastic part. 

(g) The number of molecules in a sample of gas. 

(h) The concentration of output from a reactor. 

(1) The current in an electronic circuit. 


Supplemental Exercises 


2-101. In circuit testing of printed circuit boards, each 
board either fails or does not fail the test. A board that fails the 
test is then checked further to determine which one of five de- 
fect types is the primary failure mode. Represent the sample 
space for this experiment. 


2-102. The data from 200 machined parts are summarized 
as follows: 


depth of bore 
above below 
edge condition target target 
coarse 15 10 
moderate 25 20 
smooth 50 80 


(a) What is the probability that a part selected has a moderate 
edge condition and a below-target bore depth? 

(b) What is the probability that a part selected has a moderate 
edge condition or a below-target bore depth? 

(c) What is the probability that a part selected does not have a 
moderate edge condition or does not have a below-target 
bore depth? 

(d) Construct a Venn diagram representation of the events in 
this sample space. 


2-103. Computers in a shipment of 100 units contain a 
portable hard drive, CD RW drive, or both according to the 
following table: 


portable hard drive 


yes no 
CD RW 
yes 15 80 
no 4 1 


Let A denote the events that a computer has a portable hard 
drive and let B denote the event that a computer has a CD RW 
drive. If one computer is selected randomly, compute 


(a) P(A) (b) P(A4 N B) 
(c) P(AU B) (d) P(4' NB) 
(e) P(A|B) 


2-104. The probability that a customer's order is not 
shipped on time is 0.05. A particular customer places three 
orders, and the orders are placed far enough apart in time that 
they can be considered to be independent events. 
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(a) What is the probability that all are shipped on time? 

(b) What is the probability that exactly one is not shipped on 
time? 

(c) What is the probability that two or more orders are not 
shipped on time? 


2-105. Let E, E;, and E, denote the samples that conform 
to a percentage of solids specification, a molecular weight 
specification, and a color specification, respectively. A total of 
240 samples are classified by the E, E», and E, specifications, 
where yes indicates that the sample conforms. 


E, yes 
E» 
yes no Total 
E, yes 200 l 201 
no 5 4 9 
Total 205 5 210 
E, no 
E» 
yes no 
E, yes 20 4 24 
no 6 0 6 
Total 26 4 30 


(a) Are E,, E,, and E4 mutually exclusive events? 

(b) Are E',, E^, and E’, mutually exclusive events? 

(c) What is P(E", or E^ or E)? 

(d) What is the probability that a sample conforms to all three 
specifications? 

(e) What is the probability that a sample conforms to the £, or 
E, specification? 

(f) What is the probability that a sample conforms to the E, or 
E, or E; specification? 

2-106. Transactions to a computer database are either new 

items or changes to previous items. The addition of an item can 

be completed less than 100 milliseconds 90% of the time, but 

only 2096 of changes to a previous item can be completed in 

less than this time. If 30% of transactions are changes, what is 

the probability that a transaction can be completed in less than 

100 milliseconds? 


2-107. A steel plate contains 20 bolts. Assume that 5 bolts 

are not torqued to the proper limit. Four bolts are selected at 

random, without replacement, to be checked for torque. 

(a) What is the probability that all four of the selected bolts 
are torqued to the proper limit? 

(b) What is the probability that at least one of the selected 
bolts is not torqued to the proper limit? 

2-108. The following circuit operates if and only if there is 

a path of functional devices from left to right. Assume devices 

fail independently and that the probability of failure of each 
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device is as shown. What is the probability that the circuit 
operates? 





2-109. The probability of getting through by telephone to 
buy concert tickets is 0.92. For the same event, the probability 
of accessing the vendor’s Web site is 0.95. Assume that these 
two ways to buy tickets are independent. What is the proba- 
bility that someone who tries to buy tickets through the 
Internet and by phone will obtain tickets? 


2-110. The British government has stepped up its information 
campaign regarding foot and mouth disease by mailing 
brochures to farmers around the country. It is estimated that 99% 
of Scottish farmers who receive the brochure possess enough in- 
formation to deal with an outbreak of the disease, but only 90% 
of those without the brochure can deal with an outbreak. After 
the first three months of mailing, 95% of the farmers in 
Scotland received the informative brochure. Compute the prob- 
ability that a randomly selected farmer will have enough infor- 
mation to deal effectively with an outbreak of the disease. 


2-111. In an automated filling operation, the probability of 
an incorrect fill when the process is operated at a low speed is 
0.001. When the process is operated at a high speed, the prob- 
ability of an incorrect fill is 0.01. Assume that 30% of the 
containers are filled when the process is operated at a high 
speed and the remainder are filled when the process is 
operated at a low speed. 

(a) What is the probability of an incorrectly filled container? 
(b) If an incorrectly filled container is found, what is the proba- 

bility that it was filled during the high-speed operation? 


2-112. An encryption-decryption system consists of three 

elements: encode, transmit, and decode. A faulty encode 

occurs in 0.5% of the messages processed, transmission errors 

occur in 1% of the messages, and a decode error occurs in 

0.1% of the messages. Assume the errors are independent. 

(a) What is the probability of a completely defect-free 
message? 

(b) What 1s the probability of a message that has either an 
encode or a decode error? 


2-113. Itis known that two defective copies of a commercial 

software program were erroneously sent to a shipping lot that 

has now a total of 75 copies of the program. A sample of copies 
will be selected from the lot without replacement. 

(a) If three copies of the software are inspected, determine the 
probability that exactly one of the defective copies will be 
found. 

(b) If three copies of the software are inspected, determine the 
probability that both defective copies will be found. 


(c) If 73 copies are inspected, determine the probability that 
both copies will be found. Hint: Work with the copies that 
remain in the lot. 


2-114. A robotic insertion tool contains 10 primary compo- 
nents. The probability that any component fails during the 
warranty period is 0.01. Assume that the components fail 
independently and that the tool fails 1f any component fails. 
What is the probability that the tool fails during the warranty 
period? 

2-115. An e-mail message can travel through one of two 
server routes. The probability of transmission error in each of 
the servers and the proportion of messages that travel each 
route are shown in the following table. Assume that the 
servers are independent. 


probability of error 


percentage 

of messages server] server2 server3 server4 
route 1 30 0.01 0.015 
route2 70 0.02 0.003 


(a) What is the probability that a message will arrive without 
error? 

(b) If a message arrives in error, what is the probability it was 
sent through route 1? 


2-116. A machine tool is idle 15% of the time. You request 
immediate use of the tool on five different occasions during 
the year. Assume that your requests represent independent 
events. 


(a) What is the probability that the tool is idle at the time of all 
of your requests? 

(b) What is the probability that the machine is idle at the time 
of exactly four of your requests? 

(c) What is the probability that the tool is idle at the time of at 
least three of your requests? 


2-117. A lot of 50 spacing washers contains 30 washers that 

are thicker than the target dimension. Suppose that three wash- 

ers are selected at random, without replacement, from the lot. 

(a) What is the probability that all three washers are thicker 
than the target? 

(b) What is the probability that the third washer selected 1s 
thicker than the target if the first two washers selected are 
thinner than the target? 

(c) What is the probability that the third washer selected 1s 
thicker than the target? 


2-118. Continuation of Exercise 2-117. Washers are se- 

lected from the lot at random, without replacement. 

(a) What is the minimum number of washers that need to be 
selected so that the probability that all the washers are 
thinner than the target is less than 0.10? 

(b) What is the minimum number of washers that need to be 
selected so that the probability that one or more washers 
are thicker than the target is at least 0.90? 


2-119. The following table lists the history of 940 orders for 
features in an entry-level computer product. 


extra memory 


no yes 
optional high- no 514 68 
speed processor yes 112 246 


Let A be the event that an order requests the optional high- 
speed processor, and let B be the event that an order requests 
extra memory. Determine the following probabilities: 

(a) P(A4UB) (b) P(A(C1 B) 

(c) P(A'UB) (d) P(A’ B") 

(e) What is the probability that an order requests an optional 
high-speed processor given that the order requests extra 
memory? 

(f) What is the probability that an order requests extra mem- 
ory given that the order requests an optional high-speed 
processor? 


2-120. The alignment between the magnetic tape and head 
in a magnetic tape storage system affects the performance of 
the system. Suppose that 10% of the read operations are de- 
graded by skewed alignments, 5% of the read operations are 
degraded by off-center alignments, and the remaining read op- 
erations are properly aligned. The probability of a read error is 
0.01 from a skewed alignment, 0.02 from an off-center align- 
ment, and 0.001 from a proper alignment. 

(a) What is the probability of a read error? 

(b) If a read error occurs, what is the probability that it is due 

to a skewed alignment? 

2-121. The following circuit operates if and only if there is 
a path of functional devices from left to right. Assume that de- 
vices fail independently and that the probability of failure of 
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each device is as shown. What is the probability that the 
circuit does not operate? 





2-122. A company that tracks the use of its web site deter- 
mined that the more pages a visitor views, the more likely the 
visitor is to provide contact information. Use the following ta- 
bles to answer the questions: 


Number of 


pages viewed: l 2 3 4 or more 


Percentage of 
visitors: 40 30 20 10 


Percentage of visitors 
in each page-view 
catgory that provide 
contact information: 10 10 20 40 


(a) What is the probability that a visitor to the web site 
provides contact information? 

(b) If a visitor provides contact information, what is the 
probability that the visitor viewed four or more pages? 
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MIND-EXPANDING EXERCISES 


2-123. The alignment between the magnetic tape and 
head in a magnetic tape storage system affects the per- 
formance of the system. Suppose that 10% of the read 
operations are degraded by skewed alignments, 5% by 
off-center alignments, 1% by both skewness and off- 
center, and the remaining read operations are properly 
aligned. The probability of a read error is 0.01 from a 
skewed alignment, 0.02 from an off-center alignment, 
0.06 from both conditions, and 0.001 from a proper 


alignment. What is the probability of a read error. 


2-124. Suppose that a lot of washers is large enough 
that it can be assumed that the sampling is done with re- 
placement. Assume that 60% of the washers exceed the 


target thickness. 


(a) What is the minimum number of washers that need 
to be selected so that the probability that all the 
washers are thinner than the target is less than 0.10? 

(b) What is the minimum number of washers that need to 
be selected so that the probability that one or more 
washers are thicker than the target is at least 0.90? 


2-125. A biotechnology manufacturing firm can pro- 
duce diagnostic test kits at a cost of $20. Each kit for 
which there is a demand in the week of production can be 
sold for $100. However, the half-life of components in 
the kit requires the kit to be scrapped if it is not sold in 
the week of production. The cost of scrapping the kit is 


$5. The weekly demand is summarized as follows: 
weekly demand 


Number of 

units 0 50 100 
Probability of 

demand 0.05 0.4 0.3 


IMPORTANT TERMS AND CONCEPTS 


In the E-book, click on any Event 

term or concept below to Independence 

go to that subject. Multiplication rule 
Addition rule Mutually exclusive 
Axioms of probability events 
Bayes' theorem Outcome 
Conditional probability 
Equally likely outcomes 


Random experiment 


How many kits should be produced each week to maxi- 
mize the mean earnings of the firm? 


2-126. Assume the following characteristics of the 
inspection process in Exercise 2-107. If an operator 
checks a bolt, the probability that an incorrectly 
torqued bolt is identified is 0.95. If a checked bolt is 
correctly torqued, the operator’s conclusion is always 
correct. What is the probability that at least one bolt in 
the sample of four is identified as being incorrectly 
torqued? 

2-127. If the events A and B are independent, show 
that A’ and B’ are independent. 


2-128. Suppose that a table of part counts is generalized 
as follows: 


conforms 
yes no 
supplier Í ka kb 
2 a b 


where a, b, and k are positive integers. Let A denote the 
event that a part is from supplier 1 and let B denote the 
event that a part conforms to specifications. Show that 
A and B are independent events. 

This exercise illustrates the result that whenever the 
rows of a table (with r rows and c columns) are propor- 
tional, an event defined by a row category and an event 
defined by a column category are independent. 


Random variables CD MATERIAL 
discrete and 
: Permutation 
continuous 
Combination 


Sample spaces—discrete 
and continuous 

Total probability rule 

With or without 
replacement 


2-1.4 Counting Techniques (CD Only) 


Multiplication 
Rule (for 
counting 

techniques) 


EXAMPLE 82-1 


In many of the examples in Chapter 2, it is easy to determine the number of outcomes in each 
event. In more complicated examples, determining the number of outcomes that comprise the 
sample space (or an event) becomes more difficult. To associate probabilities with events, it is 
important to know the number of outcomes both in an event and in the sample space. Some 
simple rules can be used to simplify the calculations. 

In Example 2-4, an automobile manufacturer provides vehicles equipped with selected 
options. Each vehicle is ordered 


With or without an automatic transmission 
With or without air conditioning 

With one of three choices of a stereo system 
With one of four exterior colors 


The tree diagram in Fig. 2-6 describes the sample space of all possible vehicle types. The size 
of the sample space equals the number of branches in the last level of the tree and this quantity 
equals 2 X 2 X 3 X 4 — 48. This leads to the following useful result. 


If an operation can be described as a sequence of k steps, and 


if the number of ways of completing step 1 is nį, and 


if the number of ways of completing step 2 is n, for each way of completing 
step 1, and 


if the number of ways of completing step 3 1s 74 for each way of completing 
step 2, and so forth, 


the total number of ways of completing the operation is 


IDE c 





In the design of a casing for a gear housing, we can use four different types of fasteners, 
three different bolt lengths, and three different bolt locations. From the multiplication rule, 
4 X 3 X 3 = 36 different designs are possible. 


Permutations 

Another useful calculation is the number of ordered sequences of the elements of a set. 
Consider a set of elements, such as S = (a, b, c}. A permutation of the elements is an ordered 
sequence of the elements. For example, abc, acb, bac, bca, cab, and cba are all of the permu- 
tations of the elements of S. 


The number of permutations of n different elements is n! where 


jm eX 2 xcu uo 
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EXAMPLE 82-2 


EXAMPLE 82-3 








This result follows from the multiplication rule. A permutation can be constructed by select- 
ing the element to be placed in the first position of the sequence from the » elements, then 
selecting the element for the second position from the n — 1 remaining elements, then select- 
ing the element for the third position from the remaining n — 2 elements, and so forth. 
Permutations such as these are sometimes referred to as linear permutations. 

In some situations, we are interested in the number of arrangements of only some of the 
elements of a set. The following result also follows from the multiplication rule. 


The number of permutations of a subset of r elements selected from a set of n differ- 
ent elements is 


m cocer (S2-2) 


n! 
(ul 


A printed circuit board has eight different locations in which a component can be placed. If four 
different components are to be placed on the board, how many different designs are possible? 

Each design consists of selecting a location from the eight locations for the first compo- 
nent, a location from the remaining seven for the second component, a location from the re- 
maining six for the third component, and a location from the remaining five for the fourth 
component. Therefore, 


8 
Pas qe Rc dein A 1680 different designs are possible. 


Sometimes we are interested in counting the number of ordered sequences for objects that 
are not all different. The following result is a useful, general calculation. 


The number of permutations of n = n, + m t =- + n, objects of which n, are of 
one type, n, are of a second type, ... , and n, are of an rth type is 


e (S2-3) 


ni no! n4! ... n,! 


Consider a machining operation in which a piece of sheet metal needs two identical diameter 
holes drilled and two identical size notches cut. We denote a drilling operation as d and a 
notching operation as n. In determining a schedule for a machine shop, we might be interested 
in the number of different possible sequences of the four operations. The number of possible 
sequences for two drilling operations and two notching operations is 


4! 


331 


The six sequences are easily summarized: ddnn, dndn, dnnd, nddn, ndnd, nndd. 


EXAMPLE 82-4 


EXAMPLE 82-5 


EXAMPLE 82-6 
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A part is labeled by printing with four thick lines, three medium lines, and two thin lines. If 
each ordering of the nine lines represents a different label, how many different labels can be 
generated by using this scheme? 

From Equation S2-3, the number of possible part labels is 


9! 
41 31 2! 





— 2520 


Combinations 
Another counting problem of interest is the number of subsets of r elements that can be se- 
lected from a set of n elements. Here, order is not important. Every subset of r elements can 
be indicated by listing the elements in the set and marking each element with a “*” if it is to 
be included in the subset. Therefore, each permutation of r *'s and n — r blanks indicate a dif- 
ferent subset and the number of these are obtained from Equation S2-3. 

For example, if the set is S — (a, b, c, d} the subset (a, c} can be indicated as 


a b c d 


x * 


The number of subsets of size r that can be selected from a set of n elements is 
denoted as (7) or C7 and 





(S2-4) 


A printed circuit board has eight different locations in which a component can be placed. If 
five identical components are to be placed on the board, how many different designs are pos- 
sible? 

Each design is a subset of the eight locations that are to contain the components. From 
Equation S2-4, the number of possible designs is 


8! 
5131 
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The following example uses the multiplication rule in combination with Equation S2-4 to an- 
swer a more difficult, but common, question. 


A bin of 50 manufactured parts contains three defective parts and 47 nondefective parts. A 
sample of six parts 1s selected from the 50 parts. Selected parts are not replaced. That 1s, each 
part can only be selected once and the sample is a subset of the 50 parts. How many different 
samples are there of size six that contain exactly two defective parts? 

A subset containing exactly two defective parts can be formed by first choosing the 
two defective parts from the three defective parts. Using Equation S2-4, this step can be 
completed in 





IA 91 — 4 diff 
3 ort uferent ways 
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Then, the second step 1s to select the remaining four parts from the 47 acceptable parts in the 
bin. The second step can be completed in 


47! 





47 l 
(2) = 343 ^ 178,365 different ways 


Therefore, from the multiplication rule, the number of subsets of size six that contain exactly 


two defective items is 


3 X 178,365 = 535,095 


As an additional computation, the total number of different subsets of size six 1s found 


to be 


| 


50 


— 30! _ 15,890,700 
| 6144) 0 ^" 





When probability is discussed in this chapter, the probability of an event is determined as 
the ratio of the number of outcomes in the event to the number of outcomes in the sample 
space (for equally likely outcomes). Therefore, the probability that a sample contains exactly 


two defective parts 1s 


15,890,700 


535,095 
— 0.034 


Note that Example S2-7 illustrates the hypergeometric distribution. 


EXERCISES FOR SECTION 2-1.4 


S2-1. An order for a personal digital assistant can specify 
any one of five memory sizes, any one of three types of dis- 
plays, any one of four sizes of a hard disk, and can either in- 
clude or not include a pen tablet. How many different systems 
can be ordered? 


S2-2. In a manufacturing operation, a part is produced by 
machining, polishing, and painting. If there are three machine 
tools, four polishing tools, and three painting tools, how many 
different routings (consisting of machining, followed by pol- 
ishing, and followed by painting) for a part are possible? 
S2-3. New designs for a wastewater treatment tank have 
proposed three possible shapes, four possible sizes, three loca- 
tions for input valves, and four locations for output valves. 
How many different product designs are possible? 

S2-4. A manufacturing process consists of 10 operations 
that can be completed in any order. How many different pro- 
duction sequences are possible? 

82-5. A manufacturing operations consists of 10 opera- 
tions. However, five machining operations must be com- 
pleted before any of the remaining five assembly operations 


can begin. Within each set of five, operations can be com- 
pleted in any order. How many different production se- 
quences are possible? 


82-6. In a sheet metal operation, three notches and four 

bends are required. If the operations can be done in any order, 

how many different ways of completing the manufacturing are 

possible? 

S2-7. A lot of 140 semiconductor chips is inspected by 

choosing a sample of five chips. Assume 10 ofthe chips do not 

conform to customer requirements. 

(a) How many different samples are possible? 

(b) How many samples of five contain exactly one noncon- 
forming chip? 

(c) How many samples of five contain at least one noncon- 
forming chip? 

S2-8. In the layout of a printed circuit board for an elec- 

tronic product, there are 12 different locations that can accom- 

modate chips. 

(a) If five different types of chips are to be placed on the 
board, how many different layouts are possible? 


(b) If the five chips that are placed on the board are of the 
same type, how many different layouts are possible? 


S2-9. Inthe laboratory analysis of samples from a chemical 
process, five samples from the process are analyzed daily. In 
addition, a control sample is analyzed two times each day to 
check the calibration of the laboratory instruments. 

(a) How many different sequences of process and control 
samples are possible each day? Assume that the five 
process samples are considered identical and that the two 
control samples are considered identical. 

(b) How many different sequences of process and control sam- 
ples are possible if we consider the five process samples to 
be different and the two control samples to be identical. 

(c) For the same situation as part (b), how many sequences 
are possible if the first test of each day must be a control 
sample? 

S2-10. Inthe design of an electromechanical product, seven 

different components are to be stacked into a cylindrical cas- 

ing that holds 12 components in a manner that minimizes the 
impact of shocks. One end of the casing is designated as the 
bottom and the other end is the top. 

(a) How many different designs are possible? 

(b) If the seven components are all identical, how many dif- 
ferent designs are possible? 

(c) If the seven components consist of three of one type of 
component and four of another type, how many different 
designs are possible? (more difficult) 


S2-11. The design of a communication system considered 

the following questions: 

(a) How many three-digit phone prefixes that are used to rep- 
resent a particular geographic area (such as an area code) 
can be created from the digits 0 through 9? 

(b) As in part (a), how many three-digit phone prefixes are 
possible that do not start with 0 or 1, but contain 0 or 1 as 
the middle digit? 

(c) How many three-digit phone prefixes are possible in 
which no digit appears more than once in each prefix? 
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S2-12. A byte is a sequence of eight bits and each bit is ei- 

ther 0 or 1. 

(a) How many different bytes are possible? 

(b) If the first bit of a byte is a parity check, that is, the first 
byte is determined from the other seven bits, how many 
different bytes are possible? 


S2-13. Ina chemical plant, 24 holding tanks are used for fi- 
nal product storage. Four tanks are selected at random and 
without replacement. Suppose that six of the tanks contain 
material in which the viscosity exceeds the customer require- 
ments. 

(a) What is the probability that exactly one tank in the sample 
contains high viscosity material? 

(b) What is the probability that at least one tank in the sample 
contains high viscosity material? 

(c) In addition to the six tanks with high viscosity levels, four 
different tanks contain material with high impurities. 
What is the probability that exactly one tank in the sample 
contains high viscosity material and exactly one tank in 
the sample contains material with high impurities? 


S2-14. Plastic parts produced by an injection-molding oper- 

ation are checked for conformance to specifications. Each tool 

contains 12 cavities in which parts are produced, and these 

parts fall into a conveyor when the press opens. An inspector 

chooses 3 parts from among the 12 at random. Two cavities 

are affected by a temperature malfunction that results in parts 

that do not conform to specifications. 

(a) What is the probability that the inspector finds exactly one 
nonconforming part? 

(b) What is the probability that the inspector finds at least one 
nonconforming part? 


S2-15. A bin of 50 parts contains five that are defective. A 

sample of two 1s selected at random, without replacement. 

(a) Determine the probability that both parts in the sample are 
defective by computing a conditional probability. 

(b) Determine the answer to part (a) by using the subset ap- 
proach that was described in this section. 


Discrete Random 


Variables and 
Probability 


Distributions 


CHAPTER OUTLINE 
3-1 DISCRETE RANDOM VARIABLES 3-6 BINOMIAL DISTRIBUTION 
3-2 PROBABILITY DISTRIBUTIONS 3-7 GEOMETRIC AND NEGATIVE 
AND PROBABILITY MASS BINOMIAL DISTRIBUTIONS 
FUNCTIONS 3-7.1 Geometric Distribution 
3-3 CUMULATIVE DISTRIBUTION 3-7.2 Negative Binomial Distribution 
FUNCTIONS 
3-8 HYPERGEOMETRIC DISTRIBUTION 
3-4 MEAN AND VARIANCE OF A 
DISCRETE RANDOM VARIABLE 3-9 POISSON DISTRIBUTION 
3.5 DISCRETE UNIFORM 
DISTRIBUTION 
LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 
1. Determine probabilities from probability mass functions and the reverse 


2. Determine probabilities from cumulative distribution functions and cumulative distribution func- 
tions from probability mass functions, and the reverse 


3. Calculate means and variances for discrete random variables 

4. Understand the assumptions for each of the discrete probability distributions presented 

5. Select an appropriate discrete probability distribution to calculate probabilities in specific 
applications 


6. Calculate probabilities, determine means and variances for each of the discrete probability 
distributions presented 


Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e- Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for the text sections that appear on CD only. These exercises may be found within the e-Text 
immediately following the section they accompany. 
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60 CHAPTER3 DISCRETE RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS 


3-1 DISCRETE RANDOM VARIABLES 


EXAMPLE 3-1 


EXAMPLE 3-2 


EXAMPLE 3-3 


Many physical systems can be modeled by the same or similar random experiments and ran- 
dom variables. The distribution of the random variables involved in each of these common 
systems can be analyzed, and the results of that analysis can be used in different applications 
and examples. In this chapter, we present the analysis of several random experiments and 
discrete random variables that frequently arise in applications. We often omit a discussion of 
the underlying sample space of the random experiment and directly describe the distribution 
of a particular random variable. 


A voice communication system for a business contains 48 external lines. At a particular time, 
the system is observed, and some of the lines are being used. Let the random variable X denote 
the number of lines in use. Then, X can assume any of the integer values 0 through 48. When 
the system is observed, if 10 lines are in use, x = 10. 


In a semiconductor manufacturing process, two wafers from a lot are tested. Each wafer is 
classified as pass or fail. Assume that the probability that a wafer passes the test 1s 0.8 and that 
wafers are independent. The sample space for the experiment and associated probabilities are 
shown in Table 3-1. For example, because of the independence, the probability of the outcome 
that the first wafer tested passes and the second wafer tested fails, denoted as pf, 1s 


P(pf) = 0.8(0.2) = 0.16 


The random variable X is defined to be equal to the number of wafers that pass. The 
last column of the table shows the values of X that are assigned to each outcome in the 
experiment. 


Define the random variable X to be the number of contamination particles on a wafer in semi- 
conductor manufacturing. Although wafers possess a number of characteristics, the random 
variable X summarizes the wafer only in terms of the number of particles. 

The possible values of X are integers from zero up to some large value that represents the 
maximum number of particles that can be found on one of the wafers. If this maximum num- 
ber is very large, we might simply assume that the range of X 1s the set of integers from zero 
to infinity. 


Note that more than one random variable can be defined on a sample space. In Example 
3-3, we might define the random variable Y to be the number of chips from a wafer that fail 
the final test. 


Table 3-1 Wafer Tests 


Outcome 
Wafer 1 Wafer 2 Probability x 
Pass Pass 0.64 2 
Fail Pass 0.16 1 
Pass Fail 0.16 1 
Fail Fail 0.04 0 
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EXERCISES FOR SECTION 3-1 


For each of the following exercises, determine the range (pos- 
sible values) of the random variable. 


3-1. The random variable is the number of nonconforming 
solder connections on a printed circuit board with 1000 con- 
nections. 


3-2. Ina voice communication system with 50 lines, the ran- 
dom variable is the number of lines in use at a particular time. 


3-9. An electronic scale that displays weights to the nearest 
pound is used to weigh packages. The display shows only five 
digits. Any weight greater than the display can indicate is 
shown as 99999, The random variable is the displayed weight. 


3-4. A batch of 500 machined parts contains 10 that do not 
conform to customer requirements. The random variable is the 
number of parts in a sample of 5 parts that do not conform to 
customer requirements. 

3-5. A batch of 500 machined parts contains 10 that do not 
conform to customer requirements. Parts are selected succes- 
sively, without replacement, until a nonconforming part is 
obtained. The random variable is the number of parts selected. 


3-2 PROBABILITY DISTRIBUTIONS AND 


PROBABILITY MASS FUNCTIONS 


3-6. The random variable is the moisture content of a lot of 
raw material, measured to the nearest percentage point. 


3-7. The random variable is the number of surface flaws in 
a large coil of galvanized steel. 


3-8. The random variable is the number of computer clock 
cycles required to complete a selected arithmetic calculation. 


3-9. An order for an automobile can select the base model or 
add any number of 15 options. The random variable is the 
number of options selected in an order. 


3-10. Wood paneling can be ordered in thicknesses of 1/8, 
1/4, or 3/8 inch. The random variable is the total thickness of 
paneling in two orders. 


3-11. A group of 10,000 people are tested for a gene 
called Ifi202 that has been found to increase the risk for lupus. 
The random variable is the number of people who carry the 
gene. 


3-12. A software program has 5000 lines of code. The ran- 
dom variable is the number of lines with a fatal error. 


Random variables are so important in random experiments that sometimes we essentially ig- 
nore the original sample space of the experiment and focus on the probability distribution of 
the random variable. For example, in Example 3-1, our analysis might focus exclusively on 
the integers (0, 1,..., 48}! in the range of X. In Example 3-2, we might summarize the ran- 
dom experiment in terms of the three possible values of X, namely 10, 1, 2}. In this manner, a 
random variable can simplify the description and analysis of a random experiment. 

The probability distribution of a random variable X is a description of the probabilities 
associated with the possible values of X. For a discrete random variable, the distribution is 
often specified by just a list of the possible values along with the probability of each. In some 
cases, it is convenient to express the probability in terms of a formula. 


EXAMPLE 3-4 


There is a chance that a bit transmitted through a digital transmission channel is received in 


error. Let X equal the number of bits in error in the next four bits transmitted. The possible val- 
ues for X are (0, 1, 2, 3, 4}. Based on a model for the errors that is presented in the following 
section, probabilities for these values will be determined. Suppose that the probabilities are 


P(X = 1) = 0.2916 
P(X = 4) = 0.0001 


P(X = 2) = 0.0486 


The probability distribution of X is specified by the possible values along with the probability 
of each. A graphical description of the probability distribution of X is shown in Fig. 3-1. 


Suppose a loading on a long, thin beam places mass only at discrete points. See Fig. 3-2. 
The loading can be described by a function that specifies the mass at each of the discrete 
points. Similarly, for a discrete random variable_X, its distribution can be described by a func- 
tion that specifies the probability at each of the possible discrete values for X. 
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f (x) 
0.6561 e 
Loading 

0.2916 ° 0.0036 | | | 

"4 70-0001 | 
0.0486 AES 

O 1 2 3 4 x x 

Figure 3-1 Probability distribution Figure 3-2 Loadings at discrete points on a 
for bits in error. long, thin beam. 


Definition 
For a discrete random variable X with possible values x,, x», ... , x,, a probability 
mass function is a function such that 


tb eem 


Q) > f(x) = 1 
(3) f(x) a P(X ZA x;) 





For example, in Example 3-4, f(0) = 0.6561, f(1) = 0.2916, f(2) = 0.0486, f(3) = 0.0036, 
and f(4) = 0.0001. Check that the sum of the probabilities in Example 3-4 is 1. 


EXAMPLE 3-5 Let the random variable X denote the number of semiconductor wafers that need to be ana- 
lyzed in order to detect a large particle of contamination. Assume that the probability that a 
wafer contains a large particle is 0.01 and that the wafers are independent. Determine the 
probability distribution of X. 

Let p denote a wafer in which a large particle 1s present, and let a denote a wafer in which 
it is absent. The sample space of the experiment is infinite, and it can be represented as all pos- 
sible sequences that start with a string of a's and end with p. That is, 


s = {p, ap, aap, aaap, aaaap, aaaaap, and so forth} 


Consider a few special cases. We have P(X = 1) = P(p) = 0.01. Also, using the inde- 
pendence assumption 


P(X = 2) = P(ap) = 0.99(0.01) = 0.0099 
A general formula is 


P(X = x) = P(aa ... ap) = 0.99%! (0.01), for x = 1, 2, 3,... 
—— 


(x Tas 


Describing the probabilities associated with X in terms of this formula is the simplest method 
of describing the distribution of X in this example. Clearly f(x) = 0. The fact that the sum of 
the probabilities is 1 is left as an exercise. This is an example of a geometric random variable, 
and details are provided later in this chapter. 


EXERCISES FOR SECTION 3-2 


3-13. The sample space of a random experiment is {a, b, c, 
d, e,f 1, and each outcome is equally likely. A random variable 
is defined as follows: 


ME e 
x 0 0 1.5 1.5 2 3 


Determine the probability mass function of X. 

3-14. Use the probability mass function in Exercise 3-11 to 
determine the following probabilities: 

(a) P(X — 1.5) (by P(0.5 < X < 2.7) 

(c) P(X > 3) (d) POS X « 2) 

(e P(X-0 or X-2) 

Verify that the following functions are probability mass func- 
tions, and determine the requested probabilities. 


3-15. x = =] 0 1 2 
fc) | 1/8 | 2/8 | 2/8 | 2/8 | 1/8 

(a) P(X = 2) (b) P(X > —2) 

(c) (-1SX<1) (d P(Xx-1 or X=2) 

3416. f(x) = (8/70/20, x = 1, 2,3 

(a) P(X - 1) (b) P(X > 1) 

(c) P2« X«6) (d) P(Xx1 or X^» I) 

EEN f(x) = Tx x 20,1,2,3,4 

(a) P(X — 4) (b) P(X = 1) 

(c) P(2<X<4) (d) PX > —10) 

3.18. f(x) = G/AY1/4y, x 20,1,2,... 

(a) P(X 22) (b) P(X <2) 

(c) P(X 2) (d) P(X=1) 


3-19. Marketing estimates that a new instrument for the 
analysis of soil samples will be very successful, moderately 
successful, or unsuccessful, with probabilities 0.3, 0.6, 
and 0.1, respectively. The yearly revenue associated with 
a very successful, moderately successful, or unsuccessful 
product is $10 million, $5 million, and $1 million, respec- 
tively. Let the random variable X denote the yearly revenue of 
the product. Determine the probability mass function of X. 


3-20. A disk drive manufacturer estimates that in five years 
a storage device with | terabyte of capacity will sell with 
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probability 0.5, a storage device with 500 gigabytes capacity 
will sell with a probability 0.3, and a storage device with 100 
gigabytes capacity will sell with probability 0.2. The revenue 
associated with the sales in that year are estimated to be $50 
million, $25 million, and $10 million, respectively. Let X be 
the revenue of storage devices during that year. Determine the 
probability mass function of X. 


3-21. An optical inspection system is to distinguish 
among different part types. The probability of a correct 
classification of any part 1s 0.98. Suppose that three parts 
are inspected and that the classifications are independent. 
Let the random variable X denote the number of parts that 
are correctly classified. Determine the probability mass 
function of X. 


3-22. In a semiconductor manufacturing process, three 
wafers from a lot are tested. Each wafer is classified as pass or 
fail. Assume that the probability that a wafer passes the test 1s 
0.8 and that wafers are independent. Determine the probabil- 
ity mass function of the number of wafers from a lot that pass 
the test. 


3-23. The distributor of a machine for cytogenics has 
developed a new model. The company estimates that when it 
is introduced into the market, it will be very successful with a 
probability 0.6, moderately successful with a probability 0.3, 
and not successful with probability 0.1. The estimated yearly 
profit associated with the model being very successful is $15 
million and being moderately successful is $5 million; not 
successful would result in a loss of $500,000. Let X be the 
yearly profit of the new model. Determine the probability 
mass function of X. 


3-24. Anassembly consists of two mechanical components. 
Suppose that the probabilities that the first and second compo- 
nents meet specifications are 0.95 and 0.98. Assume that the 
components are independent. Determine the probability mass 
function of the number of components in the assembly that 
meet specifications. 


3-25. An assembly consists of three mechanical compo- 
nents. Suppose that the probabilities that the first, second, and 
third components meet specifications are 0.95, 0.98, and 0.99. 
Assume that the components are independent. Determine the 
probability mass function of the number of components in the 
assembly that meet specifications. 


3-3 CUMULATIVE DISTRIBUTION FUNCTIONS 


EXAMPLE 3-6 


In Example 3-4, we might be interested in the probability of three or fewer bits being in error. 


This question can be expressed as P(X < 3). 
The event that [X < 3} is the union of the events {X = 0}, LX = 1}, (X = 2}, and 
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{X = 3}. Clearly, these three events are mutually exclusive. Therefore, 


P(X x 3) = P(X = 0) + P(X = 1) + P(X = 2) + P(X = 3) 
= 0.6561 + 0.2916 + 0.0486 + 0.0036 = 0.9999 


This approach can also be used to determine 


P(X = 3) = P(X € 3) — P(X < 2) = 0.0036 


Example 3-6 shows that it is sometimes useful to be able to provide cumulative proba- 
bilities such as P(X < x) and that such probabilities can be used to find the probability mass 
function of a random variable. Therefore, using cumulative probabilities is an alternate 
method of describing the probability distribution of a random variable. 

In general, for any discrete random variable with possible values xj, x5,...,X,, 
the events [X = xı), (X = x)),..., {X ^ x,) are mutually exclusive. Therefore, 


P(X =< x m 2o ee fox). 





Definition 
The cumulative distribution function of a discrete random variable X, denoted as 
F(x), is 
ORE T EE ec) 
For a discrete random variable X, F(x) satisfies the following properties. 
(d) Fx) = PX Sx) = Ère f(xi) 
(ESSE 
(3) lt ss ben v s 
Like a probability mass function, a cumulative distribution function provides proba- 
bilities. Notice that even if the random variable X can only assume integer values, the 
cumulative distribution function can be defined at noninteger values. In Example 3-6, 
F(1.5) = P(X x 1.5) = P{X = 0} + P(X = 1) = 0.6561 + 0.2916 = 0.9477. Properties (1) 
and (2) of a cumulative distribution function follow from the definition. Property (3) follows 
from the fact that if x — y, the event that {X — x} is contained in the event (X S y}. 
The next example shows how the cumulative distribution function can be used to deter- 
mine the probability mass function of a discrete random variable. 
EXAMPLE 3-7? Determine the probability mass function of X from the following cumulative distribution 


function: 
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F(x) F(x) 
1.0 à. 1.000 — 
0.997 °——o 


0.2 
$.——————————Oo 
«—— ————àó «—————ó 
—2 O 2 x O 1 2 x 
Figure 3-3 Cumulative distribution function for Figure 3-4 Cumulative distribution 
Example 3-7. function for Example 3-8. 


Figure 3-3 displays a plot of F(x). From the plot, the only points that receive nonzero 
probability are —2, 0, and 2. The probability mass function at each point is the change in the 
cumulative distribution function at the point. Therefore, 


f(-2) = 0.2 — 0 = 0.2 f0) = 0.7 — 0.2 = 0.5 f(2) = 1.0 — 0.7 = 03 


EXAMPLE 3-8 Suppose that a day's production of 850 manufactured parts contains 50 parts that do not con- 
form to customer requirements. Two parts are selected at random, without replacement, from 
the batch. Let the random variable X equal the number of nonconforming parts in the sample. 
What is the cumulative distribution function of X? 

The question can be answered by first finding the probability mass function of X. 


_ 800 799 _ 


P(X = 0) = = 0.886 
850 849 
800 50 
P(X = 1) =2+—-+— = 0.111 
850 849 
50 49 
P(X = 2) = —-— = 0.003 
850 849 


Therefore, 


The cumulative distribution function for this example is graphed in Fig. 3-4. Note that 
F(x) is defined for all x from —9» < x < o and not only for 0, 1, and 2. 


EXERCISES FOR SECTION 3-3 


3-26. Determine the cumulative distribution function of the (c) P(-1. 1] Xx 1) (d) P(X » 0) 


random variable in Exercise 3-13. 3-28. Determine the cumulative distribution function for the 
3-27. Determine the cumulative distribution function for random variable in Exercise 3-17; also determine the following 
the random variable in Exercise 3-15; also determine the fol- probabilities: 

lowing probabilities: (a) P(X < 1.5) (b) P(X x 3) 


(a) P(X < 1.25) (b) P(X < 2.2) (c) P(X>2) (d) P(1« X x2) 
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3-29. Determine the cumulative distribution function for 
the random variable in Exercise 3-19. 

3-30. Determine the cumulative distribution function for 
the random variable in Exercise 3-20. 

3-31. Determine the cumulative distribution function for 
the random variable in Exercise 3-22. 

3-32. Determine the cumulative distribution function for 
the variable in Exercise 3-23. 


Verify that the following functions are cumulative distribution 
functions, and determine the probability mass function and the 
requested probabilities. 


3-33. 0 x«l 
F(x) = 4 0.5 |] 
1 a= Xx 
(a) P(X = 3) (b) P(X x 2) 
(e) Pulse. (de Pic?) 


3-34. Errors in an experimental transmission channel are 
found when the transmission 1s checked by a certifier that de- 
tects missing pulses. The number of errors found in an eight- 
bit byte is a random variable with the following distribution: 


0 x= 1 
0.7 l=x<4 
0.9 4=x< 7 
1 TES 


Determine each of the following probabilities: 


(a PX =4) (b) P(X» 7T) 
(c) P(X - 5) (d) P(X » 4) 
(e) P(X = 2) 
3-35. 0 x « -10 
|.]025 -10<x< 30 
œ) = 9 9.75 30 <x < 50 
l 50 € x 


(a) P(X = 50) 
(c) P(40- Xx 60) (d) P(X « 0) 

(e) P(0 = X « 10) (D P(-I0« X« I0) 

3-36. The thickness of wood paneling (in inches) that a cus- 
tomer orders is a random variable with the following cumula- 
tive distribution function: 


(b) P(X < 40) 


0 x « 1/8 
0.2 1/8 Sx < 1/4 
0.9 1/4 =x < 3/8 
1 3/8 =x 


Determine the following probabilities: 
(a P(X =< 1/18) (b) P(X = 1/4) 
(c) P(X = 5/16) (d) P(X > 1/4) 
(e) P(X = 1/2) 


3-4 MEAN AND VARIANCE OF A DISCRETE RANDOM VARIABLE 


Two numbers are often used to summarize a probability distribution for a random variable X. 
The mean is a measure of the center or middle of the probability distribution, and the variance 
is a measure of the dispersion, or variability in the distribution. These two measures do not 
uniquely identify a probability distribution. That 1s, two different distributions can have the 
same mean and variance. Still, these measures are simple, useful summaries of the probabil- 


ity distribution of X. 


Definition 


The mean or expected value of the discrete random variable X, denoted as u or E(X), is 


w= EX) = > xf) 


(3-3) 


The variance of X, denoted as o? or V(X), is 


"oo = IC hy Ee ces 6) — qu 


X 


The standard deviation of X is o = Vo. 
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(a) (b) 
Figure 3-5 A probability distribution can be viewed as a loading with the mean equal 
to the balance point. Parts (a) and (b) illustrate equal means, but Part (a) illustrates a 
larger variance. 


The mean of a discrete random variable X is a weighted average of the possible values of 
X, with weights equal to the probabilities. If f(x) is the probability mass function of a loading 
on a long, thin beam, E(X) is the point at which the beam balances. Consequently, E(X) 
describes the "center" of the distribution of X in a manner similar to the balance point of a 
loading. See Fig. 3-5. 

The variance of a random variable X is a measure of dispersion or scatter in the possible 
values for X. The variance of X uses weight f(x) as the multiplier of each possible squared 
deviation (x — p}. Figure 3-5 illustrates probability distributions with equal means but dif- 
ferent variances. Properties of summations and the definition of u can be used to show the 
equality of the formulas for variance. 


WX) = (x - uf) = xf) -2w» x0) + Ww f 


= Axfo)-2£j + pw? = Ef)-w 


Either formula for V(x) can be used. Figure 3-6 illustrates that two probability distributions 
can differ even though they have identical means and variances. 


In Example 3-4, there is a chance that a bit transmitted through a digital transmission channel 
is received in error. Let X equal the number of bits in error in the next four bits transmitted. 
The possible values for X are (0, 1, 2, 3, 4}. Based on a model for the errors that is presented 
in the following section, probabilities for these values will be determined. Suppose that the 
probabilities are 


P(X = 0) = 0.6561 P(X = 2) = 0.0486 P(X = 4) = 0.0001 
P(X = 1) = 0.2916 P(X = 3) = 0.0036 




















(a) (b) 
Figure 3-6 The probability distributions illustrated in Parts (a) and (b) differ even 
though they have equal means and equal variances. 
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EXAMPLE 3-10 


NOW 


p = E(X) = 0f(0) + 1/(1) + 2/2) + 3f(3) + 4f(4) 
= 0(0.6561) + 1(0.2916) + 2(0.0486) + 3(0.0036) + 4(0.0001) 
= 04 


Although X never assumes the value 0.4, the weighted average of the possible values is 0.4. 
To calculate V(X), a table is convenient. 


x x — 0.4 oe = gar Po) Mo Qr 
0 —0.4 0.16 0.6561 0.104976 
l 0.6 0.36 0.2916 0.104976 
2 1.6 2.56 0.0486 0.124416 
3 2.6 6.76 0.0036 0.024336 
4 3.6 12.96 0.0001 0.001296 


5 
V(X) = o^ = > f(x)(x, — 0.4)? = 0.36 
i=l 
The alternative formula for variance could also be used to obtain the same result. 


Two new product designs are to be compared on the basis of revenue potential. Marketing 
feels that the revenue from design A can be predicted quite accurately to be $3 million. The 
revenue potential of design B is more difficult to assess. Marketing concludes that there is a 
probability of 0.3 that the revenue from design B will be $7 million, but there is a 0.7 proba- 
bility that the revenue will be only $2 million. Which design do you prefer? 

Let X denote the revenue from design A. Because there is no uncertainty in the revenue 
from design A, we can model the distribution of the random variable X as $3 million with 
probability 1. Therefore, E(X) = $3 million. 

Let Y denote the revenue from design B. The expected value of Y in millions of dollars is 


E(Y) = $7(0.3) + $2(0.7) = $3.5 


Because E(Y) exceeds E(X), we might prefer design B. However, the variability of the result 
from design B is larger. That is, 


g^ e (T7 3503) 0 = 3.5) 10.7) 


— 5.25 millions of dollars squared 


Because the units of the variables in this example are millions of dollars, and because the vari- 
ance of a random variable squares the deviations from the mean, the units of o? are millions 
of dollars squared. These units make interpretation difficult. 

Because the units of standard deviation are the same as the units of the random variable, 
the standard deviation o 1s easier to interpret. In this example, we can summarize our results 
as "the average deviation of Y from its mean 1s $2.29 million." 


EXAMPLE 3-11 


Expected Value of a 
Function of a 
Discrete Random 
Variable 


EXAMPLE 3-12 
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The number of messages sent per hour over a computer network has the following distribution: 


x — number of messages 10 11 12 15 14 15 





f(x) 0.08 0.15 0.30 0.20 0.20 0.07 


Determine the mean and standard deviation of the number of messages sent per hour. 


E(X) = 10(0.08) + 11(0.15) + --- + 15(0.07) = 12.5 
V(X) = 107(0.08) + 117(0.15) + --- + 157(0.07) — 12.5? = 1.85 
o = VV(X) = V 1.85 = 1.36 
The variance of a random variable X can be considered to be the expected value of a specific 


function of X, namely, A(X) = (X — uY. In general, the expected value of any function A(X) 
of a discrete random variable is defined in a similar manner. 


If X is a discrete random variable with probability mass function f(x), 


E[H(X)] = >) xh(x)f(x) 





In Example 3-9, X is the number of bits in error in the next four bits transmitted. What is the 
expected value of the square of the number of bits in error? Now, A(X) = X°. Therefore, 


E{h(X)] = 0? x 0.6561 + 1? x 0.2916 + 2? X 0.0486 
+ 3? x 0.0036 + 4? x 0.0001 = 0.52 


In the previous example, the expected value of X^ does not equal E(X) squared. However, in 
the special case that A(X) = aX + b for any constants a and b, E[h(X)] = aE(X) + b. This 
can be shown from the properties of sums in the definition in Equation 3-4. 


EXERCISES FOR SECTION 3-4 


3-37. If the range of X is the set 10, 1, 2, 3, 4} and P(X = 3-42. Determine the mean and variance of the random vari- 


x) = 0.2 determine the mean and variance of the random variable. able in Exercise 3-20. 
3-38. Determine the mean and variance of the random vari- 3-43. Determine the mean and variance of the random vari- 


able in Exercise 3-13. 


able in Exercise 3-22. 


3-39. Determine the mean and variance of the random vari- 3-44. Determine the mean and variance of the random vari- 


able in Exercise 3-15. 


able in Exercise 3-23. 


3-40. Determine the mean and variance of the random vari- 3-45. The range of the random variable X is [0, 1, 2, 3, x], 


able in Exercise 3-17. 


where x is unknown. If each value is equally likely and the 


3-41. Determine the mean and variance of the random vari- mean of X is 6, determine x. 


able in Exercise 3-19. 
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3-5 DISCRETE UNIFORM DISTRIBUTION 


Definition 


EXAMPLE 3-13 


Figure 3-7 Probability ^" 
.l 


mass function for a 
discrete uniform ran- 
dom variable. 


The simplest discrete random variable 1s one that assumes only a finite number of possible 
values, each with equal probability. A random variable X that assumes each of the values 
X, X2, ... , X,, With equal probability 1/n, is frequently of interest. 


A random variable X has a discrete uniform distribution if each of the n values in 
its range, say, Xj, X», ... , x,, has equal probability. Then, 


pea) = We O) 





The first digit of a part's serial number is equally likely to be any one of the digits 0 through 9. 
If one part 1s selected from a large batch and X is the first digit of the serial number, X has a dis- 
crete uniform distribution with probability 0.1 for each value in R = (0, 1, 2, ..., 9}. That is, 


for each value in R. The probability mass function of X is shown in Fig. 3-7. 


Suppose the range of the discrete random variable X is the consecutive integers a, 
a t l,a t 2,..., b, fora x b. The range of X contains b — a + 1 values each with proba- 
bility 1/(b — a + 1). Now, 


a b(b + 1) — (a — l)a -- 
The algebraic identity > k = J can be used to simplify the result to 
k=a 


u = (b + a)/2, The derivation of the variance is left as an exercise. 


Suppose X is a discrete uniform random variable on the consecutive integers 
adt ld) EE» torus Mhe manoi Ts 


ID ar ie! 


ToO) 


2 


The variance of X is 
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As in Example 3-1, let the random variable X denote the number of the 48 voice lines that are 
in use at a particular time. Assume that X is a discrete uniform random variable with a range 


E(X) = (48 + 0)/2 = 24 


c = f[(48 — 0 + 1% — 1]/12] 7? = 14.14 


Equation 3-6 is more useful than it might first appear. If all the values in the range of a 
random variable X are multiplied by a constant (without changing any probabilities), the mean 
and standard deviation of X are multiplied by the constant. You are asked to verify this result 
in an exercise. Because the variance of a random variable is the square of the standard devia- 
tion, the variance of X 1s multiplied by the constant squared. More general results of this type 


EXAMPLE 3-14 

of 0 to 48. Then, 

and 

are discussed in Chapter 5. 
EXAMPLE 3-15 


Let the random variable Y denote the proportion of the 48 voice lines that are in use at a par- 
ticular time, and X denotes the number of lines that are in use at a particular time. Then, 


Y = X/48. Therefore, 


E(Y) = E(X)/48 = 0.5 


and 


EXERCISES FOR SECTION 3-5 


3-46. Let the random variable X have a discrete uniform 
distribution on the integers 0 = x = 100. Determine the mean 
and variance of X. 


3-47. Let the random variable X have a discrete uniform 
distribution on the integers 1 S x < 3. Determine the mean 
and variance of X. 


3-48. Letthe random variable X be equally likely to assume 
any of the values 1/8, 1/4, or 3/8. Determine the mean and 
variance of X. 


3-49. Thickness measurements of a coating process are 
made to the nearest hundredth of a millimeter. The thickness 
measurements are uniformly distributed with values 0.15, 
0.16, 0.17, 0.18, and 0.19. Determine the mean and variance 
of the coating thickness for this process. 


3-50. Product codes of 2, 3, or 4 letters are equally likely. 
What is the mean and standard deviation of the number of 
letters in 100 codes? 

3-51. The lengths of plate glass parts are measured to the 
nearest tenth of a millimeter. The lengths are uniformly dis- 
tributed, with values at every tenth of a millimeter starting at 


V(Y) = V(X)/48? = 0.087 


590.0 and continuing through 590.9. Determine the mean and 
variance of lengths. 


3-52. Suppose that_X has a discrete uniform distribution on 
the integers 0 through 9. Determine the mean, variance, and 
standard deviation of the random variable Y = 5X and com- 
pare to the corresponding results for X. 


3-53. Show that for a discrete uniform random variable X, 
if each of the values in the range of X is multiplied by the 
constant c, the effect 1s to multiply the mean of X by c and 
the variance of X by c^. That is, show that E(cX) = cE(X) 
and V(cX) = c?V(X). 

3-54. The probability of an operator entering alphanu- 
meric data incorrectly into a field in a database is equally 
likely. The random variable X is the number of fields on a 
data entry form with an error. The data entry form has 
28 fields. Is X a discrete uniform random variable? Why or 
why not. 
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3-6 BINOMIAL DISTRIBUTION 


Consider the following random experiments and random variables: 


1. Flipacoin 10 times. Let X = number of heads obtained. 


2. Aworn machine tool produces 1% defective parts. Let X = number of defective parts 
in the next 25 parts produced. 


3. Each sample of air has a 1096 chance of containing a particular rare molecule. Let 
X — the number of air samples that contain the rare molecule in the next 18 samples 
analyzed. 


4. Of all bits transmitted through a digital transmission channel, 10% are received in 
error. Let X — the number of bits in error in the next five bits transmitted. 


5. Amultiple choice test contains 10 questions, each with four choices, and you guess 
at each question. Let X — the number of questions answered correctly. 


In the next 20 births at a hospital, let X — the number of female births. 


Of all patients suffering a particular illness, 35% experience improvement from a 
particular medication. In the next 100 patients administered the medication, let X — 
the number of patients who experience improvement. 


These examples illustrate that a general probability model that includes these experiments as 
particular cases would be very useful. 

Each of these random experiments can be thought of as consisting of a series of repeated, 
random trials: 10 flips of the coin in experiment 1, the production of 25 parts in experiment 2, 
and so forth. The random variable in each case is a count of the number of trials that meet a 
specified criterion. The outcome from each trial either meets the criterion that X counts or it 
does not; consequently, each trial can be summarized as resulting in either a success or a fail- 
ure. For example, in the multiple choice experiment, for each question, only the choice that is 
correct is considered a success. Choosing any one of the three incorrect choices results in the 
trial being summarized as a failure. 

The terms success and failure are just labels. We can just as well use A and B or 0 or 1. 
Unfortunately, the usual labels can sometimes be misleading. In experiment 2, because X 
counts defective parts, the production of a defective part is called a success. 

A trial with only two possible outcomes is used so frequently as a building block of a 
random experiment that it is called a Bernoulli trial. It is usually assumed that the trials that 
constitute the random experiment are independent. This implies that the outcome from one 
trial has no effect on the outcome to be obtained from any other trial. Furthermore, it is 
often reasonable to assume that the probability of a success in each trial is constant. In 
the multiple choice experiment, if the test taker has no knowledge of the material and just 
guesses at each question, we might assume that the probability of a correct answer is 1/4 
for each question. 

Factorial notation is used in this section. Recall that n! denotes the product of the integers 
less than or equal to n: 


n! = n(n — 1)(n — 2) --- (2X1) 


For example, 
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and by definition 0! = 1. We also use the combinatorial notation 


For example, 





(2) 5! 120 
2 213! 2-6 


See Section 2-1.4, CD material for Chapter 2, for further comments. 


EXAMPLE 3-16 The chance that a bit transmitted through a digital transmission channel is received in error is 
0.1. Also, assume that the transmission trials are independent. Let X — the number of bits in 
error in the next four bits transmitted. Determine P(X — 2). 

Let the letter E denote a bit in error, and let the letter O denote that the bit is okay, that is, 
received without error. We can represent the outcomes of this experiment as a list of four let- 
ters that indicate the bits that are in error and those that are okay. For example, the outcome 
OEOE indicates that the second and fourth bits are in error and the other two bits are okay. The 
corresponding values for x are 


Outcome X Outcome 27 
OOOO 0 EOOO 1 
OOOE 1 EOOE 2 
OOEO 1 EOEO 2 
OOEE 2 EOEE 3 
OEOO 1 EEOO 2 
OEOE 2 EEOE 3 
OEEO 2 EEEO 3 
OEEE 3 EEEE 4 


The event that X = 2 consists of the six outcomes: 
{EEOO, EOEO, EOOE, OEEO, OEOE, OOEE} 
Using the assumption that the trials are independent, the probability of {EEOO} is 
P(EEOO) = P(E)P(E)P(O)P(O) = (0.1)(0.9? = 0.0081 


Also, any one of the six mutually exclusive outcomes for which X — 2 has the same proba- 
bility of occurring. Therefore, 


P(X = 2) = 6(0.0081) = 0.0486 
In general, 


P(X = x) = (number of outcomes that result in x errors) times (0.1)(0.9)* * 


14 
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Definition 





To complete a general probability formula, only an expression for the number of outcomes 
that contain x errors is needed. An outcome that contains x errors can be constructed by parti- 
tioning the four trials (letters) in the outcome into two groups. One group is of size x and 
contains the errors, and the other group is of size n — x and consists of the trials that are okay. 
The number of ways of partitioning four objects into two groups, one of which is of size x, is 


4 | 
( = NE M Therefore, in this example 
x!(4 — x)! 


P(X =x) = (“Jono 


4 
Notice that (2) = 41/|2! 2!] = 6, as found above. The probability mass function of X 


was shown in Example 3-4 and Fig. 3-1. 


The previous example motivates the following result. 


A random experiment consists of n Bernoulli trials such that 


(1) The trials are independent 


(2) Each trial results in only two possible outcomes, labeled as *success" and 
"failure" 


(3) The probability of a success in each trial, denoted as p, remains constant 


The random variable X that equals the number of trials that result in a success 
has a binomial random variable with parameters 0 < p < 1 andn = 1, 2, .... The 
probability mass function of X 1s 


(3-7) 


As in Example 3-16, (") equals the total number of different sequences of trials that 
contain x successes and n — x failures. The total number of different sequences that contain x 
successes and n — x failures times the probability of each sequence equals P(X = x). 

The probability expression above is a very useful formula that can be applied in a num- 
ber of examples. The name of the distribution is obtained from the binomial expansion. For 
constants a and b, the binomial expansion is 


(a + b) = 2 a*p"* 


k=0 


Let p denote the probability of success on a single trial. Then, by using the binomial 
expansion with a = p and b = 1 — p, we see that the sum of the probabilities for a bino- 
mial random variable is 1. Furthermore, because each trial in the experiment is classified 
into two outcomes, (success, failure}, the distribution is called a “bi’’-nomial. A more 
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Figure 3-8 Binomial distributions for selected values of n and p. 


EXAMPLE 3-17 


EXAMPLE 3-18 


general distribution, which includes the binomial as a special case, is the multinomial 
distribution. 

Examples of binomial distributions are shown in Fig. 3-8. For a fixed n, the distribution 
becomes more symmetric as p increases from 0 to 0.5 or decreases from 1 to 0.5. For a fixed 
p, the distribution becomes more symmetric as n increases. 


Several examples using the binomial coefficient (") follow. 
x 


i = 101/[3! 7!] = (10: 9 - 8)/(3 - 2) = 120 


15 
(is) = 15!/[10! 51] = (15+ 14+ 13 12+ 11)/(5 + 4 + 3 + 2) = 3003 


ow 
4 
Each sample of water has a 10% chance of containing a particular organic pollutant. Assume 
that the samples are independent with regard to the presence of the pollutant. Find the proba- 
bility that in the next 18 samples, exactly 2 contain the pollutant. 

Let X = the number of samples that contain the pollutant in the next 18 samples analyzed. 


Then X is a binomial random variable with p = 0.1 andn = 18. 
Therefore, 


= 100!/[4! 96!] = (100 - 99 - 98 - 97)/(4 - 3 + 2) = 3,921,225 
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18 
Now( >) = 18!/[2! 16!| = 18(17)/2 = 153. Therefore, 


P(X = 2) = 153(0.1)7(0.9)'° = 0.284 


Determine the probability that at least four samples contain the pollutant. The requested 
probability is 


P(X = 4) = : (3150957 


However, it is easier to use the complementary event, 


P(X =4)=1-PX<4)=1- Y (ortos 


= 1 — [0.150 + 0.300 + 0.284 + 0.168] = 0.098 


Determine the probability that 3 = X < 7. Now 


P3=X<7) Y (S oaro 


x=3 
= 0.168 + 0.070 + 0.022 + 0.005 
= 0.265 


The mean and variance of a binomial random variable depend only on the parameters p 
and n. Formulas can be developed from moment generating functions, and details are pro- 
vided in Section 5-8, part of the CD material for Chapter 5. The results are simply stated here. 


Definition 
If X is a binomial random variable with parameters p and n, 


n B00) = we HX = al = e) 





EXAMPLE 3-19 For the number of transmitted bits received in error in Example 3-16, n = 4 and p = 0.1, so 
E(X) = 4(0.1) = 0.4 and V(X) = 4(0.1)(0.9) = 0.36 


and these results match those obtained from a direct calculation in Example 3-9. 


EXERCISES FOR SECTION 3-6 


3-55. For each scenario described below, state whether or transducers in a sample of size 30 selected at random from 
not the binomial distribution is a reasonable model for the ran- the process. 

dom variable and why. State any assumptions you make. (b) From a batch of 50 temperature transducers, a sample of 
(a) A production process produces thousands of temperature size 30 is selected without replacement. Let X denote the 


transducers. Let X denote the number of nonconforming number of nonconforming transducers in the sample. 


(c) Four identical electronic components are wired to a con- 
troller that can switch from a failed component to one of 
the remaining spares. Let X denote the number of compo- 
nents that have failed after a specified period of operation. 

(d) Let X denote the number of accidents that occur along the 
federal highways in Arizona during a one-month period. 

(e) Let X denote the number of correct answers by a student 
taking a multiple choice exam in which a student can elim- 
inate some of the choices as being incorrect in some ques- 
tions and all of the incorrect choices in other questions. 

(f) Defects occur randomly over the surface of a semiconduc- 
tor chip. However, only 80% of defects can be found by 
testing. A sample of 40 chips with one defect each is 
tested. Let X denote the number of chips in which the test 
finds a defect. 

(g) Reconsider the situation in part (f). Now, suppose the sam- 
ple of 40 chips consists of chips with 1 and with 0 defects. 

(h) A filling operation attempts to fill detergent packages to 
the advertised weight. Let X denote the number of deter- 
gent packages that are underfilled. 

(1) Errors in a digital communication channel occur in bursts 
that affect several consecutive bits. Let X denote the num- 
ber of bits in error in a transmission of 100,000 bits. 

(j) Let X denote the number of surface flaws in a large coil of 
galvanized steel. 


3-56. The random variable X has a binomial distribution with 
n = 10 and p = 0.5. Sketch the probability mass function of X. 
(a) What value of X is most likely? 

(b) What value(s) of X is(are) least likely? 


3-57. The random variable X has a binomial distribution with 
n = 10 and p = 0.5. Determine the following probabilities: 


(a P(X-5) (b) P(X S 2) 
(c) P(Xz9) (d P3zX«5) 
3-58. Sketch the probability mass function of a binomial 


distribution with n — 10 and p — 0.01 and comment on the 
shape of the distribution. 

(a) What value of X is most likely? 

(b) What value of X is least likely? 


3-59, The random variable X has a binomial distribution with 
n = 10 and p = 0.01. Determine the following probabilities. 
(a P(X-25) (b) P(X S 2) 

(c) P(Xz9) (d)P(3zX«5) 

3-60. Determine the cumulative distribution function of a 
binomial random variable with n — 3 and p — 1/2. 


3-61. Determine the cumulative distribution function of a 
binomial random variable with n — 3 and p — 1/4. 


3-62. An electronic product contains 40 integrated circuits. 
The probability that any integrated circuit is defective is 0.01, 
and the integrated circuits are independent. The product oper- 
ates only if there are no defective integrated circuits. What is 
the probability that the product operates? 


3-63. Let X denote the number of bits received in error in a 
digital communication channel, and assume that X is a bino- 
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mial random variable with p — 0.001. If 1000 bits are trans- 

mitted, determine the following: 

(a P(X-1) (b) P(Xz 1) 

(c) P(X - 2) (d) mean and variance of X 

3-64. The phone lines to an airline reservation system are 

occupied 40% of the time. Assume that the events that the lines 

are occupied on successive calls are independent. Assume that 

10 calls are placed to the airline. 

(a) What is the probability that for exactly three calls the lines 
are occupied? 

(b) What is the probability that for at least one call the lines 
are not occupied? 

(c) What is the expected number of calls in which the lines 
are all occupied? 


3-65. Batches that consist of 50 coil springs from a production 

process are checked for conformance to customer requirements. 

The mean number of nonconforming coil springs in a batch is 5. 

Assume that the number of nonconforming springs in a batch, 

denoted as X, 1s a binomial random variable. 

(a) What are n and p? 

(b) What is P(X = 2)? 

(c) What is P(X = 49)? 

3-66. Astatistical process control chart example. Samples 

of 20 parts from a metal punching process are selected every 

hour. Typically, 1% of the parts require rework. Let X denote 

the number of parts in the sample of 20 that require rework. A 

process problem is suspected if X exceeds its mean by more 

than three standard deviations. 

(a) If the percentage of parts that require rework remains at 
1%, what is the probability that X exceeds its mean by 
more than three standard deviations? 

(b) If the rework percentage increases to 4%, what is the 
probability that X exceeds 1? 

(c) If the rework percentage increases to 4%, what is the 
probability that X exceeds 1 in at least one of the next five 
hours of samples? 


3-67. Because not all airline passengers show up for their 

reserved seat, an airline sells 125 tickets for a flight that holds 

only 120 passengers. The probability that a passenger does not 

show up is 0.10, and the passengers behave independently. 

(a) What is the probability that every passenger who shows 
up can take the flight? 

(b) What is the probability that the flight departs with empty 
seats? 


3-68. This exercise illustrates that poor quality can affect 
schedules and costs. A manufacturing process has 100 cus- 
tomer orders to fill. Each order requires one component part 
that is purchased from a supplier. However, typically, 2% of 
the components are identified as defective, and the compo- 
nents can be assumed to be independent. 
(a) If the manufacturer stocks 100 components, what is the 
probability that the 100 orders can be filled without 
reordering components? 
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(b) If the manufacturer stocks 102 components, what is the 
probability that the 100 orders can be filled without 
reordering components? 

(c) If the manufacturer stocks 105 components, what is the 
probability that the 100 orders can be filled without 
reordering components? 


3-69. A multiple choice test contains 25 questions, each 

with four answers. Assume a student just guesses on each 

question. 

(a) What is the probability that the student answers more than 
20 questions correctly? 


(b) What is the probability the student answers less than 5 
questions correctly? 


3-70. A particularly long traffic light on your morning com- 

mute is green 20% of the time that you approach it. Assume 

that each morning represents an independent trial. 

(a) Over five mornings, what is the probability that the light 1s 
green on exactly one day? 

(b) Over 20 mornings, what is the probability that the light is 
green on exactly four days? 

(c) Over 20 mornings, what is the probability that the light is 
green on more than four days? 
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3-7.1 Geometric Distribution 


Consider a random experiment that is closely related to the one used in the definition of a 
binomial distribution. Again, assume a series of Bernoulli trials (independent trials with con- 
stant probability p of a success on each trial). However, instead of a fixed number of trials, 
trials are conducted until a success is obtained. Let the random variable X denote the number 
of trials until the first success. In Example 3-5, successive wafers are analyzed until a large 
particle is detected. Then, X is the number of wafers analyzed. In the transmission of bits, X 
might be the number of bits transmitted until an error occurs. 


EXAMPLE 3-20 


The probability that a bit transmitted through a digital transmission channel is received in 


error is 0.1. Assume the transmissions are independent events, and let the random variable X 
denote the number of bits transmitted until the first error. 

Then, P(X = 5) is the probability that the first four bits are transmitted correctly and the 
fifth bit is in error. This event can be denoted as {OOOOE}, where O denotes an okay bit. 
Because the trials are independent and the probability of a correct transmission is 0.9, 


P(X = 5) = P(OOOOE) = 0.9*0.1 = 0.066 


Note that there is some probability that X will equal any integer value. Also, if the first trial is 
a success, X = 1. Therefore, the range of X is {1, 2, 3, ... }, that is, all positive integers. 


Definition 


In a series of Bernoulli trials (independent trials with constant probability p of a suc- 
cess), let the random variable X denote the number of trials until the first success. 
Then X is a geometric random variable with parameter 0 « p < 1 and 


f(x) = (1 — py !p 





I= ELEM (3-9) 


Examples of the probability mass functions for geometric random variables are shown in 
Fig. 3-9. Note that the height of the line at x 1s (1 — p) times the height of the line at x — 1. 
That is, the probabilities decrease in a geometric progression. The distribution acquires its 


name from this result. 


Figure 3-9 Geometric 
distributions for 
selected values of the 
parameter p. 


EXAMPLE 3-21 


EXAMPLE 3-22 
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The probability that a wafer contains a large particle of contamination is 0.01. If it is assumed 
that the wafers are independent, what 1s the probability that exactly 125 wafers need to be 
analyzed before a large particle is detected? 

Let X denote the number of samples analyzed until a large particle 1s detected. Then X is 
a geometric random variable with p = 0.01. The requested probability is 


P(X = 125) = (0.99)*0.01 = 0.0029 


The derivation of the mean and variance of a geometric random variable is left as an exercise. 
Note that $7, k(1 — p)* !p can be shown to equal 1/p. The results are as follows. 


If X is a geometric random variable with parameter p, 


je pO = IS 





Consider the transmission of bits in Example 3-20. Here, p — 0.1. The mean number of 
transmissions until the first error is 1/0.1 — 10. The standard deviation of the number 
of transmissions before the first error 1s 


o = [(1 — 0.1)/0.17]'” = 9.49 
Lack of Memory Property 


A geometric random variable has been defined as the number of trials until the first success. 
However, because the trials are independent, the count of the number of trials until the next 
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EXAMPLE 3-23 


success can be started at any trial without changing the probability distribution of the random 
variable. For example, in the transmission of bits, if 100 bits are transmitted, the probability 
that the first error, after bit 100, occurs on bit 106 is the probability that the next six outcomes 
are OOOOOE. This probability is (0.9)(0.1) = 0.059, which is identical to the probability 
that the initial error occurs on bit 6. 

The implication of using a geometric model is that the system presumably will not wear 
out. The probability of an error remains constant for all transmissions. In this sense, the geo- 
metric distribution is said to lack any memory. The lack of memory property will be dis- 
cussed again in the context of an exponential random variable in Chapter 4. 


In Example 3-20, the probability that a bit is transmitted in error is equal to 0.1. Suppose 
50 bits have been transmitted. The mean number of bits until the next error is 1/0.1 = 10— 
the same result as the mean number of bits until the first error. 


3-7.2 Negative Binomial Distribution 


EXAMPLE 3-24 


Definition 


A generalization of a geometric distribution in which the random variable is the number of 
Bernoulli trials required to obtain r successes results in the negative binomial distribution. 


As in Example 3-20, suppose the probability that a bit transmitted through a digital transmis- 
sion channel is received in error is 0.1. Assume the transmissions are independent events, and 
let the random variable X denote the number of bits transmitted until the fourth error. 

Then, X has a negative binomial distribution with r = 4. Probabilities involving X can be 
found as follows. The P(X — 10) 1s the probability that exactly three errors occur in the first 
nine trials and then trial 10 results in the fourth error. The probability that exactly three errors 
occur in the first nine trials is determined from the binomial distribution to be 


(3) (0.1)°(0.9)° 


Because the trials are independent, the probability that exactly three errors occur in the first 
9 trials and trial 10 results in the fourth error is the product of the probabilities of these two 
events, namely, 


(3) (0.1) (0.9)*(0.1) = b 0:109) 


The previous result can be generalized as follows. 


In a series of Bernoulli trials (independent trials with constant probability p of a suc- 
cess), let the random variable X denote the number of trials until r successes occur. 
Then X is a negative binomial random variable with parameters 0 « p « 1 and 
r= 1l,23,..., and 


x cec (3-11) 





Figure 3-10 Negative 
binomial distributions 
for selected values of the 
parameters r and p. 


Figure 3-11 Negative 
binomial random 
variable represented as 
a sum of geometric 
random variables. 
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Because at least r trials are required to obtain r successes, the range of X is from r to oc. In the 
special case that r = 1, a negative binomial random variable is a geometric random variable. 
Selected negative binomial distributions are illustrated in Fig. 3-10. 

The lack of memory property of a geometric random variable implies the following. Let 
X denote the total number of trials required to obtain r successes. Let X, denote the number of 
trials required to obtain the first success, let X, denote the number of extra trials required to 
obtain the second success, let X4 denote the number of extra trials to obtain the third success, 
and so forth. Then, the total number of trials required to obtain r successes is 
X = X, + X, + + X. Because of the lack of memory property, each of the random vari- 
ables Xj, X5, ..., X, has a geometric distribution with the same value of p. Consequently, a 
negative binomial random variable can be interpreted as the sum of r geometric random vari- 
ables. This concept is illustrated in Fig. 3-11. 

Recall that a binomial random variable is a count of the number of successes in n 
Bernoulli trials. That is, the number of trials is predetermined, and the number of successes is 
random. A negative binomial random variable is a count of the number of trials required to 


X= Xj + Xo + X3 


Xi X» X3 
f MM FN T M — — gn 
* * * 
1 2 3 4 5 6 7 8 9 10 11 12 
Trials 


* indicates a trial that results in a "success". 
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obtain r successes. That is, the number of successes is predetermined, and the number of trials 
is random. In this sense, a negative binomial random variable can be considered the opposite, 
or negative, of a binomial random variable. 

The description of a negative binomial random variable as a sum of geometric random 
variables leads to the following results for the mean and variance. Sums of random variables 
are studied in Chapter 5. 


If X is a negative binomial random variable with parameters p and r, 


p — E(X) = r/p ZO m o qo (3-12) 





EXAMPLE 3-25 A Web site contains three identical computer servers. Only one is used to operate the site, and 
the other two are spares that can be activated in case the primary system fails. The probability 
of a failure in the primary computer (or any activated spare system) from a request for service 
is 0.0005. Assuming that each request represents an independent trial, what is the mean num- 
ber of requests until failure of all three servers? 

Let X denote the number of requests until all three servers fail, and let Xj, X5, and X; 
denote the number of requests before a failure of the first, second, and third servers used, 
respectively. Now, X = X, + X; + X. Also, the requests are assumed to comprise independ- 
ent trials with constant probability of failure p — 0.0005. Furthermore, a spare server is not 
affected by the number of requests before it 1s activated. Therefore, X has a negative binomial 
distribution with p = 0.0005 and r = 3. Consequently, 


E(X) = 3/0.0005 = 6000 requests 


What is the probability that all three servers fail within five requests? The probability is 
P(X = 5) and 


P(X = 5) = P(X = 3) + P(X = 4) + P(X = 5) 
3 4 
= 0.0005? + (3) 0.0005°(0.9995) + (2) 0.0005?(0.9995)* 


= 125 X 10 '° + 375 x 10? + 749 x 107" 


= 1.249 x 10? 
EXERCISES FOR SECTION 3-7 
3-71. Suppose the random variable X has a geometric 3-72. Suppose the random variable X has a geometric 
distribution with p = 0.5. Determine the following proba- distribution with a mean of 2.5. Determine the following 
bilities: probabilities: 
(a) P(X — 1) (b) PX = 4) (a) P(X — 1) (b) PX = 4) 
(c) P(X —- 8) (d) P(X S 2) (c) P(X—- 5) (d) P(Xx3) 


(e) P(X > 2) (e) P(X > 3) 
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3-73. The probability of a successful optical alignment in 

the assembly of an optical data storage product is 0.8. Assume 

the trials are independent. 

(a) What is the probability that the first successful alignment 
requires exactly four trials? 

(b) What is the probability that the first successful alignment 
requires at most four trials? 

(c) What is the probability that the first successful alignment 
requires at least four trials? 


3-74. In a clinical study, volunteers are tested for a gene 

that has been found to increase the risk for a disease. The 

probability that a person carries the gene is 0.1. 

(a) What is the probability 4 or more people will have to be 
tested before 2 with the gene are detected? 

(b) How many people are expected to be tested before 2 with 
the gene are detected? 


3-75. Assume that each of your calls to a popular radio station 

has a probability of 0.02 of connecting, that is, of not obtaining a 

busy signal. Assume that your calls are independent. 

(a) What is the probability that your first call that connects is 
your tenth call? 

(b) What is the probability that it requires more than five calls 
for you to connect? 

(c) What is the mean number of calls needed to connect? 


3-76. In Exercise 3-70, recall that a particularly long traffic 

light on your morning commute is green 20% of the time that 

you approach it. Assume that each morning represents an 

independent trial. 

(a) What is the probability that the first morning that the light 
is green is the fourth morning that you approach it? 

(b) What is the probability that the light is not green for 10 
consecutive mornings? 


3-77. A trading company has eight computers that it uses to 

trade on the New York Stock Exchange (NYSE). The proba- 

bility of a computer failing in a day is 0.005, and the comput- 

ers fail independently. Computers are repaired in the evening 

and each day is an independent trial. 

(a) What is the probability that all eight computers fail in a 
day? 

(b) What is the mean number of days until a specific com- 
puter fails? 

(c) What is the mean number of days until all eight computers 
fail in the same day? 


3-78. In Exercise 3-66, recall that 20 parts are checked each 

hour and that X denotes the number of parts in the sample of 

20 that require rework. 

(a) If the percentage of parts that require rework remains at 
1%, what is the probability that hour 10 is the first sample 
at which X exceeds 1? 

(b) If the rework percentage increases to 4%, what is the 
probability that hour 10 is the first sample at which X 
exceeds 1? 


(c) If the rework percentage increases to 496, what is the 
expected number of hours until X exceeds 1? 


3-79. Consider a sequence of independent Bernoulli trials 

with p — 0.2. 

(a) What 1s the expected number of trials to obtain the first 
success? 

(b) After the eighth success occurs, what is the expected num- 
ber of trials to obtain the ninth success? 


3-80. Show that the probability density function of a nega- 
tive binomial random variable equals the probability density 
function of a geometric random variable when r — 1. Show 
that the formulas for the mean and variance of a negative bi- 
nomial random variable equal the corresponding results for 
geometric random variable when r — 1. 


3-81. Suppose that X is a negative binomial random variable 
with p = 0.2 andr = 4. Determine the following: 

(a) E(X) (b) P(X — 20) 

(c) P(X = 19) (d) P(X = 21) 

(e) The most likely value for X 

3-82. The probability is 0.6 that a calibration of a transducer 
in an electronic instrument conforms to specifications for the 
measurement system. Assume the calibration attempts are 
independent. What is the probability that at most three 
calibration attempts are required to meet the specifications for 
the measurement system? 


3-83. An electronic scale in an automated filling operation 

stops the manufacturing line after three underweight packages 

are detected. Suppose that the probability of an underweight 

package is 0.001 and each fill is independent. 

(a) What is the mean number of fills before the line is 
stopped? 

(b) What is the standard deviation of the number of fills 
before the line is stopped? 


3-84. A fault-tolerant system that processes transactions for 
a financial services firm uses three separate computers. If the 
operating computer fails, one of the two spares can be imme- 
diately switched online. After the second computer fails, the 
last computer can be immediately switched online. Assume 
that the probability of a failure during any transaction is 1078 
and that the transactions can be considered to be independent 
events. 
(a) What is the mean number of transactions before all com- 
puters have failed? 
(b) What is the variance of the number of transactions before 
all computers have failed? 
3-85. Derive the expressions for the mean and variance of a 
geometric random variable with parameter p. (Formulas for 
infinite series are required.) 
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3-8 HYPERGEOMETRIC DISTRIBUTION 


In Example 3-8, a day’s production of 850 manufactured parts contains 50 parts that do not 
conform to customer requirements. Two parts are selected at random, without replacement 
from the day’s production. That is, selected units are not replaced before the next selection is 
made. Let A and B denote the events that the first and second parts are nonconforming, re- 
spectively. In Chapter 2, we found P(B|A) = 49/849 and P(A) = 50/850. Consequently, 
knowledge that the first part is nonconforming suggests that it 1s less likely that the second 
part selected is nonconforming. 

This experiment is fundamentally different from the examples based on the binomial dis- 
tribution. In this experiment, the trials are not independent. Note that, in the unusual case that 
each unit selected is replaced before the next selection, the trials are independent and there is 
a constant probability of a nonconforming part on each trial. Then, the number of noncon- 
forming parts in the sample is a binomial random variable. 

Let X equal the number of nonconforming parts in the sample. Then 


P(X = 0) = P(both parts conform) = (800/850)(799/849) = 0.886 


P(X = 1) = P(first part selected conforms and the second part selected 
does not, or the first part selected does not and the second part 
selected conforms) 


= (800/850)(50/849) + (50/850)(800/849) = 0.111 
P(X = 2) = P(both parts do not conform) = (50/850)(49/849) = 0.003 


As in this example, samples are often selected without replacement. Although probabili- 
ties can be determined by the reasoning used in the example above, a general formula for 
computing probabilities when samples are selected without replacement is quite useful. The 
counting rules presented in Section 2-1.4, part of the CD material for Chapter 2, can be used 
to justify the formula given below. 


Definition 

A set of N objects contains 

K objects classified as successes 

N — K objects classified as failures 
A sample of size n objects is selected randomly (without replacement) from the NV 
objects, where K = N and n SN. 

Let the random variable X denote the number of successes in the sample. Then 
X 1s a hypergeometric random variable and 


x = max(0, n + K — N} to min{K, n} (3-13) 





The expression min(K, n} is used in the definition of the range of X because the maximum 
number of successes that can occur in the sample is the smaller of the sample size, 7, 


EXAMPLE 3-26 


EXAMPLE 3-27 
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Figure 3-12 Hypergeometric distributions for 
selected values of parameters N, K, and n. 


and the number of successes available, K. Also, if n + K > N, at least n + K — N suc- 
cesses must occur in the sample. Selected hypergeometric distributions are illustrated in 
Fig. 3-12. 


The example at the start of this section can be reanalyzed by using the general expression in 
the definition of a hypergeometric random variable. That is, 


(C 
0 2 
_ 319600 . 46 








P(X = 0) = = 
(50) 360825 
2 
ry) 
1 1 
Melhs- cu id, = 0.111 
(50) 360825 
2 
eee 
2 0 
e ERE Lnd ds 


"m = 360825 
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A batch of parts contains 100 parts from a local supplier of tubing and 200 parts from a sup- 
plier of tubing in the next state. If four parts are selected randomly and without replacement, 
what is the probability they are all from the local supplier? 
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EXAMPLE 3-28 


Let X equal the number of parts in the sample from the local supplier. Then, X has a 
hypergeometric distribution and the requested probability is P(X — 4). Consequently, 


P(X = 4) = Ke) as 


S 


What is the probability that two or more parts in the sample are from the local supplier? 
ones P) P) 
2 2 3 1 4 0 
" oU oU 
4 4 4 


= 0.298 + 0.098 + 0.0119 = 0.408 
What is the probability that at least one part in the sample is from the local supplier? 


P) 
0 4 
P(Xz1)—51—P(X0)-91—-——— ———- = 0.804 
(0) 
4 
The mean and variance of a hypergeometric random variable can be determined from 
the trials that comprise the experiment. However, the trials are not independent, and so the 


calculations are more difficult than for a binomial distribution. The results are stated as 
follows. 





P(X = 2) = 


If X is a hypergeometric random variable with parameters N, K, and n, then 


p — E(X) = np 


where p — K/N. 





Here p is interpreted as the proportion of successes in the set of N objects. 


In the previous example, the sample size is 4. The random variable X is the number of parts in 
the sample from the local supplier. Then, p — 100/300 — 1/3. Therefore, 


E(X) — 4(100/300) — 1.33 


Finite 
Population 
Correction 

Factor 


EXAMPLE 3-29 


Figure 3-13 
Comparison of hyper- 
geometric and binomial 
distributions. 
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and 
V(X) = 4(1/3)(2/3)[(300 — 4)/299] = 0.88 
For a hypergeometric random variable, E(X) is similar to the mean a binomial random 


variable. Also, V(X) differs from the result for a binomial random variable only by the term 
shown below. 


The term in the variance of a hypergeometric random variable 


N= 7 
dy em 


is called the finite population correction factor. 





sampling with replacement is equivalent to sampling from an infinite set because the propor- 
tion of success remains constant for every trial in the experiment. As mentioned previously, if 
sampling were done with replacement, X would be a binomial random variable and its vari- 
ance would be np(1 — p). Consequently, the finite population correction represents the cor- 
rection to the binomial variance that results because the sampling 1s without replacement from 
the finite set of size N. 

If n is small relative to N, the correction is small and the hypergeometric distribution is sim- 
ilar to the binomial. In this case, a binomial distribution can effectively approximate the distribu- 
tion of the number of units of a specified type in the sample. A case is illustrated in Fig. 3-13. 


A listing of customer accounts at a large corporation contains 1000 customers. Of these, 700 
have purchased at least one of the corporation's products in the last three months. To evaluate 
a new product design, 50 customers are sampled at random from the corporate listing. What is 


0.3 t 3 
0.2 
(x) + + 
0.1 
+ t 
0.0 
0 I 2 3 4 5 


e Hypergeometric N = 50, n = 5, K= 25 
+ Binomial n = 5, p = 0.5 





|o o HM. [2 B a J5 
Hypergeometric probability | 0.025 0.149 0.326 0.326 0.149 0.025 
Binomial probability 0.031 0.156 0.321 0.312 0.156 0.031 
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the probability that more than 45 of the sampled customers have purchased from the corpora- 
tion in the last three months? 

The sampling is without replacement. However, because the sample size of 50 is small 
relative to the number of customer accounts, 1000, the probability of selecting a customer who 
has purchased from the corporation in the last three months remains approximately constant 
as the customers are chosen. 

For example, let 4 denote the event that the first customer selected has purchased 
from the corporation in the last three months, and let B denote the event that the second 
customer selected has purchased from the corporation in the last three months. Then, 
P(A) = 700/1000 = 0.7 and P(B|A) = 699/999 = 0.6997. That is, the trials are approxi- 
mately independent. 

Let X denote the number of customers in the sample who have purchased from the cor- 
poration in the last three months. Then, X is a hypergeometric random variable with N — 
1000, n = 50, and K = 700. Consequently, p = K/N = 0.7. The requested probability is 
P(X > 45). Because the sample size is small relative to the batch size, the distribution of X 
can be approximated as binomial with n = 50 and p = 0.7. Using the binomial approximation 
to the distribution of X results in 


50 

50 

P(X»45)- > Jora — 0.7y*^* = 0.00017 
x-46 NX 


The probability from the hypergeometric distribution 1s 0.000166, but this requires computer 


software. The result agrees well with the binomial approximation. 


EXERCISES FOR SECTION 3-8 


3-86. Suppose X has a hypergeometric distribution with 
N = 100, n = 4, and K = 20. Determine the following: 

(a P(X-1) (b) P(X = 6) 

(c) P(X = 4) (d) Determine the mean and variance of X. 
3-87. Suppose X has a hypergeometric distribution with 
N = 20, n = 4, and K = 4. Determine the following: 

(a) (X 2 1) (b) P(X — 4) 

(c) P(X - 2) (d) Determine the mean and variance of X. 
3-88. Suppose X has a hypergeometric distribution with 
N = 10,n = 3, and K = 4. Sketch the probability mass func- 
tion of X. 


3-89. Determine the cumulative distribution function for X 
in Exercise 3-88. 


3-90. Alot of 75 washers contains 5 in which the variability 

in thickness around the circumference of the washer 1s unac- 

ceptable. A sample of 10 washers is selected at random, 

without replacement. 

(a) What is the probability that none of the unacceptable 
washers is in the sample? 

(b) What is the probability that at least one unacceptable 
washer is in the sample? 

(c) What is the probability that exactly one unacceptable 
washer is in the sample? 


(d) What is the mean number of unacceptable washers in the 
sample? 

3-91. A company employs 800 men under the age of 55. 

Suppose that 30% carry a marker on the male chromosome 

that indicates an increased risk for high blood pressure. 

(a) If 10 men in the company are tested for the marker in this 
chromosome, what is the probability that exactly 1 man 
has the marker? 

(b) If 10 men in the company are tested for the marker in this 
chromosome, what is the probability that more than 1 has 
the marker? 


3-92. Printed circuit cards are placed in a functional test 

after being populated with semiconductor chips. A lot contains 

140 cards, and 20 are selected without replacement for func- 

tional testing. 

(a) If 20 cards are defective, what is the probability that at 
least 1 defective card is in the sample? 

(b) If 5 cards are defective, what is the probability that at least 
1 defective card appears in the sample? 

3-93. Magnetic tape is slit into half-inch widths that are 

wound into cartridges. A slitter assembly contains 48 blades. 

Five blades are selected at random and evaluated each day for 


sharpness. If any dull blade is found, the assembly is replaced 

with a newly sharpened set of blades. 

(a) If 10 of the blades in an assembly are dull, what is the 
probability that the assembly is replaced the first day it 1s 
evaluated? 

(b) If 10 of the blades in an assembly are dull, what is the 
probability that the assembly is not replaced until the third 
day of evaluation? [Hint: Assume the daily decisions are 
independent, and use the geometric distribution. | 

(c) Suppose on the first day of evaluation, two of the blades 
are dull, on the second day of evaluation six are dull, and 
on the third day of evaluation, ten are dull. What is the 
probability that the assembly is not replaced until the third 
day of evaluation? [Hint: Assume the daily decisions are 
independent. However, the probability of replacement 
changes every day.] 


3-94. A state runs a lottery in which 6 numbers are ran- 

domly selected from 40, without replacement. A player 

chooses 6 numbers before the state's sample is selected. 

(a) What is the probability that the 6 numbers chosen by a 
player match all 6 numbers in the state's sample? 

(b) What is the probability that 5 of the 6 numbers chosen by 
a player appear in the state's sample? 


3-9 POISSON DISTRIBUTION 


3.9 POISSON DISTRIBUTION 89 


(c) What is the probability that 4 of the 6 numbers chosen by 
a player appear in the state's sample? 

(d) If a player enters one lottery each week, what is the 
expected number of weeks until a player matches all 6 
numbers in the state's sample? 


3-95. Continuation of Exercises 3-86 and 3-87. 

(a) Calculate the finite population corrections for Exercises 
3-86 and 3-87. For which exercise should the binomial 
approximation to the distribution of X be better? 

(b) For Exercise 3-86, calculate P(X — 1) and P(X — 4) as- 
suming that X has a binomial distribution and compare 
these results to results derived from the hypergeometric 
distribution. 

(c) For Exercise 3-87, calculate P(X — 1) and P(X — 4) 
assuming that X has a binomial distribution and compare 
these results to the results derived from the hypergeometric 
distribution. 


3-96. Use the binomial approximation to the hypergeo- 
metric distribution to approximate the probabilities in 
Exercise 3-92. What 1s the finite population correction in this 
exercise? 


We introduce the Poisson distribution with an example. 


EXAMPLE 3-30 


Consider the transmission of n bits over a digital communication channel. Let the random 


variable X equal the number of bits in error. When the probability that a bit is in error 1s con- 
stant and the transmissions are independent, X has a binomial distribution. Let p denote the 
probability that a bit is in error. Let A = pn. Then, E(x) = pn = A and 


vere Qe 


Now, suppose that the number of bits transmitted increases and the probability of an error 
decreases exactly enough that pn remains equal to a constant. That is, n increases and p de- 
creases accordingly, such that E(X) = A remains constant. Then, with some work, it can be 


shown that 


liii, uS PX 


e ^X* 
x) = "EE 





x = 0,1, 2,... 


Also, because the number of bits transmitted tends to infinity, the number of errors can equal 
any nonnegative integer. Therefore, the range of X is the integers from zero to infinity. 


The distribution obtained as the limit in the above example is more useful than the deri- 
vation above implies. The following example illustrates the broader applicability. 
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EXAMPLE 3-31 


Definition 


Flaws occur at random along the length of a thin copper wire. Let X denote the random vari- 
able that counts the number of flaws in a length of L millimeters of wire and suppose that the 
average number of flaws in L millimeters is A. 

The probability distribution of X can be found by reasoning in a manner similar to the pre- 
vious example. Partition the length of wire into n subintervals of small length, say, 1 microm- 
eter each. If the subinterval chosen is small enough, the probability that more than one flaw 
occurs in the subinterval 1s negligible. Furthermore, we can interpret the assumption that 
flaws occur at random to imply that every subinterval has the same probability of containing 
a flaw, say, p. Finally, if we assume that the probability that a subinterval contains a flaw is in- 
dependent of other subintervals, we can model the distribution of X as approximately a bino- 
mial random variable. Because 


E(X) =A = np 
we obtain 
p ^ A/n 


That is, the probability that a subinterval contains a flaw is A/n. With small enough subinter- 
vals, n 1s very large and p is very small. Therefore, the distribution of X is obtained as in the 
previous example. 


Example 3-31 can be generalized to include a broad array of random experiments. The 
interval that was partitioned was a length of wire. However, the same reasoning can be 
applied to any interval, including an interval of time, an area, or a volume. For example, 
counts of (1) particles of contamination in semiconductor manufacturing, (2) flaws in rolls 
of textiles, (3) calls to a telephone exchange, (4) power outages, and (5) atomic particles 
emitted from a specimen have all been successfully modeled by the probability mass func- 
tion in the following definition. 


Given an interval of real numbers, assume counts occur at random throughout the in- 
terval. If the interval can be partitioned into subintervals of small enough length such 
that 


(1) the probability of more than one count in a subinterval is zero, 


(2) the probability of one count in a subinterval is the same for all subintervals 
and proportional to the length of the subinterval, and 


(3) the count in each subinterval is independent of other subintervals, the ran- 
dom experiment is called a Poisson process. 


The random variable X that equals the number of counts in the interval is a Poisson 
random variable with parameter 0 < A, and the probability mass function of X is 


ESAE 
A 
fe) =—— _x=0,1,2... (3-15) 
5 
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Figure 3-14 Poisson distributions for selected values of the parameters. 


Historically, the term process has been used to suggest the observation of a system over 
time. In our example with the copper wire, we showed that the Poisson distribution could 
also apply to intervals such as lengths. Figure 3-14 provides graphs of selected Poisson 


distributions. 
It is important to use consistent units in the calculation of probabilities, means, and vari- 


ances involving Poisson random variables. The following example illustrates unit conversions. 
For example, if the 

average number of flaws per millimeter of wire is 3.4, then the 

average number of flaws in 10 millimeters of wire is 34, and the 

average number of flaws in 100 millimeters of wire is 340. 


If a Poisson random variable represents the number of counts in some interval, the mean of the 
random variable must equal the expected number of counts in the same length of interval. 
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EXAMPLE 3-32 For the case of the thin copper wire, suppose that the number of flaws follows a Poisson dis- 
tribution with a mean of 2.3 flaws per millimeter. Determine the probability of exactly 2 flaws 
in ] millimeter of wire. 

Let X denote the number of flaws in 1 millimeter of wire. Then, E(X) — 2.3 flaws and 


—23 2 
Po) =" = 0.965 
2! 


Determine the probability of 10 flaws in 5 millimeters of wire. Let X denote the number 
of flaws in 5 millimeters of wire. Then, X has a Poisson distribution with 


E(X) = 5 mm X 2.3 flaws/mm = 11.5 flaws 
Therefore, 


115 
=g l 


P(X = 10) T 


= 0.113 


Determine the probability of at least 1 flaw in 2 millimeters of wire. Let X denote the 
number of flaws in 2 millimeters of wire. Then, X has a Poisson distribution with 


E(X) = 2 mm X 2.3 flaws/mm = 4.6 flaws 
Therefore, 


P(Xz1)21—-2(X-20)—-1-— e ** = 0.9899 


EXAMPLE 3-33 Contamination is a problem in the manufacture of optical storage disks. The number of particles 
of contamination that occur on an optical disk has a Poisson distribution, and the average number 
of particles per centimeter squared of media surface is 0.1. The area of a disk under study is 100 
squared centimeters. Find the probability that 12 particles occur in the area of a disk under study. 
Let X denote the number of particles in the area of a disk under study. Because the mean 
number of particles is 0.1 particles per cm? 


E(X) = 100 cm? X 0.1 particles/em* = 10 particles 


Therefore, 


-—]10 12 
P(X = 12) = 2 = 9095 
12! 


The probability that zero particles occur in the area of the disk under study is 
PX=0)=e" =454 x10" 


Determine the probability that 12 or fewer particles occur in the area of the disk under 
study. The probability is 


12 e 910! 
P(X = 12) = P(X = 0) + P(X = 1) +-+ + P(X = 12) = Y, — 
i=0 E 
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Because this sum is tedious to compute, many computer programs calculate cumulative 
Poisson probabilities. From one such program, P(X < 12) = 0.791. 


The derivation of the mean and variance of a Poisson random variable is left as an exer- 
cise. The results are as follows. 


If X is a Poisson random variable with parameter A, then 





The mean and variance of a Poisson random variable are equal. For example, if particle counts 
follow a Poisson distribution with a mean of 25 particles per square centimeter, the variance 
is also 25 and the standard deviation of the counts is 5 per square centimeter. Consequently, 
information on the variability 1s very easily obtained. Conversely, if the variance of count data 
is much greater than the mean of the same data, the Poisson distribution is not a good model 


for the distribution of the random variable. 


EXERCISES FOR SECTION 3-9 


3-97. Suppose X has a Poisson distribution with a mean of 
4. Determine the following probabilities: 

(a P(X-0) (b) P(X S 2) 

(c) P(X —-4) (d) P(X-8) 

3-98. Suppose X has a Poisson distribution with a mean of 
0.4. Determine the following probabilities: 


(a P(X-20) (b) P(X S 2) 
(c) P(X —-4) (d) P(X = 8) 
3-99. Suppose that the number of customers that enter 


a bank in an hour is a Poisson random variable, and sup- 
pose that P(X = 0) 0.05. Determine the mean and 
variance of X. 


3-100. The number of telephone calls that arrive at a phone 

exchange is often modeled as a Poisson random variable. 

Assume that on the average there are 10 calls per hour. 

(a) What is the probability that there are exactly 5 calls in one 
hour? 

(b) What is the probability that there are 3 or less calls in one 
hour? 

(c) What is the probability that there are exactly 15 calls in 
two hours? 

(d) What is the probability that there are exactly 5 calls in 
30 minutes? 


3-101. The number of flaws in bolts of cloth in textile man- 

ufacturing is assumed to be Poisson distributed with a mean of 

0.1 flaw per square meter. 

(a) What is the probability that there are two flaws in 1 square 
meter of cloth? 

(b) What is the probability that there is one flaw in 10 square 
meters of cloth? 


(c) What is the probability that there are no flaws in 20 square 
meters of cloth? 

(d) What is the probability that there are at least two flaws in 
10 square meters of cloth? 


3-102. When a computer disk manufacturer tests a disk, it 
writes to the disk and then tests it using a certifier. The certi- 
fier counts the number of missing pulses or errors. The num- 
ber of errors on a test area on a disk has a Poisson distribution 
with A = 0.2. 

(a) What is the expected number of errors per test area? 

(b) What percentage of test areas have two or fewer errors? 


3-103. The number of cracks in a section of interstate high- 
way that are significant enough to require repair is assumed 
to follow a Poisson distribution with a mean of two cracks 
per mile. 

(a) What is the probability that there are no cracks that require 
repair in 5 miles of highway? 

(b) What is the probability that at least one crack requires 
repair in 1/2 mile of highway? 

(c) If the number of cracks is related to the vehicle load on 
the highway and some sections of the highway have a 
heavy load of vehicles whereas other sections carry 
a light load, how do you feel about the assumption of a 
Poisson distribution for the number of cracks that 
require repair? 

3-104. The number of failures for a cytogenics machine 

from contamination in biological samples is a Poisson random 

variable with a mean of 0.01 per 100 samples. 

(a) If the lab usually processes 500 samples per day, what is 
the expected number of failures per day? 
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(b) What is the probability that the machine will not fail dur- 
ing a study that includes 500 participants? (Assume one 
sample per participant.) 

3-105. The number of surface flaws in plastic panels used 

in the interior of automobiles has a Poisson distribution with 

a mean of 0.05 flaw per square foot of plastic panel. Assume 

an automobile interior contains 10 square feet of plastic 

panel. 

(a) What is the probability that there are no surface flaws in 
an auto’s interior? 

(b) If 10 cars are sold to a rental company, what is the proba- 
bility that none of the 10 cars has any surface flaws? 

(c) If 10 cars are sold to a rental company, what is the proba- 
bility that at most one car has any surface flaws? 


3-106. The number of failures of a testing instrument from 

contamination particles on the product is a Poisson random 

variable with a mean of 0.02 failure per hour. 

(a) What is the probability that the instrument does not fail in 
an 8-hour shift? 

(b) What is the probability of at least one failure in a 24-hour 
day? 


Supplemental Exercises 


3-107. A shipment of chemicals arrives in 15 totes. Three of 
the totes are selected at random, without replacement, for an 
inspection of purity. If two of the totes do not conform to 
purity requirements, what is the probability that at least one of 
the nonconforming totes is selected in the sample? 


3-108. The probability that your call to a service line is an- 

swered in less than 30 seconds is 0.75. Assume that your calls 

are independent. 

(a) If you call 10 times, what is the probability that exactly 9 
of your calls are answered within 30 seconds? 

(b) If you call 20 times, what is the probability that at least 16 
calls are answered in less than 30 seconds? 

(c) If you call 20 times, what is the mean number of calls that 
are answered in less than 30 seconds? 


3-109. Continuation of Exercise 3-108. 

(a) What is the probability that you must call four times to 
obtain the first answer in less than 30 seconds? 

(b) What is the mean number of calls until you are answered 
in less than 30 seconds? 


3-110. Continuation of Exercise 3-109. 

(a) What is the probability that you must call six times in 
order for two of your calls to be answered in less than 30 
seconds? 

(b) What is the mean number of calls to obtain two answers in 
less than 30 seconds? 


3-111. The number of messages sent to a computer bulletin 

board is a Poisson random variable with a mean of 5 messages 

per hour. 

(a) What is the probability that 5 messages are received in 
1 hour? 


(b) What is the probability that 10 messages are received in 
1.5 hours? 

(c) What is the probability that less than two messages are 
received in one-half hour? 


3-112. A Web site is operated by four identical computer 
servers. Only one is used to operate the site; the others are 
spares that can be activated in case the active server fails. The 
probability that a request to the Web site generates a failure in 
the active server is 0.0001. Assume that each request is an in- 
dependent trial. What is the mean time until failure of all four 
computers? 


3-113. The number of errors in a textbook follow a Poisson 
distribution with a mean of 0.01 error per page. What is the 
probability that there are three or less errors in 100 pages? 


3-114. The probability that an individual recovers from an 
illness in a one-week time period without treatment is 0.1. 
Suppose that 20 independent individuals suffering from this 
illness are treated with a drug and 4 recover in a one-week 
time period. If the drug has no effect, what 1s the probability 
that 4 or more people recover in a one-week time period? 


3-115. Patient response to a generic drug to control pain is 
scored on a 5-point scale, where a 5 indicates complete relief. 
Historically the distribution of scores 1s 


l 2 3 4 J 
0.05 0.1 0.2 0.25 0.4 


Two patients, assumed to be independent, are each scored. 
(a) What is the probability mass function of the total score? 
(b) What is the probability mass function of the average score? 


3-116. In a manufacturing process that laminates several 

ceramic layers, 19o of the assemblies are defective. Assume 

that the assemblies are independent. 

(a) What is the mean number of assemblies that need to be 
checked to obtain five defective assemblies? 

(b) What is the standard deviation of the number of assemblies 
that need to be checked to obtain five defective assemblies? 


3-117. Continuation of Exercise 3-116. Determine the mini- 
mum number of assemblies that need to be checked so that the 
probability of at least one defective assembly exceeds 0.95. 


3-118. Determine the constant c so that the following func- 

tion is a probability mass function: f(x) = cx forx = 1, 2, 3, 4. 

3-119. A manufacturer of a consumer electronics product ex- 

pects 2% of units to fail during the warranty period. A sample of 

500 independent units is tracked for warranty performance. 

(a) What is the probability that none fails during the warranty 
period? 

(b) What is the expected number of failures during the 
warranty period? 

(c) What is the probability that more than two units fail 
during the warranty period? 


3-120. Messages that arrive at a service center for an infor- 
mation systems manufacturer have been classified on the basis 


of the number of keywords (used to help route messages) and 
the type of message, either email or voice. Also, 70% of the 
messages arrive via email and the rest are voice. 


number of keywords 0 l 2 3 4 
email 0.1 0.1 0.2 0.4 0.2 
voice 0.3 0.4 0.2 0.1 0 


Determine the probability mass function of the number of 
keywords in a message. 


3-121. The random variable X has the following probability 
distribution: 

x 2 3 5 8 
probability 0.2 0.4 0.3 0.1 


Determine the following: 
(a) P(X S 3) 
PTAS) 
(e) V(X) 

3-122. Determine the probability mass function for the ran- 
dom variable with the following cumulative distribution 
function: 


(b) P(X > 2.5) 
(d) E(X) 


0 xe 2 
0.2 25 y «37 

F(x) = 40.5 5.7] x x « 6.5 
0.8 6.5 5 x « 85 
1 8.5 = X 


3-123. Each main bearing cap in an engine contains four 

bolts. The bolts are selected at random, without replacement, 

from a parts bin that contains 30 bolts from one supplier and 

70 bolts from another. 

(a) What is the probability that a main bearing cap contains 
all bolts from the same supplier? 

(b) What is the probability that exactly three bolts are from 
the same supplier? 


3-124. Assume the number of errors along a magnetic 
recording surface is a Poisson random variable with a mean of 
one error every 10? bits. A sector of data consists of 4096 
eight-bit bytes. 

(a) What is the probability of more than one error in a sector? 
(b) What is the mean number of sectors until an error is found? 


3-9 POISSON DISTRIBUTION 95 


3-125. An installation technician for a specialized commu- 
nication system is dispatched to a city only when three or 
more orders have been placed. Suppose orders follow a 
Poisson distribution with a mean of 0.25 per week for a city 
with a population of 100,000 and suppose your city contains a 
population of 800,000. 

(a) What is the probability that a technician 1s required after a 
one-week period? 

(b) If you are the first one in the city to place an order, what is 
the probability that you have to wait more than two weeks 
from the time you place your order until a technician is 
dispatched? 

3-126. From 500 customers, a major appliance manufac- 
turer will randomly select a sample without replacement. The 
company estimates that 2596 of the customers will provide 
useful data. If this estimate is correct, what is the probability 
mass function of the number of customers that will provide 
useful data? 

(a) Assume that the company samples 5 customers. 

(b) Assume that the company samples 10 customers. 


3-127. It is suspected that some of the totes containing 
chemicals purchased from a supplier exceed the moisture con- 
tent target. Samples from 30 totes are to be tested for moisture 
content. Assume that the totes are independent. Determine the 
proportion of totes from the supplier that must exceed the 
moisture content target so that the probability is 0.90 that at 
least one tote in the sample of 30 fails the test. 


3-128. Messages arrive to a computer server according 
to a Poisson distribution with a mean rate of 10 per 
hour. Determine the length of an interval of time such that 
the probability that no messages arrive during this interval 
is 0.90. 


3-129. Flaws occur in the interior of plastic used for auto- 

mobiles according to a Poisson distribution with a mean of 

0.02 flaw per panel. 

(a) If 50 panels are inspected, what is the probability that 
there are no flaws? 

(b) What is the expected number of panels that need to be 
inspected before a flaw is found? 

(c) If 50 panels are inspected, what is the probability that the 
number of panels that have one or more flaws 1s less than 
or equal to 2? 
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MIND-EXPANDING EXERCISES 


3-130. Derive the mean and variance of a hypergeo- 
metric random variable (difficult exercise). 


3-131. Show that the function f(x) in Example 3-5 
satisfies the properties of a probability mass function by 
summing the infinite series. 


3-132. Derive the formula for the mean and standard 
deviation of a discrete uniform random variable over the 
range of integers deed F l.. b: 


3-133. A company performs inspection on shipments 
from suppliers in order to defect nonconforming prod- 
ucts. Assume a lot contains 1000 items and 1% are 
nonconforming. What sample size is needed so that the 
probability of choosing at least one nonconforming item 
in the sample is at least 0.90? Assume the binomial 
approximation to the hypergeometric distribution is 
adequate. 


3-134. A company performs inspection on shipments 
from suppliers in order to detect nonconforming prod- 
ucts. The company’s policy is to use a sample size that is 
always 10% of the lot size. Comment on the effective- 
ness of this policy as a general rule for all sizes of lots. 


IMPORTANT TERMS AND CONCEPTS 


In the E-book, click on any 
term or concept below to 


go to that subject. 
Bernoulli trial 
Binomial distribution 
Cumulative probability 
distribution function- 
discrete random 
variable 
Discrete uniform distri- 
bution 


Expected value of a 


function of a random 


variable 
Finite population 
correction factor 
Geometric distribution 
Hypergeometric distri- 
bution 
Lack of memory 
property-discrete 
random variable 


3-135. Surface flaws in automobile exterior panels 
follow a Poisson distribution with a mean of 0.1 flaw per 
panel. If 100 panels are checked, what is the probability 
that fewer than five panels have any flaws? 

3-136. A large bakery can produce rolls in lots of ei- 
ther 0, 1000, 2000, or 3000 per day. The production cost 
per item is $0.10. The demand varies randomly accord- 
ing to the following distribution: 


demand for rolls 0 1000 2000 3000 
probability of demand 0.3 02 OOTO- 


Every roll for which there is a demand is sold for $0.30. 
Every roll for which there is no demand is sold in a sec- 
ondary market for $0.05. How many rolls should the 
bakery produce each day to maximize the mean profit? 
3-137. A manufacturer stocks components obtained 
from a supplier. Suppose that 2% of the components are 
defective and that the defective components occur inde- 
pendently. How many components must the manufacturer 
have in stock so that the probability that 100 orders can be 
completed without reordering components is at least 0.95? 


Probability mass 
function 

Standard deviation- 
discrete random 


Mean-discrete random 
variable 

Mean-function of a 
discrete random 


variable variable 
Negative binomial Variance-discrete 
distribution random variable 


Poisson distribution 

Poisson process 

Probability distribution- 
discrete random 
variable 


Continuous Random 


Variables and 
Probability 


Distributions 
CHAPTER OUTLINE 
4-1 CONTINUOUS RANDOM 4-7 NORMAL APPROXIMATION TO 
VARIABLES THE BINOMIAL AND POISSON 
4-2 PROBABILITY DISTRIBUTIONS DISTRIBUTIONS 
AND PROBABILITY DENSITY 4-8 CONTINUITY CORRECTION TO 
FUNCTIONS IMPROVE THE APPROXIMATION 
4-3 CUMULATIVE DISTRIBUTION (CD ONLY) 
FUNCTIONS 4-9 EXPONENTIAL DISTRIBUTION 
4-4 MEAN AND VARIANCE OF A 4-10 ERLANG AND GAMMA 
CONTINUOUS RANDOM DISTRIBUTIONS 
VARIABLE 4-10.1 Erlang Distribution 
4-5 Sache ie ee 4-10.2 Gamma Distribution 


4-6 NORMAL DISTRIBUTION 4-11 WEIBULL DISTRIBUTION 
4-12 LOGNORMAL DISTRIBUTION 


LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 
1. Determine probabilities from probability density functions. 


2. Determine probabilities from cumulative distribution functions and cumulative distribution func- 
tions from probability density functions, and the reverse. 


3. Calculate means and variances for continuous random variables. 


4 


. Understand the assumptions for each of the continuous probability distributions presented. 


5. Select an appropriate continuous probability distribution to calculate probabilities in specific 
applications. 


6. Calculate probabilities, determine means and variances for each of the continuous probability 
distributions presented. 


1. Standardize normal random variables. 
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8. Use the table for the cumulative distribution function of a standard normal distribution to calcu- 
late probabilities. 


9. Approximate probabilities for some binomial and Poisson distributions. 


CD MATERIAL 


10. Use continuity corrections to improve the normal approximation to those binomial and Poisson 
distributions. 


Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e-Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e-Text immediately following the section they accompany. 


4-1 CONTINUOUS RANDOM VARIABLES 


Previously, we discussed the measurement of the current in a thin copper wire. We noted that 
the results might differ slightly in day-to-day replications because of small variations in vari- 
ables that are not controlled in our experiment—changes in ambient temperatures, small im- 
purities in the chemical composition of the wire, current source drifts, and so forth. 

Another example is the selection of one part from a day's production and very accurately 
measuring a dimensional length. In practice, there can be small variations in the actual 
measured lengths due to many causes, such as vibrations, temperature fluctuations, operator 
differences, calibrations, cutting tool wear, bearing wear, and raw material changes. Even the 
measurement procedure can produce variations in the final results. 

In these types of experiments, the measurement of interest— current in a copper wire ex- 
periment, length of a machined part— can be represented by a random variable. It is reason- 
able to model the range of possible values of the random variable by an interval (finite or 
infinite) of real numbers. For example, for the length of a machined part, our model enables 
the measurement from the experiment to result in any value within an interval of real numbers. 
Because the range is any value in an interval, the model provides for any precision in length 
measurements. However, because the number of possible values of the random variable X is 
uncountably infinite, X has a distinctly different distribution from the discrete random vari- 
ables studied previously. The range of X includes all values in an interval of real numbers; that 
is, the range of X can be thought of as a continuum. 

A number of continuous distributions frequently arise in applications. These distributions 
are described, and example computations of probabilities, means, and variances are provided 
in the remaining sections of this chapter. 


4-2 PROBABILITY DISTRIBUTIONS AND PROBABILITY 
DENSITY FUNCTIONS 


Density functions are commonly used in engineering to describe physical systems. For exam- 
ple, consider the density of a loading on a long, thin beam as shown in Fig. 4-1. For any point 
x along the beam, the density can be described by a function (in grams/cm). Intervals with 
large loadings correspond to large values for the function. The total loading between points a 
and b is determined as the integral of the density function from a to b. This integral is the area 


Definition 
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f (x) 
o0 
= P(a « X <b) 
xe) 
3 V o 
x a b x 
Figure 4-1 Density Figure 4-2 Probability determined from the area 


function of a loading on a under f(x). 
long, thin beam. 


under the density function over this interval, and it can be loosely interpreted as the sum of all 
the loadings over this interval. 

similarly, a probability density function f(x) can be used to describe the probability dis- 
tribution of a continuous random variable X. If an interval is likely to contain a value for X, 
its probability is large and it corresponds to large values for f(x). The probability that X is be- 
tween a and b is determined as the integral of f(x) from a to b. See Fig. 4-2. 


For a continuous random variable X, a probability density function is a function 
such that 


a)  f(x)z0 


f(x) dx = area under f(x) from a to b 


for any a and b 





A probability density function provides a simple description of the probabilities associ- 
ated with a random variable. As long as f(x) is nonnegative and f% f(x) dx = 1, 
0 < P(a « X <b) X 1 sothatthe probabilities are properly restricted. A probability density 
function is zero for x values that cannot occur and it is assumed to be zero wherever it is not 
specifically defined. 

A histogram is an approximation to a probability density function. See Fig. 4-3. For each 
interval of the histogram, the area of the bar equals the relative frequency (proportion) of the 
measurements in the interval. The relative frequency is an estimate of the probability that a 
measurement falls in the interval. Similarly, the area under f(x) over any interval equals the 
true probability that a measurement falls in the interval. 

The important point is that f(x) is used to calculate an area that represents the prob- 
ability that X assumes a value in [a, 5]. For the current measurement example, the proba- 
bility that X results in [14 mA, 15 mA] is the integral of the probability density function of 
X over this interval. The probability that X results in [14.5 mA, 14.6 mA] 1s the integral of 
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EXAMPLE 4-1 


f x) 
f x) 


x 0 10 20 x 


Figure 4-3 Histogram approximates a probability density Figure 4-4 Probability density 
function. function for Example 4-1. 


the same function, f(x), over the smaller interval. By appropriate choice of the shape of f(x), 
we can represent the probabilities associated with any continuous random variable X. The 
shape of f(x) determines how the probability that X assumes a value in [14.5 mA, 14.6 mA] 
compares to the probability of any other interval of equal or different length. 

For the density function of a loading on a long thin beam, because every point has zero 
width, the loading at any point is zero. Similarly, for a continuous random variable X and any 
value x. 


P(X=x)=0 


Based on this result, it might appear that our model of a continuous random variable is use- 
less. However, in practice, when a particular current measurement is observed, such as 14.47 
milliamperes, this result can be interpreted as the rounded value of a current measurement that 
is actually in a range such as 14.465 = x < 14.475. Therefore, the probability that the 
rounded value 14.47 is observed as the value for X is the probability that X assumes a value in 
the interval [14.465, 14.475], which is not zero. Similarly, because each point has zero 
probability, one need not distinguish between inequalities such as < or S for continuous 
random variables. 


If X is a continuous random variable, for any x, and x, 


P(x, SX Sx) = Py, < XS x) = Pai SX <M) = Play << X< xy) (42) 





Let the continuous random variable X denote the current measured in a thin copper wire in 
milliamperes. Assume that the range of X is [0, 20 mA], and assume that the probability den- 
sity function of X is f(x) = 0.05 for 0 < x = 20. What is the probability that a current meas- 
urement is less than 10 milliamperes? 

The probability density function is shown in Fig. 4-4. It is assumed that f(x) = 0 wherever 
it is not specifically defined. The probability requested is indicated by the shaded area in Fig. 4-4. 


10 10 
P(X < 10) = |o dx — | 0.05 dx — 0.5 


0 0 


EXAMPLE 4-2 
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f(x) 


12.5 12.6 x 


Figure 4-5 Probability density function for 
Example 4-2. 


As another example, 


20 
P(5 < X < 20) = [æ dx = 0.75 


5 


Let the continuous random variable X denote the diameter of a hole drilled in a sheet metal 
component. The target diameter is 12.5 millimeters. Most random disturbances to the process 
result in larger diameters. Historical data show that the distribution of X can be modeled by a 
probability density function f(x) = 20e ?9€- 125 x = 12.5. 

If a part with a diameter larger than 12.60 millimeters is scrapped, what proportion of 
parts is scrapped? The density function and the requested probability are shown in Fig. 4-5. A 
part is scrapped if X > 12.60. Now, 


CO CO 
oo 








P(X > 12.60) = ES dx = | Me Oe eS eee) = 0.135 
12.6 12.6 = 
What proportion of parts is between 12.5 and 12.6 millimeters? Now, 
12.6 
12.6 
P(12.5 < X < 12.6) = | f(x) dx = —e ?*«-123! — — 0.865 
12.5 
12.5 


Because the total area under f(x) equals 1, we can also calculate P(12.5 < X < 12.6) = 
1 — P(X > 12.6) = 1 — 0.135 = 0.865. 


EXERCISES FOR SECTION 4-2 


4-1. Suppose that f(x) = e * for 0 < x. Determine the fol- 4-3. Suppose that f(x) = x/8 for 3 < x < 5. Determine the 


lowing probabilities: 


following probabilities: 


(a) P(1 < X) (b) P(1« X « 2.5) (a) P(X < 4) (b) P(X > 3.5) 

(c) P(X =3) (d) PX< 4) (c) P(4 < X < 5) (d) P(X < 4.5) 

(e) P(3 x X) (e) P(X < 3.5 or X > 4.5) 

4-2. Suppose that f(x) = e "for 0 < x. 4-4. Suppose that f(x) = e * ? for 4 < x. Determine the 
(a) Determine x such that P(x « X) — 0.10. following probabilities: 


(b) Determine x such that PLY = x) = 0.10. (a) P(1« X) (b P25 X<5) 
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(c) P(5 « X) (d) P(8 < X « 12) 

(e) Determine x such that P(X « x) — 0.90. 

4-5. Suppose that f(x) = 1.5x? for ^1 < x < 1. Determine 
the following probabilities: 
(a) P(0 « X) 

(c) P(-05 € X < 0.5) 
(e) P(X < 0 or X > —0.5) 
(f) Determine x such that P(x < X) = 0.05. 

4-6. The probability density function of the time to failure 


of an electronic component in a copier (in hours) is f(x) = 
—x/1000 


(b) P(0.5 « X) 
(d) P(X < —2) 


e 
1000 
(a) A component lasts more than 3000 hours before failure. 
(b) A component fails in the interval from 1000 to 2000 hours. 

(c) A component fails before 1000 hours. 
(d) Determine the number of hours at which 10% of all com- 
ponents have failed. 


for x > 0. Determine the probability that 


4-7. The probability density function of the net weight in 

pounds of a packaged chemical herbicide is f(x) = 2.0 for 

49.75 « x « 50.25 pounds. 

(a) Determine the probability that a package weighs more 
than 50 pounds. 


(b) How much chemical is contained in 90% of all packages? 


4-8. The probability density function of the length of a 
hinge for fastening a door is f(x) = 1.25 for 74.6 < x < 75.4 
millimeters. Determine the following: 

(a) P(X « 74.8) 

(b) P(X < 74.8 or X > 75.2) 

(c) If the specifications for this process are from 74.7 
to 75.3 millimeters, what proportion of hinges meets 
specifications? 

4-9. The probability density function of the length of a 

metal rod is f(x) = 2 for 2.3 <x < 2.8 meters. 

(a) If the specifications for this process are from 2.25 to 2.75 
meters, what proportion of the bars fail to meet the speci- 
fications? 

(b) Assume that the probability density function is f(x) = 2 
for an interval of length 0.5 meters. Over what value 
should the density be centered to achieve the greatest pro- 
portion of bars within specifications? 


4-10. IfXis a continuous random variable, argue that P(x, = 
X x x) = P(x, €« X € x) = P(x, € X € x) = P(x, € X € x). 


4-3 CUMULATIVE DISTRIBUTION FUNCTIONS 


An alternative method to describe the distribution of a discrete random variable can also be 
used for continuous random variables. 


Definition 


The cumulative distribution function of a continuous random variable X is 


for —o» « x « ©, 





Extending the definition of f(x) to the entire real line enables us to define the cumulative dis- 
tribution function for all real numbers. The following example illustrates the definition. 


EXAMPLE 4-3 


For the copper current measurement in Example 4-1, the cumulative distribution function of 


the random variable X consists of three expressions. If x < 0, f(x) = 0. Therefore, 


F(x) = 0, 


for x <0 
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and 
F(x) = EZ du = 0.05x, for 0x « 20 
0 
Finally, 
F(x) = EC du=1, for 20x 
0 
Therefore, 
0 x: «0 
F(x) = §0.05x | 0zx«20 
l 20 5 


The plot of F(x) is shown in Fig. 4-6. 


Notice that in the definition of F(x) any < can be changed to < and vice versa. That is, 
F(x) can be defined as either 0.05x or 0 at the end-point x = 0, and F(x) can be defined as 
either 0.05x or 1 at the end-point x = 20. In other words, F(x) is a continuous function. For a 
discrete random variable, F(x) is not a continuous function. Sometimes, a continuous random 
variable is defined as one that has a continuous cumulative distribution function. 


EXAMPLE 4-4 For the drilling operation in Example 4-2, F(x) consists of two expressions. 
F(x) 2» 0 for x< 12.5 


and for 12.5 < x 


F(x) = | 20-2 du 
2 
=) = e 20(e— 12.5) 


Therefore, 


0 x= 125 
= e 20(x— 12.5) 12.5 x 


Figure 4-7 displays a graph of F(x). 


F(x) F(x) 
1 1 
0 20 x 0 12.5 x 
Figure 4-6 Cumulative distribution Figure 4-7 Cumulative distribution 


function for Example 4-3. function for Example 4-4. 
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The probability density function of a continuous random variable can be determined from 
the cumulative distribution function by differentiating. Recall that the fundamental theorem of 
calculus states that 





dx 
Then, given F(x) 
dF(x) 
fx) = = 
as long as the derivative exists. 
EXAMPLE 4-5 The time until a chemical reaction is complete (in milliseconds) is approximated by the 
cumulative distribution function 
Flx) = 0 xs 0 
(x) = f= e700 øz 


Determine the probability density function of X. What proportion of reactions is complete 
within 200 milliseconds? Using the result that the probability density function is the deriva- 
tive of the F(x), we obtain 


0 x«0 
f(x) me HN 0zx 


The probability that a reaction completes within 200 milliseconds is 


P(X < 200) = F(200) = 1 — e^? = 0.8647. 


EXERCISES FOR SECTION 4-3 


4-11. Suppose the cumulative distribution function of the Determine the following: 
random variable X is (a P(X-«1.8) (b) P(X > —1.5) 
(c) PX< —2) QD Pela X= 1) 
0 xc D 4-15. Determine the cumulative distribution function for 
Fx) =< 02. cy 5 the distribution in Exercise 4-1. 
1 5 zx 4-14. Determine the cumulative distribution function for 


Determine the following: 


the distribution in Exercise 4-3. 


4-15. Determine the cumulative distribution function for 
the distribution in Exercise 4-4. 


(a) P(X « 2.8) (b) P(X > 1.5) l TEM l 
(c) P(X < —2) (d) P(X > 6) 4-16. Determine the cumulative distribution function for 


4-12. 


Suppose the cumulative distribution function of the 
random variable X is 


the distribution in Exercise 4-6. Use the cumulative distribu- 

tion function to determine the probability that a component 

lasts more than 3000 hours before failure. 

4-17. Determine the cumulative distribution function for 
0 y 2 the distribution in Exercise 4-8. Use the cumulative distribu- 

Fix) = “025% +05. —2=7= 2 tion function to determine the probability that a length 
1 Jay exceeds 75 millimeters. 
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Determine the probability density function for each of the fol- 4-20. 
lowing cumulative distribution functions. 


0 Se 
418. Fy)-21-e? x>0 t 


U2s3rr 05 =x] 


-19. F(x) — 
iis = Vosr4025 1zx«is 
0 x<0 l L5 EX 
0.2 0zx«A4 
(x) = t : 4-21. The gap width is an important property of a magnetic 


0.04x + 0.64 45x<9 


recording head. In coded units, if the width is a continuous ran- 
1 Dx 


dom variable over the range from 0 < x < 2 with f(x) = 0.5x, 
determine the cumulative distribution function of the gap width. 


4-4 MEAN AND VARIANCE OF A CONTINUOUS 
RANDOM VARIABLE 


The mean and variance of a continuous random variable are defined similarly to a discrete 
random variable. Integration replaces summation in the definitions. If a probability density 
function is viewed as a loading on a beam as in Fig. 4-1, the mean is the balance point. 


Definition 


Suppose X is a continuous random variable with probability density function f(x). 
The mean or expected value of X, denoted as u or E(X), is 


The variance of X, denoted as V(X) or o7, is 


oO 


(x — uf) dx = | GTC 


= 00) 


The standard deviation of X iso = Vo. 





The equivalence of the two formulas for variance can be derived as one, as was done for dis- 
crete random variables. 


EXAMPLE 4-6 For the copper current measurement in Example 4-1, the mean of X is 


20 
= 10 
0 





20 
E(X) = ES dx = 0.05x*/2 
0 


The variance of X is 


20 


Qo = | e - 1) de = ost = 10}/3 — 33.33 


0 





0 
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Expected Value 
of a Function of 
a Continuous 
Random 
Variable 


EXAMPLE 4-7 


EXAMPLE 4-8 
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The expected value of a function A(X) of a continuous random variable is defined similarly to 
a function of a discrete random variable. 


If X is a continuous random variable with probability density function f(x), 





In Example 4-1, X is the current measured in milliamperes. What is the expected value of the 
squared current? Now, h(X) = X?. Therefore, 


20 
3 |20 


| x fox) des | 0.05x^ dx = 0.05 - 
0 


0 


— 133.33 


In the previous example, the expected value of X? does not equal E(X) squared. However, in 
the special case that A(X) = aX + b for any constants a and b, E|h(X)| = aE(X) + b. This 
can be shown from the properties of integrals. 


For the drilling operation in Example 4-2, the mean of X is 
X 20e 296— 12.5) dx 


Integration by parts can be used to show that 


oO 


e 206-12.5) 


E(X — —20(x— 12.5) pes 
(X) = —xe 20 


= 12.5 + 0.05 = 12.55 
12.5 





The variance of X is 
(x — 12.55Y^f(x) dx 


Although more difficult, integration by parts can be used two times to show that V(X) = 0.0025. 


EXERCISES FOR SECTION 4-4 


4-22. 


Suppose f(x) = 0.25 for 0 < x < 4. Determine the 


4-25. Suppose that f(x) = x/8 for 3 <x < 5. Determine 


mean and variance of X. 

4-23. Suppose f(x) = 0.125x for 0 < x < 4. Determine the 
mean and variance of X. 

4-24. Suppose f(x) = 1.5x* for —1 <x < 1. Determine 
the mean and variance of X. 


the mean and variance for x. 

4-26. Determine the mean and variance of the weight of 
packages in Exercise 4.7. 

4-27. The thickness of a conductive coating in micrometers 
has a density function of 600x ? for 100 wm < x < 120 um. 


(a) Determine the mean and variance of the coating thickness. 

(b) If the coating costs $0.50 per micrometer of thickness on 
each part, what is the average cost of the coating per 
part? 

4-28. Suppose that contamination particle size (in microm- 

eters) can be modeled as f(x) = 2x ? for 1 < x. Determine 

the mean of X. 


4-29. [Integration by parts is required. The probability den- 
sity function for the diameter of a drilled hole in millimeters is 
10e 1*5 for x > 5 mm. Although the target diameter is 5 
millimeters, vibrations, tool wear, and other nuisances pro- 
duce diameters larger than 5 millimeters. 
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(a) Determine the mean and variance of the diameter of the 
holes. 

(b) Determine the probability that a diameter exceeds 5.1 mil- 
limeters. 


4-30. Suppose the probability density function of the length 

of computer cables is f(x) = 0.1 from 1200 to 1210 millime- 

ters. 

(a) Determine the mean and standard deviation of the cable 
length. 

(b) If the length specifications are 1195 < x < 1205 
millimeters, what proportion of cables are within specifi- 
cations? 


4-5 CONTINUOUS UNIFORM DISTRIBUTION 


The simplest continuous distribution is analogous to its discrete counterpart. 


Definition 
A continuous random variable X with probability density function 


C= 5 =) 


f(x) = V(b — a), 


is a continuous uniform random variable. 





The probability density function of a continuous uniform random variable is shown in Fig. 4-8. 
The mean of the continuous uniform random variable X is 











The variance of X 1s 








These results are summarized as follows. 


If X is a continuous uniform random variable over a = x = b, 


(a T b) 
2 


(por 
12 


and o^ = V(X) = 
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EXAMPLE 4-9 


f(x) 


1 
b-a 





a b x 0 5 10 15 20 x 


Figure 4-8 Continuous uniform Figure 4-9 Probability for Example 4-9. 


probability density function. 


Let the continuous random variable X denote the current measured in a thin copper wire in 
milliamperes. Assume that the range of X is [0, 20 mA], and assume that the probability den- 
sity function of X is f(x) = 0.05,0 = x S 20. 

What is the probability that a measurement of current is between 5 and 10 milliamperes? 
The requested probability is shown as the shaded area in Fig. 4-9. 


IIS dx 
5(0.05) — 0.25 


P(5 « X « 10) 


The mean and variance formulas can be applied with a = 0 and b = 20. Therefore, 
E(X) -10mA and V(X) = 20°/12 = 3333 m^ 
Consequently, the standard deviation of X is 5.77 mA. 


The cumulative distribution function of a continuous uniform random variable 1s ob- 
tained by integration. If a < x < b, 


F(x) — | ue — a)du = x/(b — a) — a/(b — a) 


Therefore, the complete description of the cumulative distribution function of a continuous 
uniform random variable is 


0 x € d 
F(x) = $(x - ay(b—a) asx<b 
1 DEX 


An example of F(x) for a continuous uniform random variable is shown in Fig. 4-6. 


EXERCISES FOR SECTION 4-5 


4-31. Suppose X has a continuous uniform distribution over 
the interval [1.5, 5.5]. 

(a) Determine the mean, variance, and standard deviation of X. 
(b) What is P(X < 2.5)? 

4-32. Suppose X has a continuous uniform distribution over 
the interval | — 1, 1]. 


(a) Determine the mean, variance, and standard deviation of X. 

(b) Determine the value for x such that P( —x « X « x) — 0.90. 

4-33. The net weight in pounds of a packaged chemical her- 

bicide is uniform for 49.75 < x « 50.25 pounds. 

(a) Determine the mean and variance of the weight of pack- 
ages. 


(b) Determine the cumulative distribution function of the 
weight of packages. 
(c) Determine P(X < 50.1). 


4-34. The thickness of a flange on an aircraft component is 

uniformly distributed between 0.95 and 1.05 millimeters. 

(a) Determine the cumulative distribution function of flange 
thickness. 

(b) Determine the proportion of flanges that exceeds 1.02 
millimeters. 

(c) What thickness is exceeded by 90% of the flanges? 

(d) Determine the mean and variance of flange thickness. 


4-35. Suppose the time it takes a data collection operator to 

fill out an electronic form for a database is uniformly between 

1.5 and 2.2 minutes. 

(a) What is the mean and variance of the time it takes an op- 
erator to fill out the form? 

(b) What is the probability that it will take less than two min- 
utes to fill out the form? 

(c) Determine the cumulative distribution function of the time 
it takes to fill out the form. 


4-36. The probability density function of the time it takes a 
hematology cell counter to complete a test on a blood sample 
is f(x) = 0.2 for 50 < x < 75 seconds. 
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(a) What percentage of tests require more than 70 seconds to 
complete. 

(b) What percentage of tests require less than one minute to 
complete. 

(c) Determine the mean and variance of the time to complete 
a test on a sample. 


4-57. The thickness of photoresist applied to wafers in 

semiconductor manufacturing at a particular location on the 

wafer is uniformly distributed between 0.2050 and 0.2150 

micrometers. 

(a) Determine the cumulative distribution function of pho- 
toresist thickness. 

(b) Determine the proportion of wafers that exceeds 0.2125 
micrometers in photoresist thickness. 

(c) What thickness is exceeded by 1096 of the wafers? 

(d) Determine the mean and variance of photoresist thickness. 


4-38. The probability density function of the time required 

to complete an assembly operation is f(x) 0.1 for 

30 « x « 40 seconds. 

(a) Determine the proportion of assemblies that requires more 
than 35 seconds to complete. 

(b) What time is exceeded by 90% of the assemblies? 

(c) Determine the mean and variance of time of assembly. 


Undoubtedly, the most widely used model for the distribution of a random variable is a normal 
distribution. Whenever a random experiment is replicated, the random variable that equals the 
average (or total) result over the replicates tends to have a normal distribution as the number of 
replicates becomes large. De Moivre presented this fundamental result, known as the central 
limit theorem, in 1733. Unfortunately, his work was lost for some time, and Gauss independ- 
ently developed a normal distribution nearly 100 years later. Although De Moivre was later 
credited with the derivation, a normal distribution 1s also referred to as a Gaussian distribution. 

When do we average (or total) results? Almost always. For example, an automotive engi- 
neer may plan a study to average pull-off force measurements from several connectors. If we 
assume that each measurement results from a replicate of a random experiment, the normal 
distribution can be used to make approximate conclusions about this average. These conclu- 
sions are the primary topics in the subsequent chapters of this book. 

Furthermore, sometimes the central limit theorem is less obvious. For example, assume that 
the deviation (or error) in the length of a machined part 1s the sum of a large number of in- 
finitesimal effects, such as temperature and humidity drifts, vibrations, cutting angle variations, 
cutting tool wear, bearing wear, rotational speed variations, mounting and fixturing variations, 
variations in numerous raw material characteristics, and variation in levels of contamination. If 
the component errors are independent and equally likely to be positive or negative, the total error 
can be shown to have an approximate normal distribution. Furthermore, the normal distribution 
arises in the study of numerous basic physical phenomena. For example, the physicist Maxwell 
developed a normal distribution from simple assumptions regarding the velocities of molecules. 

The theoretical basis of a normal distribution 1s mentioned to justify the somewhat com- 
plex form of the probability density function. Our objective now is to calculate probabilities 
for a normal random variable. The central limit theorem will be stated more carefully later. 
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Definition 


EXAMPLE 4-10 


T0 o?-1 G2 md 





u=15 x 


Figure 4-10 Normal probability density functions for 
selected values of the parameters u and o. 


Random variables with different means and variances can be modeled by normal proba- 
bility density functions with appropriate choices of the center and width of the curve. The 
value of E(X) = p determines the center of the probability density function and the value of 
V(X) = o^ determines the width. Figure 4-10 illustrates several normal probability density 
functions with selected values of u and o?. Each has the characteristic symmetric bell-shaped 
curve, but the centers and dispersions differ. The following definition provides the formula for 
normal probability density functions. 


A random variable X with probability density function 


zo 


l 2 
20 a e lue (4-8) 


Evo 


is a normal random variable with parameters jj, where —o» < p < œ, and o > 0. 
Also, 


HOC ew mud VCO =o (4-9) 


and the notation N(w, o?) is used to denote the distribution. The mean and variance 
of X are shown to equal u and o^, respectively, at the end of this Section 5-6. 





Assume that the current measurements in a strip of wire follow a normal distribution with a 
mean of 10 milliamperes and a variance of 4 (milliamperes)". What is the probability that a 
measurement exceeds 13 milliamperes? 

Let X denote the current in milliamperes. The requested probability can be represented as 
P(X > 13). This probability is shown as the shaded area under the normal probability density 
function in Fig. 4-11. Unfortunately, there is no closed-form expression for the integral of a 
normal probability density function, and probabilities based on the normal distribution are 
typically found numerically or from a table (that we will later introduce). 


Some useful results concerning a normal distribution are summarized below and in 
Fig. 4-12. For any normal random variable, 


P(w — o € X € y + o) = 0.6827 
P(w — 20 < X < pt 20) = 0.9545 
P(w — 30 < X < y + 30) = 0.9973 


Also, from the symmetry of f(x), P(X > u) = P(X < p) = 0.5. Because f(x) is positive for 
all x, this model assigns some probability to each interval of the real line. However, the 


f (x) 
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f (x) 





p-3e U-20 p-o u ptc u+20o p+30o x 


I— 68% —>| 
10 13 " 99.7% —— — — — —- 








Figure 4-11 Probability that X > 13 fora normal ran- Figure 4-12 Probabilities associated with a normal 
dom variable with u = 10 and o? = 4. distribution. 


Definition 


EXAMPLE 4-11 


probability density function decreases as x moves farther from u. Consequently, the probability 
that a measurement falls far from p is small, and at some distance from pu the probability of an 
interval can be approximated as zero. 

The area under a normal probability density function beyond 3o from the mean is quite 
small. This fact is convenient for quick, rough sketches of a normal probability density func- 
tion. The sketches help us determine probabilities. Because more than 0.9973 of the probabil- 
ity of a normal distribution is within the interval (u — 3o, u + 30), 60 is often referred to as 
the width of a normal distribution. Advanced integration methods can be used to show that the 
area under the normal probability density function from —% < x < o is 1. 


A normal random variable with 


p 70 and o 


is called a standard normal random variable and is denoted as Z. 
The cumulative distribution function of a standard normal random variable is 
denoted as 





Appendix Table II provides cumulative probability values for ®(z), for a standard normal 
random variable. Cumulative distribution functions for normal random variables are also 
widely available in computer packages. They can be used in the same manner as Appendix 
Table II to obtain probabilities for these random variables. The use of Table II is illustrated by 
the following example. 


Assume Z is a standard normal random variable. Appendix Table II provides probabilities of 
the form P(Z = z). The use of Table II to find P(Z < 1.5) is illustrated in Fig. 4-13. Read 
down the z column to the row that equals 1.5. The probability is read from the adjacent col- 
umn, labeled 0.00, to be 0.93319. 

The column headings refer to the hundredth's digit of the value of z in P(Z = z). For ex- 
ample, P(Z < 1.53) is found by reading down the z column to the row 1.5 and then selecting 
the probability from the column labeled 0.03 to be 0.93699. 
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P(Z < 1.5) = (1.5) 
= shaded area 


0.00 0.01 0.02 0.03 





O | 0.50000 0.50399 0.50398 0.51197 
Figure 4-13 Standard 


normal probability den- 1.5 | 0.93319 0.93448 0.93574 0.93699 
sity function. 0 1:5 z 


Probabilities that are not of the form P(Z x z ) are found by using the basic rules of prob- 
ability and the symmetry of the normal distribution along with Appendix Table IT. The fol- 
lowing examples illustrate the method. 


EXAMPLE 4-12 The following calculations are shown pictorially in Fig. 4-14. In practice, a probability is of- 
ten rounded to one or two significant digits. 


(1) P(Z>1.26)=1— P(Z- 126) = 1 — 0.89616 = 0.10384 
(2) P(Z- —0.86) = 0.19490. 
(3) P(Z> —137) = P(Z < 1.37) = 0.91465 


(4) P(—1.25 < Z « 0.37). This probability can be found from the difference of two 
areas, P(Z < 0.37) — P(Z < —1.25). Now, 


P(Z < 0.37) = 0.64431 and P(Z< —1.25) = 0.10565 


Therefore, 


P(—1.25 < Z « 0.37) = 0.64431 — 0.10565 = 0.53866 


NS P uu SN 
0 ie -4.6 -3.99 
AN 4M 
-0.86 0 z 21.65 
(3) 
NC = NT 0.005 0.005 
1:37 0 z= 2.58 
LN. NC LN. 
1,95 0 0.37 0 0.37 -1.25 


Figure 4-14 Graphical displays for standard normal distributions. 


EXAMPLE 4-13 
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(5 P(Z < —4.6) cannot be found exactly from Appendix Table II. However, the last 
entry in the table can be used to find that P(Z = —3.99) = 0.00003. Because 
P(Z = —4.6) < P(Z = —3.99), P(Z = —4.6) is nearly zero. 


(6) Find the value z such that P(Z > z) = 0.05. This probability expression can be writ- 
ten as P(Z X z) = 0.95. Now, Table II is used in reverse. We search through the 
probabilities to find the value that corresponds to 0.95. The solution is illustrated in 
Fig. 4-14. We do not find 0.95 exactly; the nearest value is 0.95053, corresponding 
tog 1.65. 


(7) Find the value of z such that P(—z < Z < z) = 0.99. Because of the symmetry of 
the normal distribution, if the area of the shaded region in Fig. 4-14(7) is to equal 
0.99, the area in each tail of the distribution must equal 0.005. Therefore, the value 
for z corresponds to a probability of 0.995 in Table II. The nearest probability in 
Table II is 0.99506, when z = 2.58. 


The preceding examples show how to calculate probabilities for standard normal random 
variables. To use the same approach for an arbitrary normal random variable would require a 
separate table for every possible pair of values for u and ø. Fortunately, all normal probability 
distributions are related algebraically, and Appendix Table II can be used to find the probabili- 
ties associated with an arbitrary normal random variable by first using a simple transformation. 


If X is a normal random variable with E(X) = u and V(X) = o^, the random variable 


(4-10) 


is a normal random variable with E(Z) — 0 and V(Z) — 1. That is, Z is a standard 
normal random variable. 


Creating a new random variable by this transformation is referred to as standardizing. 
The random variable Z represents the distance of X from its mean in terms of standard devia- 
tions. It is the key step to calculate a probability for an arbitrary normal random variable. 


Suppose the current measurements in a strip of wire are assumed to follow a normal distribu- 
tion with a mean of 10 milliamperes and a variance of 4 (milliamperes)”. What is the proba- 
bility that a measurement will exceed 13 milliamperes? 

Let X denote the current in milliamperes. The requested probability can be represented as 
P(X > 13). Let Z = (X — 10)/2. The relationship between the several values of X and the 
transformed values of Z are shown in Fig. 4-15. We note that X > 13 corresponds to Z > 1.5. 
Therefore, from Appendix Table II, 


P(X > 13) = P(Z > 1.5) = 1 — P(Zx 15) = 1 — 0.93319 = 0.06681 
Rather than using Fig. 4-15, the probability can be found from the inequality X > 13. That is, 


(X= 10) _ (13 = 10) 
2 2 








P(X > 13) = p( = P(Z > 1.5) = 0.06681 
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Distribution of Z = me 








Oo 


Distribution of X 






4 7 9 1011 13 16 x 


-3 -1.5 -0.50 0.5 1.5 3 z 
10 13 x 


Figure 4-15 Standardizing a normal random variable. 





EXAMPLE 4-14 


In the preceding example, the value 13 is transformed to 1.5 by standardizing, and 1.5 is 
often referred to as the z-value associated with a probability. The following summarizes the 
calculation of probabilities derived from normal random variables. 


Suppose X is a normal random variable with mean p and variance o^. Then, 


D c. MENS ur 
Li+) ze) (4-11) 


g= 
where Z is a standard normal random variable, and z = l - u) is the z-value 
obtained by standardizing X. 

The probability is obtained by entering Appendix Table II with z = (x — p)/o. 


Continuing the previous example, what is the probability that a current measurement is be- 
tween 9 and 11 milliamperes? From Fig. 4-15, or by proceeding algebraically, we have 


P(9 < X < 11) = P((9 — 10)/2 < (X — 10)/2 < (11 — 10)/2) 
= P(-0.5 < Z < 0.5) = P(Z < 0.5) - P(Z < -0.5) 
— 0.69146 — 0.30854 — 0.38292 


Determine the value for which the probability that a current measurement is below 
this value 1s 0.98. The requested value is shown graphically in Fig. 4-16. We need the value of 
x such that P(X « x) = 0.98. By standardizing, this probability expression can be written as 


P(X < x) = P((X — 10/2 < (x — 10)/2) 
= P(Z < (x — 10)/2) 
= 0.98 


Appendix Table II is used to find the z-value such that P(Z < z) = 0.98. The nearest proba- 
bility from Table II results in 


P(Z « 2.05) — 0.97982 
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Figure 4-16 Deter- 
mining the value of x 
to meet a specified 
probability. 





Therefore, (x — 10)/2 — 2.05, and the standardizing transformation is used in reverse to solve 
for x. The result is 


x = 2(2.05) + 10 = 14.1 milliamperes 


EXAMPLE 4-15 Assume that in the detection of a digital signal the background noise follows a normal distri- 
bution with a mean of 0 volt and standard deviation of 0.45 volt. The system assumes a digi- 
tal 1 has been transmitted when the voltage exceeds 0.9. What is the probability of detecting 
a digital 1 when none was sent? 

Let the random variable N denote the voltage of noise. The requested probability is 


N 0.9 


0.45 22) P(Z > 2) = 1 — 0.97725 = 0.02275 


PIN > 09) = P( 


This probability can be described as the probability of a false detection. 
Determine symmetric bounds about 0 that include 99% of all noise readings. The question 
requires us to find x such that P( —x < N < x) = 0.99. A graph is shown in Fig. 4-17. Now, 


P(—x < N < x) = P(—x/0.45 < N/0.45 < x/0.45) 
= P(—x/0.45 < Z < x/0.45) = 0.99 


From Appendix Table II 


P(—2.58 « Z « 2.58) = 0.99 


Distribution of N 


Figure 4-17 Deter- Standardized distribution of E 

mining the value of x 

to meet a specified 

probability. -z 0 z -x O0 x 
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EXAMPLE 4-16 


Figure 4-18 
Distribution for 
Example 4-16. 


Therefore, 
x/0.45 — 2.58 
and 
x = 2.58(0.45) = 1.16 
Suppose a digital 1 is represented as a shift in the mean of the noise distribution to 1.8 


volts. What is the probability that a digital 1 is not detected? Let the random variable S denote 
the voltage when a digital 1 is transmitted. Then, 





S—18 09-18 
c T | = P(Z < —2) = 0.02275 


< 0.9) = 
iR d ( 0.45 0.45 
This probability can be interpreted as the probability of a missed signal. 


The diameter of a shaft in an optical storage drive is normally distributed with mean 0.2508 
inch and standard deviation 0.0005 inch. The specifications on the shaft are 0.2500 + 0.0015 
inch. What proportion of shafts conforms to specifications? 

Let X denote the shaft diameter in inches. The requested probability is shown in Fig. 4-18 and 


0.2485 — 0.2508 JS ie es x) 

— rre ee 
0.0005 0.0005 

= P(—4.6 < Z < 14) = P(Z < 14) - P(Z < —4.6) 

= 0.91924 — 0.0000 = 0.91924 


P(0.2485 < X < 0.2515) = P ( 


Most of the nonconforming shafts are too large, because the process mean is located very near 
to the upper specification limit. If the process is centered so that the process mean is equal to 
the target value of 0.2500, 


0.2485 — 0.2500 0.2515 = 0250") 


« X « 0. = a7 = 
P(0.2485 < X < 0.2515) p( 0.0005 0.0005 


xd <7 = 3) 
= P(Z < 3) — P(Z < o 
= 0.99865 — 0.00135 
= 0.9973 
By recentering the process, the yield is increased to approximately 99.73%. 


f(x) Specifications ————> 





0.2485 / 0.2508 0.2515 x 
0.25 
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Mean and Variance of the Normal Distribution (CD Only) 


EXERCISES FOR SECTION 4-6 


4-39. Use Appendix Table II to determine the following 
probabilities for the standard normal random variable Z: 

(a) P(Z « 1.32) (b) P(Z « 3.0) 

(c) P(Z > 1.45) (d) P(Z > =2.15) 

(e) P(—2.34 < Z < 1.76) 

4-40. Use Appendix Table II to determine the following 
probabilities for the standard normal random variable Z: 

(a) P(—-1«72«€1) (b)P(-2€2792) 

(c) P(-3 —Z=— 3) (d)PRZ 3) 

(e) PO<Z<1) 

4-41. Assume Z has a standard normal distribution. Use 
Appendix Table II to determine the value for z that solves each 
of the following: 

(a) P( Z € z) ^ 0.9 

(c) P(Zz)-0. 

(e) P(-1.24<Z<z)=0.8 
4-42. Assume Z has a standard normal distribution. Use 
Appendix Table II to determine the value for z that solves each 
of the following: 

(a P(-z<Z<z)=0.95 (b) P(—-z € Z «z) ^ 0.99 

(c) P(-z<Z<z)=0.68 (d) P(—z < Z < z) = 0.9973 
4-45. Assume X is normally distributed with a mean of 10 
and a standard deviation of 2. Determine the following: 

(a) P(X « 13) (b) P(X > 9) 

(c) P'6« X«14) (dA P2 « X « 4) 

(e) P(-2«X«8) 

4-44. Assume X is normally distributed with a mean of 10 
and a standard deviation of 2. Determine the value for x that 
solves each of the following: 

(a) P(X > x) = 0.5 

(b) P(X > x) = 0.95 

(c) P(x <X < 10) 2 02 

(d) P(—x < X — 10 < x) = 0.95 

(e) P(—x« X-— 10 < x) = 0.99 

4-45. Assume X is normally distributed with a mean of 5 
and a standard deviation of 4. Determine the following: 

(a) P(X « 11) (b) P(X > 0) 

(c) P(3-— X T) (d) P(—~2<X <9) 

(e) P(2«— X« 8) 

4-46. Assume X is normally distributed with a mean of 5 
and a standard deviation of 4. Determine the value for x that 
solves each of the following: 
(a) P(X > x) ^ 0.5 

(c) Px «X«9)—-02 

(e) P(—x < X < x) = 0.99 
4-47. The compressive strength of samples of cement can 
be modeled by a normal distribution with a mean of 6000 kilo- 
grams per square centimeter and a standard deviation of 100 
kilograms per square centimeter. 


(b) P(Z € z) = 0.5 
(d) P(Z > 2) = 0.9 


(b) P(X > x) = 0.95 
(d) PG < X < x) = 0.95 


(a) What is the probability that a sample's strength is less than 
6250 Kg/cm?? 

(b) What is the probability that a sample's strength is between 
5800 and 5900 Kg/cm?? 

(c) What strength is exceeded by 95% of the samples? 


4-48. The tensile strength of paper is modeled by a normal 

distribution with a mean of 35 pounds per square inch and a 

standard deviation of 2 pounds per square inch. 

(a) What is the probability that the strength of a sample is less 
than 40 Ib/in?? 

(b) If the specifications require the tensile strength to 
exceed 30 Ib/in?, what proportion of the samples is 
scrapped? 


4-49. The line width of for semiconductor manufacturing is 

assumed to be normally distributed with a mean of 0.5 mi- 

crometer and a standard deviation of 0.05 micrometer. 

(a) What is the probability that a line width is greater than 
0.62 micrometer? 

(b) What is the probability that a line width is between 0.47 
and 0.63 micrometer? 

(c) The line width of 90% of samples is below what value? 


4-50. The fill volume of an automated filling machine used 

for filling cans of carbonated beverage is normally distributed 

with a mean of 12.4 fluid ounces and a standard deviation of 

0.1 fluid ounce. 

(a) What is the probability a fill volume is less than 12 fluid 
ounces? 

(b) If all cans less than 12.1 or greater than 12.6 ounces are 
scrapped, what proportion of cans is scrapped? 

(c) Determine specifications that are symmetric about the 
mean that include 99% of all cans. 


4-51. The time it takes a cell to divide (called mitosis) is 

normally distributed with an average time of one hour and a 

standard deviation of 5 minutes. 

(a) What is the probability that a cell divides in less than 
45 minutes? 

(b) What is the probability that it takes a cell more than 
65 minutes to divide? 

(c) What is the time that it takes approximately 99% of all 
cells to complete mitosis? 


4-52. Inthe previous exercise, suppose that the mean of the 

filling operation can be adjusted easily, but the standard devi- 

ation remains at 0.1 ounce. 

(a) At what value should the mean be set so that 99.9% of all 
cans exceed 12 ounces? 

(b) At what value should the mean be set so that 99.9% of all 
cans exceed 12 ounces if the standard deviation can be re- 
duced to 0.05 fluid ounce? 
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4-53. The reaction time of a driver to visual stimulus is nor- 

mally distributed with a mean of 0.4 seconds and a standard 

deviation of 0.05 seconds. 

(a) What is the probability that a reaction requires more than 
0.5 seconds? 

(b) What is the probability that a reaction requires between 
0.4 and 0.5 seconds? 

(c) What is the reaction time that is exceeded 90% of the 
time? 

4-54. The speed of a file transfer from a server on campus to 

a personal computer at a student's home on a weekday 

evening is normally distributed with a mean of 60 kilobits per 

second and a standard deviation of 4 kilobits per second. 

(a) What is the probability that the file will transfer at a speed 
of 70 kilobits per second or more? 

(b) What is the probability that the file will transfer at a speed 
of less than 58 kilobits per second? 

(c) If the file is 1 megabyte, what is the average time it will 
take to transfer the file? (Assume eight bits per byte.) 


4-55. The length of an injection-molded plastic case that 

holds magnetic tape is normally distributed with a length of 

90.2 millimeters and a standard deviation of 0.1 millimeter. 

(a) What is the probability that a part is longer than 90.3 mil- 
limeters or shorter than 89.7 millimeters? 

(b) What should the process mean be set at to obtain the great- 
est number of parts between 89.7 and 90.3 millimeters? 

(c) If parts that are not between 89.7 and 90.3 millimeters are 
scrapped, what is the yield for the process mean that you 
selected in part (b)? 


4-56. In the previous exercise assume that the process is 

centered so that the mean is 90 millimeters and the standard 

deviation is 0.1 millimeter. Suppose that 10 cases are meas- 

ured, and they are assumed to be independent. 

(a) What is the probability that all 10 cases are between 89.7 
and 90.3 millimeters? 

(b) What is the expected number of the 10 cases that are be- 
tween 89.7 and 90.3 millimeters? 


4-57. The sick-leave time of employees in a firm in a month 

is normally distributed with a mean of 100 hours and a stan- 

dard deviation of 20 hours. 

(a) What is the probability that the sick-leave time for next 
month will be between 50 and 80 hours? 

(b) How much time should be budgeted for sick leave if the 
budgeted amount should be exceeded with a probability 
of only 10%? 

4-58. The life of a semiconductor laser at a constant power 

is normally distributed with a mean of 7000 hours and a stan- 

dard deviation of 600 hours. 

(a) What is the probability that a laser fails before 5000 
hours? 

(b) What is the life in hours that 95% of the lasers exceed? 

(c) If three lasers are used in a product and they are assumed 
to fail independently, what is the probability that all three 
are still operating after 7000 hours? 


4-59. The diameter of the dot produced by a printer is nor- 

mally distributed with a mean diameter of 0.002 inch and a 

standard deviation of 0.0004 inch. 

(a) What is the probability that the diameter of a dot exceeds 
0.0026 inch? 

(b) What is the probability that a diameter is between 0.0014 
and 0.0026 inch? 

(c) What standard deviation of diameters is needed so that the 
probability in part (b) is 0.995? 

4-60. The weight of a sophisticated running shoe is nor- 

mally distributed with a mean of 12 ounces and a standard de- 

viation of 0.5 ounce. 

(a) What is the probability that a shoe weighs more than 13 
ounces? 

(b) What must the standard deviation of weight be in order for 
the company to state that 99.9% of its shoes are less than 
13 ounces? 

(c) If the standard deviation remains at 0.5 ounce, what must 
the mean weight be in order for the company to state that 
99.9% of its shoes are less than 13 ounces? 


4-7 NORMAL APPROXIMATION TO THE BINOMIAL 


AND POISSON DISTRIBUTIONS 


We began our section on the normal distribution with the central limit theorem and the nor- 
mal distribution as an approximation to a random variable with a large number of trials. 
Consequently, it should not be a surprise to learn that the normal distribution can be used 
to approximate binomial probabilities for cases in which n is large. The following example 
illustrates that for many physical systems the binomial model is appropriate with an ex- 
tremely large value for n. In these cases, it is difficult to calculate probabilities by using the 
binomial distribution. Fortunately, the normal approximation is most effective in these 
cases. An illustration is provided in Fig. 4-19. The area of each bar equals the binomial 
probability of x. Notice that the area of bars can be approximated by areas under the nor- 


mal density function. 


Figure 4-19 Normal 
approximation to the 
binomial distribution. 


EXAMPLE 4-17 


Normal 
Approximation to 
the Binomial 
Distribution 
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TOR) 


0.10 


0.05 


In a digital communication channel, assume that the number of bits received in error can be 
modeled by a binomial random variable, and assume that the probability that a bit 1s received 
in error is 1 X 10 ?. If 16 million bits are transmitted, what is the probability that more than 
150 errors occur? 

Let the random variable X denote the number of errors. Then X is a binomial random vari- 
able and 


150 /16.000,000 
P(X > 150) = 1 — Pw s 150) = 1 — >( > " J aoa E 1 075)16,000,000—x 
x=0 


Clearly, the probability in Example 4-17 is difficult to compute. Fortunately, the normal 
distribution can be used to provide an excellent approximation in this example. 


If X is a binomial random variable, 


_ X — np 
Vnp(l — p) 


(4-12) 
is approximately a standard normal random variable. The approximation is good for 


a = Sahn ial p S 


Recall that for a binomial variable X, E(X) = np and V(X) = np(1 — p). Consequently, the ex- 
pression in Equation 4-12 is nothing more than the formula for standardizing the random vari- 
able X. Probabilities involving X can be approximated by using a standard normal distribution. 
The approximation is good when n is large relative to p. 
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EXAMPLE 4-18 


EXAMPLE 4-19 


. The digital communication problem in the previous example is solved as follows: 


X — 160 150 — 160 
P(X > 150) = P 


V 160(1 — 107°) 7 V 160(1 — 107°) 


= P(Z > -0.79) = P(Z < 0.79) = 0.785 


Because np = (16 X 109)(1 10^?) = 160and n(1 — p) is much larger, the approximation 


. 1s expected to work well in this case. 


Again consider the transmission of bits in Example 4-18. To judge how well the normal 
approximation works, assume only n — 50 bits are to be transmitted and that the probability 
of an error is p — 0.1. The exact probability that 2 or less errors occur is 


50 
P(X =2)= G Joss + "n 0.1(0.9%7) + (5) 0.1^(0.9*5) = 0.112 


Based on the normal approximation 


pregar g jenze a00 
posue 2.12 312 |” AZ =N, 


. Even for a sample as small as 50 bits, the normal approximation is reasonable. 


If np or n(1 — p) is small, the binomial distribution is quite skewed and the symmetric 
normal distribution is not a good approximation. Two cases are illustrated in Fig. 4-20. 
However, a correction factor can be used that will further improve the approximation. This 
factor is called a continuity correction and it is discussed in Section 4-8 on the CD. 




















p 
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Figure 4-20 Binomial , 
distribution is not ES 
symmetrical if p is near 6 7 8 9 10 


0 or 1. x 


Normal 
Approximation to 
the Poisson 
Distribution 


EXAMPLE 4-20 
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hypergometric ~ binomial ~ normal 
distribution B ond distribution np > 5 distribution 
N l nil — p)>5 


Figure 4-21 Conditions for approximating hypergeometric and binomial probabilities. 


Recall that the binomial distribution is a satisfactory approximation to the hypergeomet- 
ric distribution when n, the sample size, is small relative to N, the size of the population from 
which the sample is selected. A rule of thumb is that the binomial approximation is effective 
if n/N < 0.1. Recall that for a hypergeometric distribution p is defined as p = K/N. That is, 
p is interpreted as the number of successes in the population. Therefore, the normal distribu- 
tion can provide an effective approximation of hypergeometric probabilities when n/N < 0.1, 
np > 5 and n(1 — p) > 5. Figure 4-21 provides a summary of these guidelines. 

Recall that the Poisson distribution was developed as the limit of a binomial distribution as 
the number of trials increased to infinity. Consequently, it should not be surprising to find that the 
normal distribution can also be used to approximate probabilities of a Poisson random variable. 


If X is a Poisson random variable with E(X) = A and V(X) = A, 


(4-13) 


is approximately a standard normal random variable. The approximation is good for 


hy 2 S 





Assume that the number of asbestos particles in a squared meter of dust on a surface follows 
a Poisson distribution with a mean of 1000. If a squared meter of dust is analyzed, what is the 
probability that less than 950 particles are found? 

This probability can be expressed exactly as 


950 B 1000,.1000 


P(X = 950) = © 
x=0 x! 


The computational difficulty is clear. The probability can be approximated as 


950 — 1000 
< Lu 


prs) =P(z= VUTODU )-»z- —1.58) = 0.057 


EXERCISES FOR SECTION 4-7 


4-61. Suppose that X is a binomial random variable with 4-62. Suppose that X is a binomial random variable with 


n = 200 and p = 0.4. 


n — 100 and p = 0.1. 


(a) Approximate the probability that X 1s less than or equal (a) Compute the exact probability that X is less than 4. 


to 70. (b) Approximate the probability that X is less than 4 and com- 
(b) Approximate the probability that X 1s greater than 70 and pare to the result in part (a). 
less than 90. (c) Approximate the probability that 8 < X < 12. 
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4-63. The manufacturing of semiconductor chips produces 

2% defective chips. Assume the chips are independent and 

that a lot contains 1000 chips. 

(a) Approximate the probability that more than 25 chips are 
defective. 

(b) Approximate the probability that between 20 and 30 chips 
are defective. 


4-64. A supplier ships a lot of 1000 electrical connectors. A 

sample of 25 is selected at random, without replacement. 

Assume the lot contains 100 defective connectors. 

(a) Using a binomial approximation, what 1s the probability 
that there are no defective connectors in the sample? 

(b) Use the normal approximation to answer the result in part 
(a). Is the approximation satisfactory? 

(c) Redo parts (a) and (b) assuming the lot size 1s 500. Is the nor- 
mal approximation to the probability that there are no defec- 
tive connectors in the sample satisfactory in this case? 


4-65. An electronic office product contains 5000 elec- 
tronic components. Assume that the probability that each 
component operates without failure during the useful life of 
the product is 0.999, and assume that the components fail 
independently. Approximate the probability that 10 or more 
of the original 5000 components fail during the useful life of 
the product. 

4-66. Suppose that the number of asbestos particles in a sam- 
ple of 1 squared centimeter of dust is a Poisson random variable 
with a mean of 1000. What 1s the probability that 10 squared cen- 
timeters of dust contains more than 10,000 particles? 

4-67. A corporate Web site contains errors on 50 of 1000 
pages. If 100 pages are sampled randomly, without replace- 


ment, approximate the probability that at least 1 of the pages 
in error are in the sample. 


4-68. Hits to a high-volume Web site are assumed to follow 

a Poisson distribution with a mean of 10,000 per day. 

Approximate each of the following: 

(a) The probability of more than 20,000 hits in a day 

(b) The probability of less than 9900 hits in a day 

(c) The value such that the probability that the number of hits 
in a day exceed the value is 0.01 


4-69. Continuation of Exercise 4-68. 

(a) Approximate the expected number of days in a year (365 
days) that exceed 10,200 hits. 

(b) Approximate the probability that over a year (365 days) 
more than 15 days each have more than 10,200 hits. 


4-70. The percentage of people exposed to a bacteria who 

become ill is 20%. Assume that people are independent. Assume 

that 1000 people are exposed to the bacteria. Approximate each 

of the following: 

(a) The probability that more than 225 become ill 

(b) The probability that between 175 and 225 become ill 

(c) The value such that the probability that the number of peo- 
ple that become ill exceeds the value is 0.01 


4-71. A high-volume printer produces minor print-quality 

errors on a test pattern of 1000 pages of text according to a 

Poisson distribution with a mean of 0.4 per page. 

(a) Why are the number of errors on each page independent 
random variables? 

(b) What is the mean number of pages with errors (one or more)? 

(c) Approximate the probability that more than 350 pages 
contain errors (one or more). 


4-8 CONTINUITY CORRECTION TO IMPROVE 


THE APPROXIMATION (CD ONLY) 


4-9 EXPONENTIAL DISTRIBUTION 


The discussion of the Poisson distribution defined a random variable to be the number of 
flaws along a length of copper wire. The distance between flaws is another random variable 
that is often of interest. Let the random variable X denote the length from any starting point on 


the wire until a flaw 1s detected. 


As you might expect, the distribution of X can be obtained from knowledge of the 
distribution of the number of flaws. The key to the relationship is the following concept. The 
distance to the first flaw exceeds 3 millimeters if and only if there are no flaws within a length 
of 3 millimeters—simple, but sufficient for an analysis of the distribution of X. 

In general, let the random variable N denote the number of flaws in x millimeters of wire. 
If the mean number of flaws is A per millimeter, NV has a Poisson distribution with mean Ax. 
We assume that the wire is longer than the value of x. Now, 


PX x) 


p 


0! 


Definition 


EXAMPLE 4-21 
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Therefore, 


is the cumulative distribution function of X. By differentiating F(x), the probability density 
function of X is calculated to be 


isie", 20 


The derivation of the distribution of X depends only on the assumption that the flaws in 
the wire follow a Poisson process. Also, the starting point for measuring X doesn’t matter 
because the probability of the number of flaws in an interval of a Poisson process depends 
only on the length of the interval, not on the location. For any Poisson process, the following 
general result applies. 


The random variable X that equals the distance between successive counts of a 
Poisson process with mean A > 0 is an exponential random variable with parame- 


ter A. The probability density function of X is 


Nem ea (cesse ces (4-14) 





The exponential distribution obtains its name from the exponential function in the proba- 
bility density function. Plots of the exponential distribution for selected values of A are shown 
in Fig. 4-22. For any value of A, the exponential distribution is quite skewed. The following 
results are easily obtained and are left as an exercise. 


If the random variable X has an exponential distribution with parameter A, 


and o^ = 





Itis important to use consistent units in the calculation of probabilities, means, and variances 
involving exponential random variables. The following example illustrates unit conversions. 


In a large corporate computer network, user log-ons to the system can be modeled as a Pois- 
son process with a mean of 25 log-ons per hour. What is the probability that there are no log- 
ons in an interval of 6 minutes? 

Let X denote the time in hours from the start of the interval until the first log-on. Then, X 
has an exponential distribution with A = 25 log-ons per hour. We are interested in the proba- 
bility that X exceeds 6 minutes. Because A is given in log-ons per hour, we express all time 
units in hours. That is, 6 minutes = 0.1 hour. The probability requested is shown as the shaded 
area under the probability density function in Fig. 4-23. Therefore, 


oO 


P(X > 0.1) = | 25e Jp ex g 9990 = 0085 
0 


a! 
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f(x) 
f(x) 
O 2 4 6 8 10 12 
x 0.1 x 
Figure 4-22 Probability density function of expo- Figure 4-23 Probability for the expo- 
nential random variables for selected values of A. nential distribution in Example 4-21. 


Also, the cumulative distribution function can be used to obtain the same result as follows: 
P(X > 0.1) = 1 — F(0.1) = e 230.0 


An identical answer is obtained by expressing the mean number of log-ons as 0.417 log- 
ons per minute and computing the probability that the time until the next log-on exceeds 6 
minutes. Try it. 

What is the probability that the time until the next log-on is between 2 and 3 minutes? 
Upon converting all units to hours, 


0.05 
0.05 
P(0.033 < X < 0.05) = | 25e ^" dx = —g ^* = 0.152 
0.033 
0.033 


An alternative solution is 
P(0.033 < X < 0.05) = F(0.05) — F(0.033) = 0.152 


Determine the interval of time such that the probability that no log-on occurs in the inter- 
val is 0.90. The question asks for the length of time x such that PLY > x) = 0.90. Now, 


P(X2x)-—&9*-090 
Take the (natural) log of both sides to obtain —25x = In(0.90) = —0.1054. Therefore, 


x — 0.00421 hour — 0.25 minute 


EXAMPLE 4-22 
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Furthermore, the mean time until the next log-on is 
u = 1/25 = 0.04 hour = 2.4 minutes 
The standard deviation of the time until the next log-on is 
o = 1/25 hours = 2.4 minutes 


In the previous example, the probability that there are no log-ons in a 6-minute interval is 
0.082 regardless of the starting time of the interval. A Poisson process assumes that events oc- 
cur uniformly throughout the interval of observation; that 1s, there is no clustering of events. 
If the log-ons are well modeled by a Poisson process, the probability that the first log-on after 
noon occurs after 12:06 P.M. is the same as the probability that the first log-on after 3:00 P.M. 
occurs after 3:06 P.M. And if someone logs on at 2:22 P.M., the probability the next log-on 
occurs after 2:28 P.M. is still 0.082. 

Our starting point for observing the system does not matter. However, if there are 
high-use periods during the day, such as right after 8:00 A.M., followed by a period of low 
use, a Poisson process is not an appropriate model for log-ons and the distribution is not 
appropriate for computing probabilities. It might be reasonable to model each of the high- 
and low-use periods by a separate Poisson process, employing a larger value for A during 
the high-use periods and a smaller value otherwise. Then, an exponential distribution with 
the corresponding value of A can be used to calculate log-on probabilities for the high- and 
low-use periods. 


Lack of Memory Property 
An even more interesting property of an exponential random variable is concerned with con- 
ditional probabilities. 


Let X denote the time between detections of a particle with a geiger counter and assume that 
X has an exponential distribution with A = 1.4 minutes. The probability that we detect a par- 
ticle within 30 seconds of starting the counter is 


P(X < 0.5 minute) = F(0.5) = 1 — e 91^ = 0.30 


In this calculation, all units are converted to minutes. Now, suppose we turn on the geiger 
counter and wait 3 minutes without detecting a particle. What 1s the probability that a particle 
is detected in the next 30 seconds? 

Because we have already been waiting for 3 minutes, we feel that we are “due.” That 
is, the probability of a detection in the next 30 seconds should be greater than 0.3. However, 
for an exponential distribution, this is not true. The requested probability can be expressed 
as the conditional probability that P(X < 3.5| X > 3). From the definition of conditional 
probability, 


PX<35|X >3)= PC «X < 3.5/P(X > 3) 
where 


P(3 < X < 3.5) = F(35) — F(3) = [1 e ^"^ — [1 — e ?/^] = 0.0035 
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Lack of 
Memory 
Property 


Figure 4-24 Lack of 
memory property of 
an exponential 
distribution. 


and 

P(X > 3) =1- FG) =e% = 0117 
Therefore, 

P(X 35 (xX > 3) = 0035/0117 = 030 


After waiting for 3 minutes without a detection, the probability of a detection in the next 30 
seconds is the same as the probability of a detection in the 30 seconds immediately after start- 
ing the counter. The fact that you have waited 3 minutes without a detection does not change 
the probability of a detection in the next 30 seconds. 


Example 4-22 illustrates the lack of memory property of an exponential random vari- 
able and a general statement of the property follows. In fact, the exponential distribution is the 
only continuous distribution with this property. 


For an exponential random variable X, 


POA i cc DM 





Figure 4-24 graphically illustrates the lack of memory property. The area of region A divided 
by the total area under the probability density function (4 + B + C + D = 1) equals 
P(X < ty). The area of region C divided by the area C + D equals P(X < t, + h| X > fj). The 
lack of memory property implies that the proportion of the total area that is in A equals the 
proportion of the area in C and D that is in C. The mathematical verification of the lack of 
memory property is left as a mind-expanding exercise. 

The lack of memory property is not that surprising when you consider the development 
of a Poisson process. In that development, we assumed that an interval could be partitioned 
into small intervals that were independent. These subintervals are similar to independent 
Bernoulli trials that comprise a binomial process; knowledge of previous results does not af- 
fect the probabilities of events in future subintervals. An exponential random variable is the 
continuous analog of a geometric random variable, and they share a similar lack of memory 
property. 

The exponential distribution 1s often used in reliability studies as the model for the 
time until failure of a device. For example, the lifetime of a semiconductor chip might be 
modeled as an exponential random variable with a mean of 40,000 hours. The lack of 


f(x) 
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memory property of the exponential distribution implies that the device does not wear out. 
That is, regardless of how long the device has been operating, the probability of a failure 
in the next 1000 hours is the same as the probability of a failure in the first 1000 hours of 
operation. The lifetime L of a device with failures caused by random shocks might be ap- 
propriately modeled as an exponential random variable. However, the lifetime L of a 
device that suffers slow mechanical wear, such as bearing wear, 1s better modeled by a dis- 
tribution such that P(L < t + At| L > f) increases with t. Distributions such as the Weibull 
distribution are often used, in practice, to model the failure time of this type of device. The 
Weibull distribution is presented in a later section. 


EXERCISES FOR SECTION 4-9 


4-72. Suppose X has an exponential distribution with X = 2. 
Determine the following: 

(a P(X-0) (b) PX = 2) 

(c) P(Xx 1) (d) P(1« X « 2) 

(e) Find the value of x such that PLY < x) = 0.05. 


4-73. Suppose X has an exponential distribution with mean 
equal to 10. Determine the following: 

(a) P(X > 10) 

(b) P(X > 20) 

(c) P(X > 30) 

(d) Find the value of x such that P(X < x) = 0.95. 


4-74. Suppose the counts recorded by a geiger counter follow 

a Poisson process with an average of two counts per minute. 

(a) What is the probability that there are no counts in a 30- 
second interval? 

(b) What is the probability that the first count occurs in less 
than 10 seconds? 

(c) What is the probability that the first count occurs between 
1 and 2 minutes after start-up? 


4-75. Suppose that the log-ons to a computer network fol- 

low a Poisson process with an average of 3 counts per minute. 

(a) What is the mean time between counts? 

(b) What is the standard deviation of the time between counts? 

(c) Determine x such that the probability that at least one 
count occurs before time x minutes is 0.95. 


4-76. The time to failure (in hours) for a laser in a cytome- 

try machine is modeled by an exponential distribution with 

A = 0.00004. 

(a) What is the probability that the laser will last at least 
20,000 hours? 

(b) What is the probability that the laser will last at most 
30,000 hours? 

(c) What is the probability that the laser will last between 
20,000 and 30,000 hours? 


4-77. The time between calls to a plumbing supply business 

is exponentially distributed with a mean time between calls of 

15 minutes. 

(a) What is the probability that there are no calls within a 30- 
minute interval? 


(b) What is the probability that at least one call arrives within 
a 10-minute interval? 

(c) What is the probability that the first call arrives within 5 
and 10 minutes after opening? 

(d) Determine the length of an interval of time such that the 
probability of at least one call in the interval is 0.90. 


4-78. The life of automobile voltage regulators has an expo- 

nential distribution with a mean life of six years. You purchase 

an automobile that is six years old, with a working voltage 

regulator, and plan to own it for six years. 

(a) What is the probability that the voltage regulator fails dur- 
ing your ownership? 

(b) If your regulator fails after you own the automobile three 
years and it is replaced, what is the mean time until the 
next failure? 


4-79. The time to failure (in hours) of fans in a personal com- 
puter can be modeled by an exponential distribution with 
A = 0.0003. 

(a) What proportion of the fans will last at least 10,000 hours? 
(b) What proportion of the fans will last at most 7000 hours? 


4-80. The time between the arrival of electronic messages at 
your computer is exponentially distributed with a mean of two 
hours. 

(a) What is the probability that you do not receive a message 
during a two-hour period? 

(b) If you have not had a message in the last four hours, what 
is the probability that you do not receive a message in the 
next two hours? 

(c) What is the expected time between your fifth and sixth 
messages? 


4-81. The time between arrivals of taxis at a busy intersec- 

tion is exponentially distributed with a mean of 10 minutes. 

(a) What is the probability that you wait longer than one hour 
for a taxi? 

(b) Suppose you have already been waiting for one hour for a 
taxi, what is the probability that one arrives within the 
next 10 minutes? 

4-82. Continuation of Exercise 4-81. 

(a) Determine x such that the probability that you wait more 
than x minutes is 0.10. 
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(b) Determine x such that the probability that you wait less 
than x minutes is 0.90. 

(c) Determine x such that the probability that you wait less 
than x minutes is 0.50. 


4-85. The distance between major cracks in a highway fol- 

lows an exponential distribution with a mean of 5 miles. 

(a) What is the probability that there are no major cracks in a 
10-mile stretch of the highway? 

(b) What 1s the probability that there are two major cracks in 
a 10-mile stretch of the highway? 

(c) What is the standard deviation of the distance between 
major cracks? 


4-84. Continuation of Exercise 4-83. 

(a) What is the probability that the first major crack occurs 
between 12 and 15 miles of the start of inspection? 

(b) What is the probability that there are no major cracks in 
two separate 5-mile stretches of the highway? 

(c) Given that there are no cracks in the first 5 miles in- 
spected, what is the probability that there are no major 
cracks in the next 10 miles inspected? 


4-85. The lifetime of a mechanical assembly in a vibration 

test is exponentially distributed with a mean of 400 hours. 

(a) What is the probability that an assembly on test fails in 
less than 100 hours? 

(b) What is the probability that an assembly operates for more 
than 500 hours before failure? 

(c) If an assembly has been on test for 400 hours without a fail- 
ure, what is the probability of a failure in the next 100 hours? 


4-86. Continuation of Exercise 4-85. 

(a) If 10 assemblies are tested, what is the probability that at 
least one fails in less than 100 hours? Assume that the as- 
semblies fail independently. 

(b) If 10 assemblies are tested, what is the probability that all 
have failed by 800 hours? Assume the assemblies fail 
independently. 


4-87. When a bus service reduces fares, a particular trip 
from New York City to Albany, New York, is very popular. 
A small bus can carry four passengers. The time between calls 
for tickets is exponentially distributed with a mean of 30 min- 
utes. Assume that each call orders one ticket. What is the prob- 
ability that the bus is filled in less than 3 hours from the time 
of the fare reduction? 


4-88. The time between arrivals of small aircraft at a county 
airport is exponentially distributed with a mean of one hour. 
What is the probability that more than three aircraft arrive 
within an hour? 


4-89. Continuation of Exercise 4-88. 

(a) If 30 separate one-hour intervals are chosen, what is the 
probability that no interval contains more than three arrivals? 

(b) Determine the length of an interval of time (in hours) such 
that the probability that no arrivals occur during the inter- 
val is 0.10. 


4-90. The time between calls to a corporate office is expo- 

nentially distributed with a mean of 10 minutes. 

(a) What is the probability that there are more than three calls 
in one-half hour? 

(b) What is the probability that there are no calls within one- 
half hour? 

(c) Determine x such that the probability that there are no 
calls within x hours 1s 0.01. 


4-91. Continuation of Exercise 4-90. 

(a) What is the probability that there are no calls within a two- 
hour interval? 

(b) If four nonoverlapping one-half hour intervals are se- 
lected, what is the probability that none of these intervals 
contains any call? 

(c) Explain the relationship between the results in part (a) 
and (b). 

4-92. Ifthe random variable X has an exponential distribu- 

tion with mean 0, determine the following: 

(a P(X29) (b) P(X- 20) 

(c) P(X 30) 

(d) How do the results depend on 0? 

4-95. Assume that the flaws along a magnetic tape follow a 

Poisson distribution with a mean of 0.2 flaw per meter. Let X 

denote the distance between two successive flaws. 

(a) What 1s the mean of X? 

(b) What is the probability that there are no flaws in 10 con- 
secutive meters of tape? 

(c) Does your answer to part (b) change if the 10 meters are 
not consecutive? 

(d) How many meters of tape need to be inspected so that the 
probability that at least one flaw is found is 90%? 


4-94. Continuation of Exercise 4-93. (More difficult ques- 

tions.) 

(a) What is the probability that the first time the distance be- 
tween two flaws exceeds 8 meters is at the fifth flaw? 

(b) What is the mean number of flaws before a distance be- 
tween two flaws exceeds 8 meters? 


4-95. Derive the formula for the mean and variance of an 
exponential random variable. 


4-10 ERLANG AND GAMMA DISTRIBUTIONS 


4-10.1 Erlang Distribution 


An exponential random variable describes the length until the first count is obtained in a 
Poisson process. A generalization of the exponential distribution is the length until r counts 


EXAMPLE 4-23 


Definition 
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occur in a Poisson process. The random variable that equals the interval length until r counts 
occur in a Poisson process has an Erlang random variable. 


The failures of the central processor units of large computer systems are often modeled as a 
Poisson process. Typically, failures are not caused by components wearing out, but by more 
random failures of the large number of semiconductor circuits in the units. Assume that the 
units that fail are immediately repaired, and assume that the mean number of failures per hour 
is 0.0001. Let X denote the time until four failures occur in a system. Determine the probabil- 
ity that X exceeds 40,000 hours. 

Let the random variable N denote the number of failures in 40,000 hours of operation. 
The time until four failures occur exceeds 40,000 hours if and only if the number of failures 
in 40,000 hours 1s three or less. Therefore, 


P(X > 40,000) = P(N = 3) 


The assumption that the failures follow a Poisson process implies that N has a Poisson distri- 
bution with 


E(N) = 40,000(0.0001) = 4 failures per 40,000 hours 


Therefore, 





P(X > 40,000) = P(N <3) = S— = 0.433 


The cumulative distribution function of a general Erlang random variable X can be obtained 
from P(X = x) = 1 — P(X > x), and P(X > x) can be determined as in the previous exam- 
ple. Then, the probability density function of X can be obtained by differentiating the cumula- 
tive distribution function and using a great deal of algebraic simplification. The details are left 
as an exercise. In general, we can obtain the following result. 


The random variable X that equals the interval length until r counts occur in a 
Poisson process with mean A > 0 has an Erlang random variable with parameters 
A and r. The probability density function of X is 


na ped 
f(x) = ac for x > Oandr = 1,2,... (4-17) 





Sketches of the Erlang probability density function for several values of r and A are 
shown in Fig. 4-25. Clearly, an Erlang random variable with r = 1 is an exponential 
random variable. Probabilities involving Erlang random variables are often determined by 
computing a summation of Poisson random variables as in Example 4-23. The probability 
density function of an Erlang random variable can be used to determine probabilities; 
however, integrating by parts 1s often necessary. As was the case for the exponential 
distribution, one must be careful to define the random variable and the parameter in 
consistent units. 
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EXAMPLE 4-24 


f (x) 





Figure 4-25 Erlang probability density functions 
for selected values of r and A. 


An alternative approach to computing the probability requested in Example 4-24 is to inte- 
grate the probability density function of X. That is, 


CO CO 


r= l Ax 
P(X > 40,000) = | f(x) dx — | A dei 


d 
(rc T t 
40,000 40,000 


where r = 4andA = 0.0001. Integration by parts can be used to verify the result obtained 
previously. 


An Erlang random variable can be thought of as the continuous analog of a negative 
binomial random variable. A negative binomial random variable can be expressed as the sum 
of r geometric random variables. Similarly, an Erlang random variable can be represented as 
the sum of r exponential random variables. Using this conclusion, we can obtain the follow- 
ing plausible result. Sums of random variables are studied in Chapter 5. 


If X is an Erlang random variable with parameters A and r, 


Ww 24800) e c aml a WOO = apr 





4.10.2. Gamma Distribution 


The Erlang distribution is a special case of the gamma distribution. If the parameter r of 
an Erlang random variable is not an integer, but r > 0, the random variable has a gamma 
distribution. However, in the Erlang density function, the parameter r appears as r factorial. 
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Therefore, to define a gamma random variable, we require a generalization of the factorial 
function. 


Definition 
The gamma function is 


oo 


I(r) = [xe goto 


0 





It can be shown that the integral in the definition of I(r) is finite. Furthermore, by using inte- 
gration by parts it can be shown that 


I(r) = (r — Dr — 1) 
This result is left as an exercise. Therefore, if r is a positive integer (as in the Erlang distribution), 
I(r) ^ (r — 1)! 
Also, l'(1) = 0! = 1 and it can be shown that I' (1/2) = m. The gamma function can be in- 
terpreted as a generalization to noninteger values of r of the term (r — 1)! that is used in the 


Erlang probability density function. 
Now the gamma probability density function can be stated. 


Definition 


The random variable X with probability density function 


ryr =] =A 


m aa >0 4-20 
T(r Or X ( ) 
has a gamma random variable with parameters A > 0 andr > 0. If r is an integer, 
X has an Erlang distribution. 





Sketches of the gamma distribution for several values of A and r are shown in Fig. 4-26. It can 
be shown that f(x) satisfies the properties of a probability density function, and the following 
result can be obtained. Repeated integration by parts can be used, but the details are lengthy. 


If X is a gamma random variable with parameters A and r, 


we EA em m = wx 





Although the gamma distribution is not frequently used as a model for a physical system, 
the special case of the Erlang distribution is very useful for modeling random experiments. The 
exercises provide illustrations. Furthermore, the chi-squared distribution is a special case of 
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Figure 4-26 Gamma probability density functions 


for selected values of A and r. 


the gamma distribution in which A = 1/2 and r equals one of the values 1/2, 1, 3/2, 2,.... This 
distribution is used extensively in interval estimation and tests of hypotheses that are discussed 


in subsequent chapters. 


EXERCISES FOR SECTION 4-10 


4-96. Calls to a telephone system follow a Poisson distribu- 

tion with a mean of five calls per minute. 

(a) What is the name applied to the distribution and parame- 
ter values of the time until the tenth call? 

(b) What is the mean time until the tenth call? 

(c) What is the mean time between the ninth and tenth calls? 


4-97. Continuation of Exercise 4-96. 

(a) What is the probability that exactly four calls occur within 
one minute? 

(b) If 10 separate one-minute intervals are chosen, what is the 
probability that all intervals contain more than two calls? 


4-98. Raw materials are studied for contamination. Suppose 

that the number of particles of contamination per pound of 

material is a Poisson random variable with a mean of 0.01 par- 

ticle per pound. 

(a) What is the expected number of pounds of material re- 
quired to obtain 15 particles of contamination? 

(b) What is the standard deviation of the pounds of materials 
required to obtain 15 particles of contamination? 

4-99. The time between failures of a laser in a cytogenics ma- 

chine is exponentially distributed with a mean of 25,000 hours. 

(a) What is the expected time until the second failure? 


(b) What is the probability that the time until the third failure 
exceeds 50,000 hours? 


4-100. Ina data communication system, several messages 

that arrive at a node are bundled into a packet before they 

are transmitted over the network. Assume the messages ar- 

rive at the node according to a Poisson process with 7 = 30 

messages per minute. Five messages are used to form a 

packet. 

(a) What is the mean time until a packet is formed, that is, un- 
til five messages arrived at the node? 

(b) What is the standard deviation of the time until a packet is 
formed? 

(c) What is the probability that a packet is formed in less than 
10 seconds? 

(d) What is the probability that a packet is formed in less than 
5 seconds? 


4-101. Errors caused by contamination on optical disks oc- 

cur at the rate of one error every 10° bits. Assume the errors 

follow a Poisson distribution. 

(a) What is the mean number of bits until five errors occur? 

(b) What is the standard deviation of the number of bits until 
five errors occur? 


(c) The error-correcting code might be ineffective if there are 
three or more errors within 10? bits. What is the probabil- 
ity of this event? 


4-102. Calls to the help line of a large computer distributor 

follow a Possion distribution with a mean of 20 calls per minute. 

(a) What is the mean time until the one-hundredth call? 

(b) What is the mean time between call numbers 50 and 80? 

(c) What is the probability that three or more calls occur 
within 15 seconds? 


4-103. The time between arrivals of customers at an auto- 

matic teller machine is an exponential random variable with a 

mean of 5 minutes. 

(a) What is the probability that more than three customers 
arrive in 10 minutes? 

(b) What is the probability that the time until the fifth cus- 
tomer arrives 1s less than 15 minutes? 
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4-104. The time between process problems in a manufac- 

turing line is exponentially distributed with a mean of 30 days. 

(a) What is the expected time until the fourth problem? 

(b) What is the probability that the time until the fourth prob- 
lem exceeds 120 days? 

4-105. Use the properties of the gamma function to evaluate 

the following: 

(a) T(6) 

(c) 1(9/2) 

4-106. Use integration by parts to show that l'(r) = (r — 1) 

tr Ty 

4-107. Show that the gamma density function f(x, A,r) in- 

tegrates to 1. 


(b) 1(5/2) 


4-108. Use the result for the gamma distribution to determine 
the mean and variance of a chi-square distribution with r — 7/2. 


As mentioned previously, the Weibull distribution is often used to model the time until failure 
of many different physical systems. The parameters in the distribution provide a great deal of 
flexibility to model systems in which the number of failures increases with time (bearing 
wear), decreases with time (some semiconductors), or remains constant with time (failures 
caused by external shocks to the system). 


Definition 


The random variable X with probability density function 


| 


g-1 AN 
) ap|-(ž) | torx — 0 


(4-22) 


is a Weibull random variable with scale parameter ò > 0 and shape parameter B > 0. 





The flexibility of the Weibull distribution is illustrated by the graphs of selected probability 
density functions in Fig. 4-27. By inspecting the probability density function, it is seen that 
when B = 1, the Weibull distribution is identical to the exponential distribution. 

The cumulative distribution function is often used to compute probabilities. The follow- 


ing result can be obtained. 


If X has a Weibull distribution with parameters ò and B, then the cumulative distri- 


bution function of X is 





(4-23) 
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Figure 4-27 Weibull probability density functions 
for selected values of ò and p. 


Also, the following result can be obtained. 


If X has a Weibull distribution with parameters 6 and p, 


qve 809) = Ih (i + *) "adl qr = esp = snp (i 4 z) 7 sr t 3! 


(4-24) 


The time to failure (in hours) of a bearing in a mechanical shaft is satisfactorily modeled as a 
Weibull random variable with B = 1/2, and à = 5000 hours. Determine the mean time until 
failure. 

From the expression for the mean, 


E(X) = 5000T[1 + (1/0.5)] = 5000T[3] = 5000 x 2! = 10,000 hours 


Determine the probability that a bearing lasts at least 6000 hours. Now 


6000 y —1.095 
P(x > 6000) = 1 — F(6000) = exp-|| zc) | = e 1 = 0334 


Consequently, only 33.4% of all bearings last at least 6000 hours. 


EXERCISES FOR SECTION 4-11 


4-109. Suppose that X has a Weibull distribution with 
D = 0.2 and ò = 100 hours. Determine the mean and vari- 
ance of X. 


4-110. Suppose that X has a Weibull distribution B = 0.2 

and 6 = 100 hours. Determine the following: 

(a) P(X < 10,000) (b) P(X > 5000) 

4-111. Assume that the life of a roller bearing follows a 

Weibull distribution with parameters B = 2 and 6 = 10,000 

hours. 

(a) Determine the probability that a bearing lasts at least 8000 
hours. 

(b) Determine the mean time until failure of a bearing. 

(c) If 10 bearings are in use and failures occur independently, 
what is the probability that all 10 bearings last at least 
8000 hours? 


4-112. The life (in hours) of a computer processing unit 

(CPU) is modeled by a Weibull distribution with parameters 

B = 3 andò = 900 hours. 

(a) Determine the mean life of the CPU. 

(b) Determine the variance of the life of the CPU. 

(c) What is the probability that the CPU fails before 500 
hours? 
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4-113. Assume the life of a packaged magnetic disk exposed 

to corrosive gases has a Weibull distribution with B = 0.5 and 

the mean life is 600 hours. 

(a) Determine the probability that a packaged disk lasts at 
least 500 hours. 

(b) Determine the probability that a packaged disk fails be- 
fore 400 hours. 


4-114. The life of a recirculating pump follows a Weibull 

distribution with parameters B = 2, and ò = 700 hours. 

(a) Determine the mean life of a pump. 

(b) Determine the variance of the life of a pump. 

(c) What is the probability that a pump will last longer than its 
mean? 


4-115. The life (in hours) of a magnetic resonance imagin- 
ing machine (MRI) is modeled by a Weibull distribution with 
parameters B = 2 and ò = 500 hours. 

(a) Determine the mean life of the MRI. 

(b) Determine the variance of the life of the MRI. 

(c) What is the probability that the MRI fails before 250 hours? 


4-116. If X 1s a Weibull random variable with B = 1, and 
ò = 1000, what is another name for the distribution of X and 
what is the mean of X? 


Variables in a system sometimes follow an exponential relationship as x = exp(w). If the 
exponent is a random variable, say W, X = exp(W) is a random variable and the distribu- 
tion of X is of interest. An important special case occurs when W has a normal distribution. 
In that case, the distribution of X 1s called a lognormal distribution. The name follows 
from the transformation In (X) = W. That is, the natural logarithm of X is normally dis- 


tributed. 


Probabilities for X are obtained from the transformation to W, but we need to recognize 
that the range of X is (0, ©). Suppose that Wis normally distributed with mean 0 and variance 
Q^; then the cumulative distribution function for X is 


F(x) = P[X = x] 


-Pz- 


= Plexp(W) = x] = P|W x In (x)| 
0 


ins) = 3 E a | 


for x > 0, where Z is a standard normal random variable. Therefore, Appendix Table II can be 
used to determine the probability. Also, F(x) = 0, for x = 0. 

The probability density function of X can be obtained from the derivative of F(x). 
This derivative is applied to the last term in the expression for F(x), the integral of the stan- 
dard normal density function. Furthermore, from the probability density function, the 
mean and variance of X can be derived. The details are omitted, but a summary of results 


follows. 
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EXAMPLE 4-26 
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Figure 4-28 Lognormal probability density functions with 
0 = 0 for selected values of w. 


Let W have a normal distribution mean 0 and variance o^; then X = exp(W) is a log- 
normal random variable with probability density function 


=] 
U) a 


l 
Me) = xo V2 exp - Ds 


The mean and variance of X are 





The parameters of a lognormal distribution are 0 and «^, but care is needed to interpret that 
these are the mean and variance of the normal random variable W. The mean and variance of 
X are the functions of these parameters shown in (4-25). Figure 4-28 illustrates lognormal dis- 
tributions for selected values of the parameters. 

The lifetime of a product that degrades over time 1s often modeled by a lognormal ran- 
dom variable. For example, this is a common distribution for the lifetime of a semiconductor 
laser. A Weibull distribution can also be used in this type of application, and with an appro- 
priate choice for parameters, it can approximate a selected lognormal distribution. However, 
a lognormal distribution is derived from a simple exponential function of a normal random 
variable, so it is easy to understand and easy to evaluate probabilities. 


The lifetime of a semiconductor laser has a lognormal distribution with 0 — 10 hours and 
w = 1.5 hours. What is the probability the lifetime exceeds 10,000 hours? 
From the cumulative distribution function for X 


P(X > 10,000) = 1 — P[exp(W) = 10,000] = 1 — P[W = 1n(10,000)] 
=. — 10 
= D —————————————— 


= Ja 1 — $(-0.52) = 1 — 0.30 = 0.70 


4-12 LOGNORMAL DISTRIBUTION 137 


What lifetime is exceeded by 9995 of lasers? The question is to determine x such that 


P(X > x) = 0.99. Therefore, 


P(X > x) = Plexp(W) > x] 


= PW -1n(x)] = 1 — (EL — 0.99 


1.5 


From Appendix Table II, 1 — ®(z) = 0.99 when z = — 2.33. Therefore, 


In(x) — 10 
1.5 


== 2.55 


and x = exp(6.505) = 668.48 hours. 


Determine the mean and standard deviation of lifetime. Now, 


E(X) = e9* 7 = exp(10 + 1.125) = 67,846.3 


V(X) = e * v (go — 


1) = exp(20 + 225)[exp(2.25) — 1] = 39,070,059,886.6 


so the standard deviation of X is 197,661.5 hours. Notice that the standard deviation of life- 


time is large relative to the mean. 


EXERCISES FOR SECTION 4-12 


4-117. Suppose that X has a lognormal distribution with 
parameters 0 = 5 and o? = 9. Determine the following: 

(a) P(X « 13,300) 

(b) The value for x such that P(X = x) = 0.95 

(c) The mean and variance of X 


4-118. Suppose that X has a lognormal distribution with 
parameters 0 = —2 and o = 9. Determine the following: 
(a) P(500 < X < 1000) 

(b) The value for x such that PLY < x) = 0.1 

(c) The mean and variance of X 


4-119. Suppose that X has a lognormal distribution with pa- 

rameters 0 — 2 and o — 4. Determine the following: 

(a) P(X « 500) 

(b) The conditional probability that X < 1500 given that 
X > 1000 

(c) What does the difference between the probabilities in 
parts (a) and (b) imply about lifetimes of lognormal ran- 
dom variables? 


4-120. The length of time (in seconds) that a user views a 

page on a Web site before moving to another page is a lognor- 

mal random variable with parameters 0 = 0.5 and o? = 1. 

(a) What is the probability that a page is viewed for more than 
10 seconds? 

(b) What is the length of time that 50% of users view the page? 

(c) What is the mean and standard deviation of the time until 
a user moves from the page? 


4-121. Suppose that X has a lognormal distribution and that 
the mean and variance of X are 100 and 85,000, respectively. 


Determine the parameters 0 and o^ of the lognormal distribu- 

tion. (Hint: define x = exp(0) and y = exp(w^) and write two 

equations in terms of x and y.) 

4-122. The lifetime of a semiconductor laser has a log- 

normal distribution, and it is known that the mean and stan- 

dard deviation of lifetime are 10,000 and 20,000, respec- 

tively. 

(a) Calculate the parameters of the lognormal distribution 

(b) Determine the probability that a lifetime exceeds 10,000 
hours 

(c) Determine the lifetime that is exceeded by 90% of lasers 


4-123. Derive the probability density function of a lognor- 
mal random variable from the derivative of the cumulative 
distribution function. 


Supplemental Exercises 


4-124. Suppose that f(x) = 0.5x —1 for 2 «x « 4. 
Determine the following: 

(a) P(X « 2.5) 

(b) P(X > 3) 

(c) P(2.5 < X < 3.5) 

4-125. Continuation of Exercise 4-124. Determine the 
cumulative distribution function of the random variable. 
4-126. Continuation of Exercise 4-124. Determine the 
mean and variance of the random variable. 
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4-127. The time between calls is exponentially distributed 

with a mean time between calls of 10 minutes. 

(a) What is the probability that the time until the first call 1s 
less than 5 minutes? 

(b) What is the probability that the time until the first call is 
between 5 and 15 minutes? 

(c) Determine the length of an interval of time such that the 
probability of at least one call in the interval 1s 0.90. 

4-128. Continuation of Exercise 4-127. 

(a) If there has not been a call in 10 minutes, what is the proba- 
bility that the time until the next call is less than 5 minutes? 

(b) What is the probability that there are no calls in the inter- 
vals from 10:00 to 10:05, from 11:30 to 11:35, and from 
2:00 to 2:05? 

4-129. Continuation of Exercise 4-127. 

(a) What is the probability that the time until the third call is 
greater than 30 minutes? 

(b) What is the mean time until the fifth call? 


4-130. The CPU of a personal computer has a lifetime that 
is exponentially distributed with a mean lifetime of six years. 
You have owned this CPU for three years. What 1s the proba- 
bility that the CPU fails in the next three years? 

4-131. Continuation of Exercise 4-130. Assume that your 
corporation has owned 10 CPUS for three years, and assume 
that the CPUs fail independently. What is the probability that 
at least one fails within the next three years? 


4-132. Suppose that X has a lognormal distribution with 
parameters 0 — 0 and o? — 4. Determine the following: 

(a) P(10 « X « 50) 

(b) The value for x such that P(X < x) = 0.05 

(c) The mean and variance of X 


4-133. Suppose that X has a lognormal distribution and that 
the mean and variance of X are 50 and 4000, respectively. 
Determine the following: 

(a) The parameters 0 and o of the lognormal distribution 
(b) The probability that X is less than 150 


4-134. Asbestos fibers in a dust sample are identified by an 
electron microscope after sample preparation. Suppose that 
the number of fibers is a Poisson random variable and the 
mean number of fibers per squared centimeter of surface dust 
is 100. A sample of 800 square centimeters of dust 1s analyzed. 
Assume a particular grid cell under the microscope represents 
1/160,000 of the sample. 
(a) What is the probability that at least one fiber is visible in 
the grid cell? 
(b) What is the mean of the number of grid cells that need to 
be viewed to observe 10 that contain fibers? 
(c) What is the standard deviation of the number of grid cells 
that need to be viewed to observe 10 that contain fibers? 
4-135. Without an automated irrigation system, the height 
of plants two weeks after germination is normally distributed 
with a mean of 2.5 centimeters and a standard deviation of 0.5 
centimeters. 


(a) What is the probability that a plant's height is greater than 
2.25 centimeters? 

(b) What is the probability that a plant's height is between 2.0 
and 3.0 centimeters? 

(c) What height is exceeded by 90% of the plants? 


4-136. Continuation of Exercise 4-135. With an automated 

irrigation system, a plant grows to a height of 3.5 centimeters 

two weeks after germination. 

(a) What is the probability of obtaining a plant of this height or 
greater from the distribution of heights in Exercise 4-135. 

(b) Do you think the automated irrigation system increases 
the plant height at two weeks after germination? 


4-137. The thickness of a laminated covering for a wood 
surface is normally distributed with a mean of 5 millimeters 
and a standard deviation of 0.2 millimeter. 

(a) What is the probability that a covering thickness is greater 
than 5.5 millimeters? 

(b) If the specifications require the thickness to be between 
4.5 and 5.5 millimeters, what proportion of coverings do 
not meet specifications? 

(c) The covering thickness of 95% of samples is below what 
value? 


4-138. The diameter of the dot produced by a printer is nor- 
mally distributed with a mean diameter of 0.002 inch. 
Suppose that the specifications require the dot diameter to be 
between 0.0014 and 0.0026 inch. If the probability that a dot 
meets specifications is to be 0.9973, what standard deviation 
is needed? 


4-139. Continuation of Exercise 4-138. Assume that the stan- 
dard deviation of the size of a dot is 0.0004 inch. If the proba- 
bility that a dot meets specifications is to be 0.9973, what spec- 
ifications are needed? Assume that the specifications are to be 
chosen symmetrically around the mean of 0.002. 


4-140. The life of a semiconductor laser at a constant power 

is normally distributed with a mean of 7000 hours and a stan- 

dard deviation of 600 hours. 

(a) What is the probability that a laser fails before 5,800 
hours? 

(b) What is the life in hours that 90% of the lasers exceed? 


4-141. Continuation of Exercise 4-140. What should the 
mean life equal in order for 99% of the lasers to exceed 10,000 
hours before failure? 


4-142. Continuation of Exercise 4-140. A product contains 
three lasers, and the product fails if any of the lasers fails. 
Assume the lasers fail independently. What should the mean 
life equal in order for 99% of the products to exceed 10,000 
hours before failure? 

4-143. Continuation of Exercise 140. Rework parts (a) and 
(b). Assume that the lifetime is an exponential random vari- 
able with the same mean. 

4-144. Continuation of Exercise 4-140. Rework parts (a) 
and (b). Assume that the lifetime is a lognormal random vari- 
able with the same mean and standard deviation. 


4-145. A square inch of carpeting contains 50 carpet fibers. 

The probability of a damaged fiber is 0.0001. Assume the 

damaged fibers occur independently. 

(a) Approximate the probability of one or more damaged 
fibers in 1 square yard of carpeting. 

(b) Approximate the probability of four or more damaged 
fibers in 1 square yard of carpeting. 


4-146. An airline makes 200 reservations for a flight that 
holds 185 passengers. The probability that a passenger arrives 
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for the flight 1s 0.9 and the passengers are assumed to be inde- 

pendent. 

(a) Approximate the probability that all the passengers that 
arrive can be seated. 

(b) Approximate the probability that there are empty seats. 

(c) Approximate the number of reservations that the airline 
should make so that the probability that everyone who ar- 
rives can be seated is 0.95. | Hint: Successively try values 
for the number of reservations. | 


MIND-EXPANDING EXERCISES 


4-147. The steps in this exercise lead to the probabil- 

ity density function of an Erlang random variable X with 

parameters A and r, f(x) =A” le ^*/(r — 1)!,x 0, 

punc er 

(a) Use the Poisson distribution to express P(X > x). 

(b) Use the result from part (a) to determine the cumu- 
lative distribution function of X. 

(c) Differentiate the cumulative distribution function in 
part (b) and simplify to obtain the probability den- 
sity function of X. 

4-148. A bearing assembly contains 10 bearings. The 

bearing diameters are assumed to be independent and 

normally distributed with a mean of 1.5 millimeters and 

a standard deviation of 0.025 millimeter. What is the 

probability that the maximum diameter bearing in the 

assembly exceeds 1.6 millimeters? 


4-149. Let the random variable X denote a measure- 

ment from a manufactured product. Suppose the target 

value for the measurement is m. For example, X could 

denote a dimensional length, and the target might be 10 

millimeters. The quality loss of the process producing 

the product is defined to be the expected value of 
$k(X — m)*, where k is a constant that relates a devia- 
tion from target to a loss measured in dollars. 

(a) Suppose X is a continuous random variable with 
E(X) — m and V(X) — o?. What is the quality loss 
of the process? 

(b) Suppose X is a continuous random variable with 
E(X) = p and V(X) = o°. What is the quality loss 
of the process? 


4-150. The lifetime of an electronic amplifier is mod- 
eled as an exponential random variable. If 10% of the 


amplifiers have a mean of 20,000 hours and the remain- 
ing amplifiers have a mean of 50,000 hours, what pro- 
portion of the amplifiers fail before 60,000 hours? 


4-151. Lack of Memory Property. Show that for 
an exponential random variable X, P(X « t, + t, | 
X >t) = P(X < t) 

4-152. A process is said to be of six-sigma quality if 

the process mean is at least six standard deviations from 

the nearest specification. Assume a normally distributed 
measurement. 

(a) If a process mean is centered between the upper and 
lower specifications at a distance of six standard de- 
viations from each, what is the probability that a 
product does not meet specifications? Using the 
result that 0.000001 equals one part per million, 
express the answer in parts per million. 

(b) Because it is difficult to maintain a process mean 
centered between the specifications, the probability 
of a product not meeting specifications is often cal- 
culated after assuming the process shifts. If the 
process mean positioned as in part (a) shifts upward 
by 1.5 standard deviations, what is the probability 
that a product does not meet specifications? Express 
the answer in parts per million. 

(c) Rework part (a). Assume that the process mean is 
at a distance of three standard deviations. 

(d) Rework part (b). Assume that the process mean is at 
a distance of three standard deviations and then 
shifts upward by 1.5 standard deviations. 

(e) Compare the results in parts (b) and (d) and comment. 
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IMPORTANT TERMS AND CONCEPTS 


In the E-book, click on any 
term or concept below to 
go to that subject. 

Chi-squared 
distribution 

Continuous uniform 
distribution 

Cumulative probability 
distribution function- 
continuous random 
variable 

Erlang distribution 

Exponential distribution 


Gamma distribution 

Lack of memory 
property-continuous 
random variable 

Lognormal 
distribution 

Mean-continuous 
random variable 

Mean-function of a 
continuous random 
variable 


Normal approximation to 
binomial and Poisson 
probabilities 

Normal distribution 

Probability density 
function 

Probability distribution- 
continuous random 
variable 

Standard deviation- 
continuous random 
variable 


Standard normal 
distribution 

Standardizing 

Variance-continuous 


random variable 
Weibull distribution 


CD MATERIAL 
Continuity correction 
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Mean and Variance of the Normal Distribution (CD Only) 
In the derivations below, the mean and variance of a normal random variable are shown 
to be u and o”, respectively. The mean of x is 
eg ("ne 


E(X) = [ae 


— 00 


By making the change of variable y = (x — w)/o, the integral becomes 


e "^? -ymQ 
E(X) » | dro | yw 
-yn 





e 
The first integral in the expression above equals 1 because US is a probability density 
T 


function and the second integral is found to be 0 by either formally making the change of vari- 
able u = —y?/2 or noticing the symmetry of the integrand about y = 0. Therefore, E(X) = p. 
The variance of X is 


e (Ae 


V(X) = |e- u) E 


By making the change of variable y = (x — w)/o, the integral becomes 





-yn 
Upon integrating by parts with u = y and dv = y EE dy, V(X) is found to be o. 
V 27 


4-8 CONTINUITY CORRECTIONS TO IMPROVE 
THE APPROXIMATION 


From Fig. 4-19 it can be seen that a probability such as P(3 < X S 7) is better approximated 
by the area under the normal curve from 2.5 to 7.5. This observation provides a method to im- 
prove the approximation of binomial probabilities. Because a continuous normal distribution 
is used to approximate a discrete binomial distribution, the modification is referred to as a 
continuity correction. 


If X is a binomial random variable with parameters n and p, and if x = 0, 1, 2, ..., n, 
the continuity correction to improve approximations obtained from the normal dis- 
tribution is 


coe Ui. NNP 
P(X = x) = 


P(X =x+05)=P| a Mo 
Vo = 99) 


XOS 
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EXAMPLE $4-1 


A way to remember the approximation is to write the probability in terms of S or = and 
then add or subtract the 0.5 correction factor to make the probability greater. 


Consider the situation in Example 4-20 with n = 50 and p = 0.1. The probability P(X = 2) 
is better approximated as 


Zor 9.959 


= = — = 
x P(Z = —1.18) = 0.119 


PX = 2) = Fx 25)» P(z« 


and this result is closer to the exact probability of 0.112 than the previous result of 0.08. 
As another example, P(8 < X) = P(9 < X) and this is better approximated as 


=). = 5 


Po =X) = PBS =x) = P( 215 = Z) = P(1.65 = Z) = 005 


We can even approximate P(X = 5) = P(5 = X < 5) as 


S= 0s- S VO cm 
= Z s — ] = P(-0.24 < Z x 024) = 0.19 


< < a 
PORE p( 20 212 


and this compares well with the exact answer of 0.1849. 


EXERCISES FOR SECTION 4-8 


S4-1. Continuity correction. The normal approximation of 
a binomial probability is sometimes modified by a correction 
factor of 0.5 that improves the approximation. Suppose that X 
is binomial with n = 50 and p = 0.1. Because X is a discrete 
random variable, P(X < 2) = P(X x 2.5). However, the nor- 
mal approximation to P(X < 2) can be improved by applying 
the approximation to P(X = 2.5). 

(a) Approximate P(X = 2) by computing the z-value corre- 
sponding to x — 2.5. 

(b) Approximate P(X < 2) by computing the z-value corre- 
sponding to x = 2. 

(c) Compare the results in parts (a) and (b) to the exact value 
of P(X S 2) to evaluate the effectiveness of the continuity 
correction. 

(d) Use the continuity correction to approximate P(X < 10). 


S4-2. Continuity correction. Suppose that X is binomial 

with n = 50 and p = 0.1. Because X is a discrete random vari- 

able, P(X = 2) = P(X = 1.5). However, the normal approxi- 

mation to P(X = 2) can be improved by applying the approxi- 

mation to P(X = 1.5). The continuity correction of 0.5 is either 

added or subtracted. The easy rule to remember is that the con- 

tinuity correction is always applied to make the approximating 

normal probability greatest. 

(a) Approximate P(X = 2) by computing the z-value corre- 
sponding to 1.5. 

(b) Approximate P(X = 2) by computing the z-value corre- 
sponding to 2. 

(c) Compare the results in parts (a) and (b) to the exact value 
of P(X = 2) to evaluate the effectiveness of the continuity 
correction. 


(d) Use the continuity correction to approximate P(X > 6). 


94-3. Continuity correction. Suppose that X is binomial 

with n — 50 and p — 0.1. Because X is a discrete random vari- 

able, P(2 = Xx 5) = P(1.5 E X x 5.5). However, the normal 

approximation to P(2 < X < 5) can be improved by applying 

the approximation to P(1.5 = X x 5.5). 

(a) Approximate P(2 = X < 5) by computing the z-values 
corresponding to 1.5 and 5.5. 

(b) Approximate P(2 = X < 5) by computing the z-values 
corresponding to 2 and 5. 

S4-4. Continuity correction. Suppose that X is binomial 

with n = 50 and p = 0.1. Then, P(X = 10) = POO S Xx 10). 

Using the results for the continuity corrections, we can ap- 

proximate P(10 < X < 10) by applying the normal standardi- 

zation to P(9.5 = X x 10.5). 

(a) Approximate P(X — 10) by computing the z-values corre- 
sponding to 9.5 and 10.5. 

(b) Approximate P(X — 5). 

S4-5. Continuity correction. The manufacturing of 

semiconductor chips produces 2% defective chips. Assume 

that the chips are independent and that a lot contains 1000 

chips. 

(a) Use the continuity correction to approximate the probabil- 
ity that 20 to 30 chips in the lot are defective. 

(b) Use the continuity correction to approximate the probabil- 
ity that exactly 20 chips are defective. 

(c) Determine the number of defective chips, x, such that the 
normal approximation for the probability of obtaining x 
defective chips is greatest. 
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5-10 CHEBYSHEV'S INEQUALITY 


(CD ONLY) 


After careful study of this chapter you should be able to do the following: 


Ui» W Ne 


probability density function 


. Use joint probability mass functions and joint probability density functions to calculate probabilities 
. Calculate marginal and conditional probability distributions from joint probability distributions 
. Use the multinomial distribution to determine probabilities 

. Interpret and calculate covariances and correlations between random variables 


. Understand properties of a bivariate normal distribution and be able to draw contour plots for the 


6. Calculate means and variance for linear combinations of random variables and calculate proba- 
bilities for linear combinations of normally distributed random variables 
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CD MATERIAL 

7. Determine the distribution of a function of one or more random variables 

8. Calculate moment generating functions and use them to determine moments for random variables 
and use the uniqueness property to determine the distribution of a random variable 


9. Provide bounds on probabilities for arbitrary distributions based on Chebyshev's inequality 


Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e- Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for the text sections that appear on CD only. These exercises may be found within the 

e- Text immediately following the section they accompany. 


In Chapters 3 and 4 we studied probability distributions for a single random variable. However, 
it is often useful to have more than one random variable defined in a random experiment. For ex- 
ample, in the classification of transmitted and received signals, each signal can be classified as 
high, medium, or low quality. We might define the random variable X to be the number of high- 
quality signals received and the random variable Y to be the number of low-quality signals 
received. In another example, the continuous random variable X can denote the length of one di- 
mension of an injection-molded part, and the continuous random variable Y might denote the 
length of another dimension. We might be interested in probabilities that can be expressed in 
terms of both X and Y. For example, 1f the specifications for X and Y are (2.95 to 3.05) and (7.60 
to 7.80) millimeters, respectively, we might be interested in the probability that a part satisfies 
both specifications; that is, P(2.95 « X « 3.05 and 7.60 « Y « 7.80). 

In general, if X and Y are two random variables, the probability distribution that defines 
their simultaneous behavior is called a joint probability distribution. In this chapter, we 
investigate some important properties of these joint distributions. 


5-1 TWO DISCRETE RANDOM VARIABLES 


5.1.1 Joint Probability Distributions 


EXAMPLE 5-1 


For simplicity, we begin by considering random experiments in which only two random vari- 
ables are studied. In later sections, we generalize the presentation to the joint probability 
distribution of more than two random variables. 


In the development of a new receiver for the transmission of digital information, each re- 
ceived bit is rated as acceptable, suspect, or unacceptable, depending on the quality of the 
received signal, with probabilities 0.9, 0.08, and 0.02, respectively. Assume that the ratings of 
each bit are independent. 

In the first four bits transmitted, let 


X denote the number of acceptable bits 


Y denote the number of suspect bits 


Then, the distribution of X is binomial with n = 4 and p = 0.9, and the distribution of Y is 
binomial with n = 4 and p = 0.08. However, because only four bits are being rated, the possible 
values of X and Y are restricted to the points shown in the graph in Fig. 5-1. Although the possi- 
ble values of X are 0, 1, 2, 3, or 4, if y = 3, x = 0 or 1. By specifying the probability of each of 
the points in Fig. 5-1, we specify the joint probability distribution of X and Y. Similarly to an in- 
dividual random variable, we define the range of the random variables (X, Y) to be the set of 
points (x, y) in two-dimensional space for which the probability that X = x and Y = y is positive. 


Figure 5-1 Joint 
probability distribution 
of X and Y in Example 
5-1. 


Definition 


EXAMPLE 5-2 
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If X and Y are discrete random variables, the joint probability distribution of X and Y is a 
description of the set of points (x, y) in the range of (X, Y) along with the probability of each point. 
The joint probability distribution of two random variables is sometimes referred to as the bivari- 
ate probability distribution or bivariate distribution of the random variables. One way to 
describe the joint probability distribution of two discrete random variables is through a joint 
probability mass function. Also, PLY = x and Y = y) is usually written as PLY = x, Y = y). 


The joint probability mass function of the discrete random variables X and Y, 
denoted as fyy(x, y), satisfies 


(1) fxy Go y) = 0 


(2) ie) = 


(3) fxyGo y) E IHE c X, E) 





Subscripts are used to indicate the random variables in the bivariate probability distribution. 
Just as the probability mass function of a single random variable X is assumed to be zero at all 
values outside the range of X, so the joint probability mass function of X and Y is assumed to 
be zero at values for which a probability is not specified. 


Probabilities for each point in Fig. 5-1 are determined as follows. For example, PLY = 2, Y = 1) 
is the probability that exactly two acceptable bits and exactly one suspect bit are received among 
the four bits transferred. Let a, s, and u denote acceptable, suspect, and unacceptable bits, respec- 
tively. By the assumption of independence, 


P(aasu) = 0.9(0.9)(0.08)(0.02) = 0.0013 


The number of possible sequences consisting of two a's, one s, and one u is shown in the CD 
material for Chapter 2: 


4! 
2H! 





Therefore, 


P(aasu) = 12(0.0013) = 0.0156 
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and 
Ju Larrea 2.7 = 1) = 00156 


The probabilities for all points in Fig. 5-1 are shown next to the point and the figure describes 
the joint probability distribution of X and Y. 


5.1.2 Marginal Probability Distributions 


EXAMPLE 5-3 


Figure 5-2 Marginal 
probability distribu- 
tions of X and Y from 
Fig. 5-1. 


If more than one random variable is defined in a random experiment, it is important to distin- 
guish between the joint probability distribution of X and Y and the probability distribution of 
each variable individually. The individual probability distribution of a random variable 1s re- 
ferred to as its marginal probability distribution. In Example 5-1, we mentioned that the 
marginal probability distribution of X is binomial with n — 4 and p — 0.9 and the marginal 
probability distribution of Y is binomial with n — 4 and p — 0.08. 

In general, the marginal probability distribution of X can be determined from the joint 
probability distribution of X and other random variables. For example, to determine P(X — x), 
we sum P(X — x, Y — y) over all points in the range of (X, Y) for which X — x. Subscripts on 
the probability mass functions distinguish between the random variables. 


The joint probability distribution of X and Y in Fig. 5-1 can be used to find the marginal prob- 
ability distribution of X. For example, 


P(X = 3) = P(X = 3, Y = 0) + P(X =3,Y=1) 
= 0.0583 + 0.2333 = 0.292 


As expected, this probability matches the result obtained from the binomial probability distribu- 
tion for X; that is, P(X = 3) = (3)0.9°0.1' = 0.292. The marginal probability distribution for X 
is found by summing the probabilities in each column, whereas the marginal probability distribu- 
tion for Y is found by summing the probabilities in each row. The results are shown in Fig. 5-2. 


Although the marginal probability distribution of X in the previous example can be 
determined directly from the description of the experiment, in some problems the marginal 
probability distribution is determined from the joint probability distribution. 


fy(y = 
0.00004 4e 


0.00188 3e 
0.03250 2e 
0.24925 le 


0.71637 Oe e e e e 
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fx) = 0.0001 0.0036 0.0486 0.2916 0.6561 
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Definition 
If X and Y are discrete random variables with joint probability mass function fy, (x, y), 
then the marginal probability mass functions of X and Y are 


EG x) > fa y) sn pes Je es re > fao y) 


(5-2) 


where R, denotes the set of all points in the range of (X, Y) for which X = x and 
R, denotes the set of all points in the range of (X, Y) for which Y = y 





Given a joint probability mass function for random variables X and Y, E(X) and V(X) can 
be obtained directly from the joint probability distribution of X and Y or by first calculating the 
marginal probability distribution of X and then determining E(X) and V(X) by the usual 
method. This is shown in the following equation. 


Mean and 
Variance from If the marginal probability distribution of X has the probability mass function f, (x), 
Joint 
then 


Distribution 


OG) oye D> xfer) m x4 > footss9)) B 2v Dhar y) 


X X 


= 2, xfxyGx. y) (5-3) 


= De = Ly) fyl Js Die z wy) > f. y) 
> 2, [vom Ly) fxy(x, = 2, (s Ly) fxy (x, y) 


where A, denotes the set of all points in the range of (X, Y) for which X = x and R 
denotes the set of all points in the range of (X, Y) 





EXAMPLE 5-4 In Example 5-1, E(X) can be found as 


E(X) = OL fyy(0, 0) + fyy(0, 1) + fxy(0, 2) + fxy(0. 3) + fxr(0, 4)] 
+ I fxy(1, 0)  fxy(1, 1) + fxy(1, 2) + Av, 3)] 
+ 2| fxy(2, 0) + fxy(2. 1)  fxy(2. 2)] 
+ 3[ fxy(3, 0)  fxy(3, 1)] 
T Al fxy(4, 0)] 
= 0[0.0001] + 1[0.0036] + 2[0.0486] + 3[0.02916] + 4[0.6561] = 3.6 


Alternatively, because the marginal probability distribution of X is binomial, 


E(X) = np = 4(0.9) = 3.6 
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The calculation using the joint probability distribution can be used to determine E(X) even in 
cases in which the marginal probability distribution of X 1s not known. As practice, you can 
use the joint probability distribution to verify that E(Y) — 0.32 in Example 5-1. 

Also, 


V(X) = np(1 — p) = 4(0.9)(1 — 0.9) = 0.36 


Verify that the same result can be obtained from the joint probability distribution of X and Y. 


5-1.3 Conditional Probability Distributions 


EXAMPLE 5-5 


Definition 


When two random variables are defined in a random experiment, knowledge of one can change 
the probabilities that we associate with the values of the other. Recall that in Example 5-1, X 
denotes the number of acceptable bits and Y denotes the number of suspect bits received by a 
receiver. Because only four bits are transmitted, if X = 4, Y must equal 0. Using the notation for 
conditional probabilities from Chapter 2, we can write this result as P(Y = 0|X = 4) = 1. If 
X = 3, Y can only equal 0 or 1. Consequently, the random variables X and Y can be considered 
to be dependent. Knowledge of the value obtained for X changes the probabilities associated 
with the values of Y. 

Recall that the definition of conditional probability for events A and B is P(B|A) = 
P(A N B)/P(A). This definition can be applied with the event A4 defined to be X = x and event 
B defined to be Y — y. 


For Example 5-1, X and Y denote the number of acceptable and suspect bits received, respec- 
tively. The remaining bits are unacceptable. 


P(Y = 0|X = 3) = P(X = 3, Y = 0yP(X = 3) 
= fyy(3, 0)/fx(3) = 0.05832/0.2916 = 0.200 


The probability that Y = 1 given that X = 3 is 


PY=1 X = 3) HP xe 3,¥ = PA= 3) 
= fry(3, 1)/fr(3) = 0.2333/0.2916 = 0.800 


Given that X = 3, the only possible values for Y are 0 and 1. Notice that P(Y = 0|X = 3) + 
P(Y = 1|X = 3) = 1. The values 0 and 1 for Y along with the probabilities 0.200 and 0.800 
define the conditional probability distribution of Y given that X = 3. 


Example 5-5 illustrates that the conditional probabilities that Y = y given that X = x can be 
thought of as a new probability distribution. The following definition generalizes these ideas. 


Given discrete random variables X and Y with joint probability mass function fyy(x, y) 
the conditional probability mass function of Y given X = x is 


frixQ) = fxy Co y Mfx(x) for fy(x) > 0 OD 
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The function fj,.( y) is used to find the probabilities of the possible values for Y given that X = x. 
That is, it is the probability mass function for the possible values of Y given that X = x. More pre- 
cisely, let R, denote the set of all points in the range of (X, Y) for which X = x. The conditional 
probability mass function provides the conditional probabilities for the values of Y in the set R,. 


Because a conditional probability mass function fy,(y) is a probability mass func- 
tion for all y in R, the following properties are satisfied: 


DO fry) = 0 


(2) 2, fy(y) = 1 


G) P(Y = y|X = x) = fy.) 





EXAMPLE 5-6 For the joint probability distribution in Fig. 5-1, fy,( y) is found by dividing each fyy(x, y) by 
fx (x). Here, f(x) is simply the sum of the probabilities in each column of Fig. 5-1. The func- 
tion fy( y) is shown in Fig. 5-3. In Fig. 5-3, each column sums to one because it is a proba- 
bility distribution. 


Properties of random variables can be extended to a conditional probability distribution 
of Y given X — x. The usual formulas for mean and variance can be applied to a conditional 
probability mass function. 


Definition 
Let R, denote the set of all points in the range of (X, Y) for which X = x. The 
conditional mean of Y given X = x, denoted as E(Y|x) or py, is 


E(Y|x) — > Vf yx) (5-6) 


and the conditional variance of Y given X = x, denoted as V(Y|x) or of» is 


WYl) = X (»- un fr») = > PI — wt 


X 





y. 
0.410 
4€ 
0.410 0.511 
3e ® 
0.154 0.383 0.640 
2e o e 


Figure 5-3 
Conditional probability 1 


distributions of Y given 
X — x, fy) in 0 , 0.0016 : 0.008 R 0.040 , 9.200 1.0 


Example 5-6. 0 1 2 B 4 m 


0.0256 0.096 0.320 0.800 
e e e e 
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EXAMPLE 5-7 


For the random variables in Example 5-1, the conditional mean of Y given X — 2 1s obtained 
from the conditional distribution in Fig. 5-3: 


E(Y|2) = py = 0(0.040) + 1(0.320) + 2(0.640) = 1.6 


The conditional mean is interpreted as the expected number of acceptable bits given that two 
of the four bits transmitted are suspect. The conditional variance of Y given X = 2 is 


V(Y|2) = (0 — pyj2)°(0.040) + (1 — 1.55:) (0.320) + (2 — 1512) (0.640) = 0.32 


5.1.4 Independence 


EXAMPLE 5-8 


Figure 5-4 (a) Joint 
and marginal probabil- 
ity distributions of X 
and Y in Example 5-8. 
(b) Conditional proba- 
bility distribution of Y 
given X — x in 
Example 5-8. 


In some random experiments, knowledge of the values of X does not change any of the prob- 
abilities associated with the values for Y. 


In a plastic molding operation, each part is classified as to whether it conforms to color and 
length specifications. Define the random variable X and Y as 


y= l 1 ifthe part conforms to color specifications 
0 otherwise 

l 1 ifthe part conforms to length specifications 
0 otherwise 


Assume the joint probability distribution of X and Y is defined by fyy(x, y) in Fig. 5-4(a). 
The marginal probability distributions of X and Y are also shown in Fig. 5-4(a). Note that 
hy, y) = fx) fy Cy). The conditional probability mass function fyj,( y) is shown in Fig. 
5-4(b). Notice that for any x, fy, ( y) = fyCy). That is, knowledge of whether or not the part meets 
color specifications does not change the probability that it meets length specifications. 


By analogy with independent events, we define two random variables to be independent 
whenever fyy(x, y) = fy(x) fy(y) for all x and y. Notice that independence implies that 
fxy GG, y) = fx(x) fyCy) for all x and y. If we find one pair of x and y in which the equality fails, 
X and Y are not independent. If two random variables are independent, then 


— yx. y) _ (x) fy») _ 
fn») = A Sf) Try) 





With similar calculations, the following equivalent statements can be shown. 


y y 
fy) = 
0.0098 0.9702 0.98 0.98 
0.98 le ® ] e e 
0.0002 0.0198 0.02 0.02 
0.02 e e Oe e 
0 1 x 0 1 x 
fx()= 0.01 0.99 
(a) (b) 


EXAMPLE 5-9 
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For discrete random variables X and Y, if any one of the following properties is true, 
the others are also true, and X and Y are independent. 


(1) falx y) = fa(x)fv(v) forall x and y 


(2) /fyw(y)- fy(y) for all x and y with fy(x) > 0 
(3) fxy(x)- fx(x) for all x and y with fy(y) > 0 


(4) P(X€ A, Y € B) = P(X € A)P(Y € B) for any sets A and B in the range 
of X and Y, respectively. (5-7) 





Rectangular Range for (X, Y)! 

If the set of points in two-dimensional space that receive positive probability under 
fxy (X, y) does not form a rectangle, X and Y are not independent because knowledge of X 
can restrict the range of values of Y that receive positive probability. In Example 5-1 
knowledge that X — 3 implies that Y can equal only 0 or 1. Consequently, the marginal 
probability distribution of Y does not equal the conditional probability distribution fyjs( y) 
for X — 3. Using this idea, we know immediately that the random variables X and Y with 
joint probability mass function in Fig. 5-1 are not independent. If the set of points in two- 
dimensional space that receives positive probability under fyy(x, y) forms a rectangle, 
independence is possible but not demonstrated. One of the conditions in Equation 5-7 must 
still be verified. 

Rather than verifying independence from a joint probability distribution, knowledge of 
the random experiment is often used to assume that two random variables are independent. 
Then, the joint probability mass function of X and Y is computed from the product of the 
marginal probability mass functions. 


In a large shipment of parts, 1% of the parts do not conform to specifications. The supplier 
inspects a random sample of 30 parts, and the random variable X denotes the number of parts 
in the sample that do not conform to specifications. The purchaser inspects another random 
sample of 20 parts, and the random variable Y denotes the number of parts in this sample that 
do not conform to specifications. What is the probability that X = 1 and Y S$ 1? 

Although the samples are typically selected without replacement, if the shipment is large, 
relative to the sample sizes being used, approximate probabilities can be computed by assum- 
ing the sampling is with replacement and that X and Y are independent. With this assumption, 
the marginal probability distribution of X is binomial with n = 30 and p = 0.01, and the mar- 
ginal probability distribution of Y 1s binomial with n — 20 and p — 0.01. 

If independence between X and Y were not assumed, the solution would have to proceed 
as follows: 


PIX <1,Y<1)=P(X=0,Y=0)+ P(X = 1 Y = 0) 
+ P(X =0,Y=1) + P(X =1,Y=1) 
= fxy(0, 0) + fev, 0) + fxy(0, 1) + fxy(1, 1) 


However, with independence, property (4) of Equation 5-7 can be used as 


PX S1,YS1)=PX¥S1)P(YS1) 
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and the binomial distributions for X and Y can be used to determine these probabilities as 
P(X = 1) = 0.9639 and P(Y = 1) = 0.9831. Therefore, P(X = 1, Y x 1) = 0.948. 
Consequently, the probability that the shipment is accepted for use in manufacturing is 
0.948 even if 1% of the parts do not conform to specifications. If the supplier and the pur- 
chaser change their policies so that the shipment is acceptable only if zero nonconforming 
parts are found in the sample, the probability that the shipment is accepted for production is 


still quite high. That is, 


P(X = 0, Y = 0) = P(X = 0)P(Y = 0) = 0.605 


This example shows that inspection is not an effective means of achieving quality. 


EXERCISES FOR SECTION 5-1 


5-1. Show that the following function satisfies the proper- 
ties of a joint probability mass function. 


x y Jxy Gs, y) 
l l 1/4 
1.5 2 1/8 
1.5 3 1/4 
as 4 1/4 
3 5 1/8 


5-2. Continuation of Exercise 5-1. Determine the following 
probabilities: 

(a P(X < 2.5,Y «3) (b) P(X « 2.5) 

(c) P(Y « 3) (d) P(X > 1.8, Y > 4.7) 

5-3. Continuation of Exercise 5-1. Determine E(X ) and E(Y). 


5-4. Continuation of Exercise 5-1. Determine 

(a) The marginal probability distribution of the random 
variable X. 

(b) The conditional probability distribution of Y given that 
X — 1.5. 

(c) The conditional probability distribution of X given that 
Y — 2. 

(d) E(Y|X = 1.5) 

(e) Are X and Y independent? 

5-5. Determine the value of c that makes the function 

f(x, y) = c(x + y) a joint probability mass function over the 

nine points with x = 1,2, 3 and y = 1, 2, 3. 

5-6. Continuation of Exercise 5-5. Determine the following 

probabilities: 

(a P(X=1,Y<4) (b) P(X » 1) 

(c) P(Y = 2) (d) Pxe2,r,42) 

5-7. Continuation of Exercise 5-5. Determine E(X), E(Y), 

V(X), and V(Y). 

5-8. Continuation of Exercise 5-5. Determine 

(a) The marginal probability distribution of the random 
variable X. 


(b) The conditional probability distribution of Y given that 
X — |. 

(c) The conditional probability distribution of X given that 
Y — 2. 

(d) E(Y| X = 1) 

(e) Are X and Y independent? 

5-9. Show that the following function satisfies the proper- 

ties of a joint probability mass function. 


x y Jxy Gs, y) 
-1 —2 1/8 
—0.5 -1 1/4 
0.5 l 1/2 
l 2 1/8 


5-10. Continuation of Exercise 5-9. Determine the follow- 
ing probabilities: 

(a) PUX < 0.5, Y < 1:5) 
(c) P(Y < 1.5) 


(b) P(X < 0.5) 
(d) P(X > 0.25, Y < 4.5) 


5-11. Continuation of Exercise 5-9. Determine E(X) and 
E(Y). 
5-12. Continuation of Exercise 5-9. Determine 


(a) The marginal probability distribution of the random 
variable X. 

(b) The conditional probability distribution of Y given that 
X= I. 

(c) The conditional probability distribution of X given that 
Y — I. 

(d) E(X|y = 1) 

(e) Are X and Y independent? 

5-13. Four electronic printers are selected from a large lot 

of damaged printers. Each printer is inspected and classified 

as containing either a major or a minor defect. Let the random 

variables X and Y denote the number of printers with major 

and minor defects, respectively. Determine the range of the 

joint probability distribution of X and Y. 


5-14. Inthe transmission of digital information, the probabil- 
ity that a bit has high, moderate, and low distortion is 0.01, 0.10, 
and 0.95, respectively. Suppose that three bits are transmitted 
and that the amount of distortion of each bit is assumed to be in- 
dependent. Let X and Y denote the number of bits with high and 
moderate distortion out of the three, respectively. Determine 


(a) fxy(x, y) (b) fx(x) 

(c) E(X) (d) frl) 

(e) E(Y|X ^ 1) (f) Are X and Y independent? 

5-15. A small-business Web site contains 100 pages and 


60%, 30%, and 10% of the pages contain low, moderate, and 
high graphic content, respectively. A sample of four pages is 
selected without replacement, and X and Y denote the number 
of pages with moderate and high graphics output in the 
sample. Determine 


(a) fxy c. y) (b) fx(x) 
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(c) E(X) 

(e) E(Y|.X = 3) 

(g) Are X and Y independent? 
5-16. A manufacturing company employs two inspecting 
devices to sample a fraction of their output for quality control 
purposes. The first inspection monitor is able to accurately 
detect 99.3% of the defective items it receives, whereas the 
second is able to do so in 99.7% of the cases. Assume that four 
defective items are produced and sent out for inspection. Let X 
and Y denote the number of items that will be identified as 
defective by inspecting devices 1 and 2, respectively. Assume 
the devices are independent. Determine 


(d) fr (y) 
(f) W(Y|X = 3) 


(a) fry y) (b) f(x) 
(c) E(X) (d) fy» (v) 
(e) E(Y|X = 2) (f) V(Y|X = 2) 


(g) Are X and Y independent? 
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5.2.1 Joint Probability Distributions 


EXAMPLE 5-10 


In some cases, more than two random variables are defined in a random experiment, and 


the concepts presented earlier in the chapter can easily be extended. The notation can be 
cumbersome and if doubts arise, it 1s helpful to refer to the equivalent concept for two ran- 
dom variables. Suppose that the quality of each bit received in Example 5-1 is categorized 
even more finely into one of the four classes, excellent, good, fair, or poor, denoted by 
E, G, F, and P, respectively. Also, let the random variables Xj, X5, X5, and X, denote the 
number of bits that are E, G, F, and P, respectively, in a transmission of 20 bits. In this 
example, we are interested in the joint probability distribution of four random variables. 
Because each of the 20 bits 1s categorized into one of the four classes, only values for 
X1, X5, X4, and x, such that x, + x; + x; + x, = 20 receive positive probability in the prob- 


ability distribution. 


In general, given discrete random variables Xj, X», X5, ... 


, X», the joint probability dis- 


tribution of X, X2, X3, ... , X, is a description of the set of points (x), x2, x3, ... , Xp) in the 


range of Xj, X», X5, ... 


AN P 


, X,, along with the probability of each point. A joint probability mass 


function is a simple extension of a bivariate probability mass function. 


Definition 


The joint probability mass function of Xj, X5, ..., X, 1s 


ji xx XQ, e) 


for all points (x1, x», ... , x) in the range of X4, X5, ... ,X, 


Xp 


p 


) — P(X, = X1, X> =Œ Ky ooo 


p 





A marginal probability distribution is a simple extension of the result for two random 


variables. 
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Definition 
If Xi, Xo, X3, ... ,.X,, are discrete random variables with joint probability mass func- 
tion fy y,... x Xi, PME s the marginal probability mass function of any X; is 


p "2 ee Ce) (59) 


where R, denotes the set of points in the range of (X;, X, ... , X,) for which X; = x;. 





EXAMPLE 5-11 Points that have positive probability in the joint probability distribution of three random variables 
X, X, X, are shown in Fig. 5-5. The range is the nonnegative integers with x, + x, + x; = 3. 
The marginal probability distribution of X, is found as follows. 


P(X, = 0) = fy x, x, (3, 0,0) + fx,x,x,(0, 0, 3) fux, x (1,0, 2) f x, x, (2. 9, 1) 
P(X, = 1) = fy, xx, (2. 1,0) + fx, x, (0. 1,2) + fx, xo, 1, 1) 

PS 2) = fx ale 2; 0): F fnn 0:2 1) 

PS =3) = fx x,x,(0, 3, 0) 


Furthermore, E(X;) and V(X;) for i = 1,2,...,p can be determined from the marginal 
probability distribution of X; or from the joint probability distribution of X3, X5, ... , Xp as 
follows. 


Mean and 
Variance from 
Joint 
Distribution 


= > a x... x, X2, ++. 


= >( (x; v Wwy) * fx, x. PACTI ee., 
R 


where R is the set of all points in the range of X1, X>,..., 





With several random variables, we might be interested in the probability distribution of some 
subset of the collection of variables. The probability distribution of Xj, X5, ..., X, k < p can 
be obtained from the joint probability distribution of X;, X5, ... , X, as follows. 


3e X2 


Figure 5-5 Joint i 1 e 
probability distribution 
of X, X», and X5. O 1 2 3 X1 


Distribution of 
a Subset of 
Random 
Variables 


Definition 
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If Xi, X», X3, ... , X, are discrete random variables with joint probability mass function 
fx, x s... x?) the joint probability mass function of Xi, X5 ..., Xj, 
k « p, is 


a EE ) = P(X = Xi A = X2... AN E: Xy) 


z » P(X EL LS (5-11) 
R 


X1X2... Xk 


where Ryx...x, denotes the set of all points in the range of X1, X2, ... , Xp for which 
Xi = X1, X> = Xy ooo LAG. = Xj. 





That is, P(X) = x1, X; = Xo, ..., X, = x;)is the sum of the probabilities over all points in the 
range of Xi, X5, X3,...,X, for which X, = xj, X; = x2,..., and X, = x} An example is 
presented in the next section. Any k random variables can be used in the definition. The first k 
simplifies the notation. 


Conditional Probability Distributions 

Conditional probability distributions can be developed for multiple discrete random variables 
by an extension of the ideas used for two discrete random variables. For example, the condi- 
tional joint probability mass function of X;, X5, X, given X4, X; 1s 


f ( ) FX, x, x x x X X25 X3, X4, Xs) 
LA X]. X2, X3) = 
129 X3 xaxs Velo 72» fx, xa. Xs) 


for fy y.(x4, xs) > 0. The conditional joint probability mass function of X, X», X; given X4, X; 
provides the conditional probabilities at all points in the range of X, X5, Xs, X4, X; for which 
» — X4 and X; = Xs. 


The concept of independence can be extended to multiple discrete random variables. 


Discrete variables X, X5, ... , X, are independent if and only if 


Sa x n X25 +65 25) me (1) fx, (x2) en fe Dos) 


for all vp 





Similar to the result for bivariate random variables, independence implies that Equation 5-12 
holds for all x1, x», ... , Xp. If we find one point for which the equality fails, X1, X5, ... , X, 
are not independent. It can be shown that if X1, X2, ... , X, are independent, 


P(X, E Ay, X> Ed ae E Ap) = P(X, € A,)P(X) € 45)... P(X, e 4.) 


for any sets Aj, Ao, ..., Áp. 
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5-2.2 Multinomial Probability Distribution 


EXAMPLE 5-12 


Multinomial 
Distribution 


A joint probability distribution for multiple discrete random variables that is quite useful 1s an 
extension of the binomial. The random experiment that generates the probability distribution 
consists of a series of independent trials. However, the results from each trial can be catego- 
rized into one of k classes. 


We might be interested in a probability such as the following. Of the 20 bits received, what is 
the probability that 14 are excellent, 3 are good, 2 are fair, and 1 1s poor? Assume that the clas- 
sifications of individual bits are independent events and that the probabilities of E, G, F, and 
P are 0.6, 0.3, 0.08, and 0.02, respectively. One sequence of 20 bits that produces the speci- 
fied numbers of bits in each class can be represented as 


EEEEEEEEEEEEEEGGGFFP 
Using independence, we find that the probability of this sequence is 
P(EEEEEEEEEEEEEEGGGFFP) = 0.6'*0.3?0.08^0.02! = 2.708 x 10? 


Clearly, all sequences that consist of the same numbers of E's, G's, F's, and P's have the same 
probability. Consequently, the requested probability can be found by multiplying 2.708 X 
10^? by the number of sequences with 14 E’s, three G’s, two F’s, and one P. The number of 
sequences is found from the CD material for Chapter 2 to be 


NE NEN 2325600 
141312! — 


Therefore, the requested probability is 
P(14E’s, three G’s, two F's, and one P) = 2325600(2.708 X 10?) = 0.0063 


Example 5-12 leads to the following generalization of a binomial experiment and a bino- 
mial distribution. 


Suppose a random experiment consists of a series of n trials. Assume that 


(1) The result of each trial is classified into one of k classes. 
(2) The probability of a trial generating a result in class 1, class 2, ... , class k 
is constant over the trials and equal to p,, p», ... , Pp respectively. 


(3) The trials are independent. 


The random variables X;, X, ..., X, that denote the number of trials that result in 
class 1, class 2,..., class k, respectively, have a multinomial distribution and the 
joint probability mass function is 


n! 


= X X X 
a A 


P(X; = X1, X5 = Np ore d: = xj) 


HO Soy, "e X ue c ce Xe. = Ty SUCI UO -- a 





EXAMPLE 5-13 


EXAMPLE 5-14 
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The multinomial distribution is considered a multivariable extension of the binomial 
distribution. 


In Example 5-12, let the random variables Xj, X5, X;, and X, denote the number of bits that are 
E, G, F, and P, respectively, in a transmission of 20 bits. The probability that 12 of the bits 
received are E, 6 are G, 2 are F, and 0 are P is 


20! 


120 260 (920 070 — 
= o. 02^ = 0.0358 
612101 9.9 0.3°0.08°0.0 0.0 


P(X, = 12, X; — 6, X4 = 2.24 = 0) 


Each trial in a multinomial random experiment can be regarded as either generating or not 
generating a result in class i, for each i = 1, 2,..., k. Because the random variable X; is the 
number of trials that result in class i, X; has a binomial distribution. 


If X, X5, ... , X, have a multinomial distribution, the marginal probability distribu- 
tion of X; is binomial with 


jun dno AUR) (5-14) 





In Example 5-13, the marginal probability distribution of X, is binomial with n = 20 and 
p = 0.3. Furthermore, the joint marginal probability distribution of X, and X; is found as 
follows. The P(X = x;, X4 = x3) is the probability that exactly x, trials result in G and that x4 
result in F. The remaining n — x, — x; trials must result in either E or P. Consequently, we can 
consider each trial in the experiment to result in one of three classes, {G}, {F}, or (E, P}, with 
probabilities 0.3, 0.08, and 0.6 + 0.02 = 0.62, respectively. With these new classes, we can 
consider the trials to comprise a new multinomial experiment. Therefore, 


Fx, x, (X2, x3) = P(X = x3, Xs = x3) 
n! 
= ———————— — ——— (0.3 ¥2(0.08 73(0.62)" ^ 5 
Xy!x3! (n. — x, — x3)! aOR en) 


The joint probability distribution of other sets of variables can be found similarly. 


EXERCISES FOR SECTION 5-2 


5-17. Suppose the random variables X, Y, and Z have the Determine the following: 


following joint probability distribution (a) P(X — 2) (b P(X = 1, Y= 2) 
(c) P(Z« 1.5) (d) P(X=1 or Z=2) 

x y z I, y, Z) (e) E(X) 
1 1 1 0.05 5-18. Continuation of Exercise 5-17. Determine the follow- 
l l 2 0.10 ing: 
l 2 1 0.15 (a) P(X — 1|Y — 1) (by PteirseiZs2 
1 2 2 0.20 (OP 29 d i ysi1,7=2) 
p, 1 l 0.20 5-19. Continuation of Exercise 5-17. Determine the condi- 
P 1 5 0.15 tional probability distribution of X given that Y = 1 and Z = 2. 
2 y) l 0.10 5-20. Based on the number of voids, a ferrite slab is classi- 
2 2 2 0.05 fied as either high, medium, or low. Historically, 5% of the 


slabs are classified as high, 85% as medium, and 10% as low. 
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A sample of 20 slabs is selected for testing. Let X, Y, and Z 

denote the number of slabs that are independently classified as 

high, medium, and low, respectively. 

(a) What is the name and the values of the parameters of the 
joint probability distribution of X, Y, and Z? 

(b) What is the range of the joint probability distribution of 
P dM A 

(c) What is the name and the values of the parameters of the 
marginal probability distribution of X? 

(d) Determine E(X)and V(X). 

5-21. Continuation of Exercise 5-20. Determine the 

following: 

(a P(X = 1, Y = 17, Z = 3) 

(b P(X $1, Y = 17, Z = 3) 

(c) P(X - 1) 

(d) E(X) 

5-22. Continuation of Exercise 5-20. Determine the 

following: 

(a) P(X = 2,Z = 3|Y = 17) 

(c) E(X|Y = 17) 

5-23. An order of 15 printers contains four with a graphics- 

enhancement feature, five with extra memory, and six with 

both features. Four printers are selected at random, without 

replacement, from this set. Let the random variables X, Y, 

and Z denote the number of printers in the sample 

with graphics enhancement only, extra memory only, and 

both, respectively. 

(a) Describe the range of the joint probability distribution of 
X, Y, and Z. 

(b) Is the probability distribution of X, Y, and Z multinomial? 
Why or why not? 


(6) POre»yerTm, 


5-24. Continuation of Exercise 5-23. Determine the condi- 

tional probability distribution of X given that Y — 2. 

5-25. 

ing: 

(a) PX = 1, Y= 2,Z= 1) 

(c) E(X) and V(X) 

5-26. Continuation of Exercise 5-23. Determine the 

following: 

(a) PX =1,Y=2|Z=1) (b) Px =2|Y=2) 

(c) The conditional probability distribution of X given that 
Y = OandZ = 3. 

5-27. Four electronic ovens that were dropped during ship- 

ment are inspected and classified as containing either a major, 

a minor, or no defect. In the past, 60% of dropped ovens had 

a major defect, 30% had a minor defect, and 10% had no 

defect. Assume that the defects on the four ovens occur 

independently. 

(a) Is the probability distribution of the count of ovens in each 
category multinomial? Why or why not? 

(b) What is the probability that, of the four dropped ovens, two 
have a major defect and two have a minor defect? 

(c) What is the probability that no oven has a defect? 


Continuation of Exercise 5-23. Determine the follow- 


(b P(X = 1,Y=1) 


5-28. Continuation of Exercise 5-27. Determine the 

following: 

(a) The joint probability mass function of the number of ovens 
with a major defect and the number with a minor defect. 

(b) The expected number of ovens with a major defect. 

(c) The expected number of ovens with a minor defect. 


5-29. 

ing: 

(a) The conditional probability that two ovens have major 
defects given that two ovens have minor defects 

(b) The conditional probability that three ovens have major 
defects given that two ovens have minor defects 

(c) The conditional probability distribution of the number of 
ovens with major defects given that two ovens have minor 
defects 

(d) The conditional mean of the number of ovens with major 
defects given that two ovens have minor defects 


5-30. In the transmission of digital information, the proba- 

bility that a bit has high, moderate, or low distortion is 0.01, 

0.04, and 0.95, respectively. Suppose that three bits are trans- 

mitted and that the amount of distortion of each bit is assumed 

to be independent. 

(a) What is the probability that two bits have high distortion 
and one has moderate distortion? 

(b) What is the probability that all three bits have low 
distortion? 


5-31. Continuation of Exercise 5-30. Let X and Y denote the 

number of bits with high and moderate distortion out of the 

three transmitted, respectively. Determine the following: 

(a) The probability distribution, mean and variance of X. 

(b) The conditional probability distribution, conditional mean 
and conditional variance of X given that Y — 2. 


Continuation of Exercise 5-27. Determine the follow- 


5-32. A marketing company performed a risk analysis for a 

manufacturer of synthetic fibers and concluded that new com- 

petitors present no risk 13% of the time (due mostly to the di- 

versity of fibers manufactured), moderate risk 7296 of the time 

(some overlapping of products), and very high risk (competi- 

tor manufactures the exact same products) 1596 of the time. It 

is known that 12 international companies are planning to open 

new facilities for the manufacture of synthetic fibers within 

the next three years. Assume the companies are independent. 

Let X, Y, and Z denote the number of new competitors that will 

pose no, moderate, and very high risk for the interested com- 

pany, respectively. 

(a) What is the range of the joint probability distribution of 
X, Y, and Z? 

(b) Determine P(X = 1, Y = 3, Z = 1) 

(c) Determine P(Z < 2) 

5-33. Continuation of Exercise 5-32. Determine the 

following: 

(a) P(Z—2|Y —1,X = 10) 

(cO PYs zs red 


(b) P(Z =< 1|X = 10) 
(d) E(Z|X = 10) 
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5.3 TWO CONTINUOUS RANDOM VARIABLES 
5.3.1 Joint Probability Distributions 


Our presentation of the joint probability distribution of two continuous random variables is 
similar to our discussion of two discrete random variables. As an example, let the continuous 
random variable X denote the length of one dimenson of an injection-molded part, and let the 
continuous random variable Y denote the length of another dimension. The sample space of 
the random experiment consists of points in two dimensions. 

We can study each random variable separately. However, because the two random vari- 
ables are measurements from the same part, small disturbances in the injection-molding 
process, such as pressure and temperature variations, might be more likely to generate values 
for X and Y in specific regions of two-dimensional space. For example, a small pressure in- 
crease might generate parts such that both X and Y are greater than their respective targets and 
a small pressure decrease might generate parts such that X and Y are both less than their re- 
spective targets. Therefore, based on pressure variations, we expect that the probability of a 
part with X much greater than its target and Y much less than its target is small. Knowledge of 
the joint probability distribution of X and Y provides information that is not obvious from the 
marginal probability distributions. 

The joint probability distribution of two continuous random variables X and Y can be 
specified by providing a method for calculating the probability that X and Y assume a value in 
any region A of two-dimensional space. Analogous to the probability density function of a sin- 
gle continuous random variable, a joint probability density function can be defined over 
two-dimensional space. The double integral of fyy(x, y) over a region R provides the proba- 
bility that (X, Y) assumes a value in R. This integral can be interpreted as the volume under the 
surface fyy(x, y) over the region R. 

A joint probability density function for X and Y is shown in Fig. 5-6. The probability 
that (X, Y) assumes a value in the region R equals the volume of the shaded region in 
Fig. 5-6. In this manner, a joint probability density function is used to determine probabil- 
ities for X and Y. 


Definition 
A joint probability density function for the continuous random variables X and Y, 
denoted as fyy(x, y), satisfies the following properties: 


(1) fyy(x, y) = 0 for all x, y 
O | | fobs ddr - 1 


X LO 


(3) For any region R of two-dimensional space 


PUK Y] E R) = || fert) dc 


R 





Typically, fyy(x,y) is defined over all of two-dimensional space by assuming that 
fxy(x, y) = 0 for all points for which fyy(x, y) is not specified. 
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f xy (x, y) 
A 
fxy 65 y) 4 
i 
| 
JL r - | ; 
% 
x SO 3,09 
Probability that (X, Y) is in the region R is determined » 
by the volume of fyy(x, y) over the region R. ‘60 2.95 
Figure 5-6 Joint probability density function for Figure 5-7 Joint probability density function for the lengths 
random variables X and Y. of different dimensions of an injection-molded part. 


EXAMPLE 5-15 


At the start of this chapter, the lengths of different dimensions of an injection-molded part 
were presented as an example of two random variables. Each length might be modeled by a 
normal distribution. However, because the measurements are from the same part, the random 
variables are typically not independent. A probability distribution for two normal random vari- 
ables that are not independent is important in many applications and it is presented later in this 
chapter. If the specifications for X and Y are 2.95 to 3.05 and 7.60 to 7.80 millimeters, respec- 
tively, we might be interested in the probability that a part satisfies both specifications; that is, 
P(2.95 « X « 3.05, 7.60 « Y « 7.80). Suppose that fyy(x, y) is shown in Fig. 5-7. The re- 
quired probability is the volume of fyy(x, y) within the specifications. Often a probability such 
as this must be determined from a numerical integration. 


Let the random variable X denote the time until a computer server connects to your machine 
(in milliseconds), and let Y denote the time until the server authorizes you as a valid user (in 
milliseconds). Each of these random variables measures the wait from a common starting time 
and X « Y. Assume that the joint probability density function for X and Y is 


fy, y) = 6 X 107° exp(—0.001x — 0.002y) forx < y 


Reasonable assumptions can be used to develop such a distribution, but for now, our focus is 
only on the joint probability density function. 

The region with nonzero probability is shaded in Fig. 5-8. The property that this joint 
probability density function integrates to 1 can be verified by the integral of fyy (x, y) over this 
region as follows: 


| [foto y) dy dx — | fs KIO o ee ee y Nak 


= 6x107 6 f ney hy e —0.001x dx 


X 


e 9 002x 





—6x10- 6 e 0001x dx 


| 
[( 


1 
— —0.003x — 
0.003 f dx | — 0.003 (sos a) 1 
0 
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y y 
2000 
O 
O x O 1000 x 
Figure 5-8 The joint probability Figure 5-9 Region of integration for 
density function of X and Y is the probability that X < 1000 and Y < 
nonzero over the shaded region. 2000 is darkly shaded. 


The probability that X < 1000 and Y < 2000 is determined as the integral over the 
darkly shaded region in Fig. 5-9. 


1000 2000 
P(X = 1000, Y = 2000) = | | fers y) dy dx 
0 x 

1000 / 2000 

— 6 x 1076 | | g 0902» gy | e700 gy 
0 x 
UR —0.002x —4 

g e up 

— 6X 1076 —0.001x q 

| 0.002 ) ü 


1000 


= 0.003 | e 0.003x uU e 4 e 0901x dx 


0 


- oos ss) - Car) 
ub 0.003 ^ X 0.001 


= 0.003(316.738 — 11.578) = 0.915 


5-3.2 Marginal Probability Distributions 


similar to joint discrete random variables, we can find the marginal probability distributions 
of X and Y from the joint probability distribution. 


Definition 
If the joint probability density function of continuous random variables X and Y is 
fxy (X, y), the marginal probability density functions of X and Y are 


HO IE DE E | a (5-16) 


R, R, 


where R, denotes the set of all points in the range of (X, Y) for which X = x and 
R, denotes the set of all points in the range of (X, Y) for which Y = y 
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A probability involving only one random variable, say, for example, P(a « X « b), 
can be found from the marginal probability distribution of X or from the joint probability 
distribution of X and Y. For example, P(a < X < b) equals P(a < X < b, =œ < Y < oo). 
Therefore, 


X 


P(a < X< b) = | [sna - | | fond dx = m 


a 


similarly, E(X) and V(X) can be obtained directly from the joint probability distribution of X 
and Y or by first calculating the marginal probability distribution of X. The details, shown in 
the following equations, are similar to those used for discrete random variables. 


Mean and 
Variance from 00 oo 
Joint 
= = dy | d 
Distribution | Xfx (x) dx | X | Try x, M» ) yY | ax 


EX T 


= || Xfxy(x, y) dx dy 


R 


n = ti) eode = fe - py) | far y) dy | dx 


Ry 


[|e = frt) ded 


R 


where Ry denotes the set of all points in the range of (X, Y) for which X = x and 
Ry denotes the set of all points in the range of (X, Y) 





EXAMPLE 5-16 For the random variables that denote times in Example 5-15, calculate the probability that Y 
exceeds 2000 milliseconds. 
This probability is determined as the integral of fyy(x, y) over the darkly shaded region 
in Fig. 5-10. The region is partitioned into two parts and different limits of integration are de- 
termined for each part. 


2000 


P(Y > 2000) = | | GX 10 ee nay 175- 
2000 


0 


CO CO 


ns | Je x 10 ge ORTA dx 


2000 \ x 
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2000 


0 
0 2000 x 


Figure 5-10 Region of integration 
for the probability that Y < 2000 is 
darkly shaded and it is partitioned 
into two regions with x < 2000 and 
x > 2000. 


The first integral is 








2000 DS Z 2000 
g v ps 6x10 
6 x 10 9 | e 0901x dx = g’ | e 0901x dx 
— 0.002 0.002 
: 2000 4 
|.6X 106 “(2 » =) — 
= 0.002 0.001 / ^ 


The second integral is 


—' g^ 0002» 


oO 
oo 





—6 
e 0901x dx — 6 X 10 | e 0003x dx 








—0.002 |. 0.002 
2000 2000 
-xE e^ ) = 0.0025 
= 0.002 W0.003/ ` 


Therefore, 
P(Y > 2000) = 0.0475 + 0.0025 = 0.05. 


Alternatively, the probability can be calculated from the marginal probability distribution of Y 
as follows. For y > 0 


i x 
fr) = fo x 10 %e7 0.00 1x —0.002y dx —6X 10 Se 9.002» | ea 
f 0 
—0.001x 


i 6 —0.002»( 1 — p 
f | |26x10-9e 992»| —— — —— 
—0.001 ) e ( 0.001 


=6< 10°C (1979999. dopy os 


— 6X io se wo ( 
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We have obtained the marginal probability density function of Y. Now, 


oO 


P(Y > 2000) = 6 x 10? | ea esq Oy 


2000 
eer (os oc E (s 














on 





—0.002 
-4 ~6 

= 6x 10°] 8 - Z| = 005 
0.002 0.003 


5-3.3 Conditional Probability Distributions 


Definition 


EXAMPLE 5-17 





Analogous to discrete random variables, we can define the conditional probability distribution 
of Y given X — x. 


Given continuous random variables X and Y with joint probability density function 
fxy (X, y), the conditional probability density function of Y given X = x is 


fao) =" si l fo f) 0 (5-18) 





The function fy,(y) is used to find the probabilities of the possible values for Y given 
that X = x. Let R, denote the set of all points in the range of (X, Y) for which X = x. The 
conditional probability density function provides the conditional probabilities for the values 
of Y in the set R,. 


Because the conditional probability density function /fy),(y) is a probability density 
function for all y in R, the following properties are satisfied: 


Q) | fuos d 


Ry 


3) P(YEB|X=x) = [io dy for any set B in the range of Y 


B 


(5-19) 


It is important to state the region in which a joint, marginal, or conditional probability 
density function is not zero. The following example illustrates this. 


For the random variables that denote times in Example 5-15, determine the conditional prob- 
ability density function for Y given that X — x. 
First the marginal density function of x is determined. For x > 0 


Figure 5-11 The 
conditional probability 
density function for Y, 
given that x — 1500, 1s 
nonzero over the solid 
line. 


Definition 
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1500 


0 
0 1500 x 








l —0.002y | % 
fs) = |6 x 1075 0:001x— 0.0027 J, = 6 x 19769-0001 e y 
—0.002 |. 
—0.002x 
= —6 .—0.001x e - — 0.003x 
6 X 10 "e 0.002 0.003e for o) 


This is an exponential distribution with A = 0.003. Now, for 0 < x and x < y the conditional 
probability density function is 


6 x 10" 6g 9.001x— 0.002y 
frx) =a a (x) = 0.0032 - 9-003« 


=(0002e"" (2 dore sca and — xy 


The conditional probability density function of Y, given that x — 1500, 1s nonzero on the solid 
line in Fig. 5-11. 

Determine the probability that Y exceeds 2000, given that x = 1500. That is, determine 
P(Y > 2000 | x = 1500)The conditional probability density function is integrated as follows: 


oo CO 


P(Y > 2000|x = 1500) = | frisoo(y) dy = | 000029 00 ay 


2000 2000 


—0.002y | ec E 
— 0.002 = 0.368 
—0.002 u t (s) 








0.0022 ( 





Let R, denote the set of all points in the range of (X, Y) for which X = x. The condi- 
tional mean of Y given X = x, denoted as E(Y |x) or j.y|,, is 


E(Y|zx) = | vbt 


and the conditional variance of Y given X = x, denoted as V(Y | x) or o},, is 


4042 P3 = [o E Lyi) fis) dy = fo? fru) dy fay ae (5-20) 


Re Ry 
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EXAMPLE 5-18 


For the random variables that denote times in Example 5-15, determine the conditional mean 
for Y given that x — 1500. 

The conditional probability density function for Y was determined in Example 5-17. 
Because fy;5oX( y) is nonzero for y > 1500, 


E(Y|x = 1500) = | y (0.002,.002(1500)70002») dy = 0.002e° | et dy 
1500 1500 


Integrate by parts as follows: 


CO CO 











| e tay =y 
1500 


eg 0902» |% e 0002y 

- | (S) 
— 0.002 1500 — 0.002 

1500 

1500 3 _ a uT j ) 
0.002 © (—0.002)(—0.002) lis 
1500 _, e? o e” 


-- ——— t —ha( à — 
0.002^ ` (0.002)(0.002) 0.002 








(2000) 


With the constant 0.002&? reapplied 


E(Y|x = 1500) = 2000 


5-3.4 Independence 


Definition 


EXAMPLE 5-19 


The definition of independence for continuous random variables is similar to the definition for 
discrete random variables. If fyy(x, y) = fy(x) fy(y) for all x and y, X and Y are independent. 
Independence implies that fyy(x, y) = fy(x) fy(y) for all x and y. If we find one pair of x and y 
in which the equality fails, X and Y are not independent. 


For continuous random variables X and Y, if any one of the following properties is 
true, the others are also true, and X and Y are said to be independent. 


(1) fer, y) = fx(x)fv(v) for all x and y 


(2) frily) =fr(y) forall x and y with fy(x) > 0 


(3) fxy(x) = fx(x) for all x and y with f(y) > 0 


(4) P(X€ A, Y € B) = P(X € A)P(Y € B) for any sets A and B in the range 
of X and Y, respectively. (5-21) 





For the joint distribution of times in Example 5-15, the 


e Marginal distribution of Y was determined in Example 5-16. 


e Conditional distribution of Y given X = x was determined in Example 5-17. 


Because the marginal and conditional probability densities are not the same for all values of 
x, property (2) of Equation 5-20 implies that the random variables are not independent. The 


EXAMPLE 5-20 


EXAMPLE 5-21 
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fact that these variables are not independent can be determined quickly by noticing that the 
range of (X, Y), shown in Fig. 5-8, is not rectangular. Consequently, knowledge of X changes 
the interval of values for Y that receives nonzero probability. 


Suppose that Example 5-15 is modified so that the joint probability density function of X and Y 
is fyy(x, y) = 2 X 10 6e 0001x 7090» for x = 0 and y = 0. Show that X and Y are independ- 
ent and determine P(X > 1000, Y < 1000). 

The marginal probability density function of X is 


fx(x) = | 2x10 T wr ae OUTg- V opcs 
0 
The marginal probability density function of y is 
fly) = | 2 x 10-$e- 9901-0002» Jy = 9 0020-0992» fory > 0 
0 


Therefore, fyy(x, y) = fx(x) fyCy) for all x and y and X and Y are independent. 
To determine the probability requested, property (4) of Equation 5-21 and the fact that 
each random variable has an exponential distribution can be applied. 


P(X > 1000, Y < 1000) = P(X > 1000)P(Y < 1000) = e !(1— e?) = 0.318 


Often, based on knowledge of the system under study, random variables are assumed to be in- 
dependent. Then, probabilities involving both variables can be determined from the marginal 
probability distributions. 


Let the random variables X and Y denote the lengths of two dimensions of a machined part, re- 
spectively. Assume that X and Y are independent random variables, and further assume that the 
distribution of X is normal with mean 10.5 millimeters and variance 0.0025 (millimeter) and 
that the distribution of Y is normal with mean 3.2 millimeters and variance 0.0036 (millime- 
ter)". Determine the probability that 10.4 < X < 10.6 and 3.15 < Y < 3.25. 

Because X and Y are independent, 


P(10.4 < X < 10.6, 3.15 < Y < 325) = P(10.4 < X < 10.6)P(3.15 < Y < 325) 


10.4 — 10.5 10.6 — 10.5 2419 Oe PAS er 
= —— ee ete oe 
0.05 0.05 0.06 0.06 


= P(-2 < Z < 2)P(-0.833 < Z < 0.833) = 0.566 


where Z denotes a standard normal random variable. 


EXERCISES FOR SECTION 5-3 


5-34. Determine the value of c such that the function (a) P(X«2,Y«3) (b) PX < 2.5) 

f(x, y) = cxy for 0 <x < 3 and 0 < y < 3 satisfies the (OPI SS 225) (Pv Lela 25) 
properties of a joint probability density function. (e) E(X) (f) P(X « 0, Y « 4) 

5-55. Continuation of Exercise 5-34. Determine the 


following: 


166 


5-36. Continuation of Exercise 5-34. Determine the 
following: 

(a) Marginal probability distribution of the random variable X 
(b) Conditional probability distribution of Y given that X — 1.5 
(c) E(Y|X) = 1.5) 

(d) P(Y « 2|X = 1.5) 

(e) Conditional probability distribution of X given that Y — 2 
5-37. Determine the value of c that makes the function 
f(x, y) = c(x + y) a joint probability density function over the 
range 0 « x « 3and x «€ y € x * 2. 


5-38. Continuation. of Exercise 5-37. Determine the 
following: 

(a) (X<1,Y<2) (b PU<X< 2) 

œ) AY MV (d) AX = 2, Y <2) 

(e) E(X) 

5-39. Continuation of Exercise 5-37. Determine the 
following: 


(a) Marginal probability distribution of X 

(b) Conditional probability distribution of Y given that X = 1 

(c) E(Y|X = 1) 

(d) P(Y > 2|X = 1) 

(e) Conditional probability distribution of X given that 
Y=2 

5-40. Determine the value of c that makes the function 

f(x, y) = cxy a joint probability density function over the range 

O=< x= 3 and 0 =< y « x. 


5-41. Continuation of Exercise 5-40. Determine the 
following: 

(a) P(X«1,Y«2) b) P <X<2) 

(c) P(Y > 1) (d) P(X <2, Y< 2) 

(e) E(X) (f) E(Y) 

5-42. Continuation. of Exercise 5-40. Determine the 
following: 


(a) Marginal probability distribution of X 

(b) Conditional probability distribution of Y given X = 1 

(c) E(Y|X = 1) 

(d) P(Y > 2|X = 1) 

(e) Conditional probability distribution of X given Y — 2 
5-45. Determine the value of c that makes the function 
f(x,y) = ce ^* 7" a joint probability density function over 
the range 0 < x and 0 < y < x. 

5-44. Continuation. of Exercise 5-43. Determine the 


following: 

(a) P(X<1,Y<2) (b) P(1<X<2) 

(cy AY > 3) (d) P(X « 2, Y « 2) 

(e) E(X) (f) E(Y) 

5-45. Continuation. of Exercise 5-43. Determine the 


following: 

(a) Marginal probability distribution of X 

(b) Conditional probability distribution of Y given X = 1 
(c) E(Y|X = 1) 

(d) Conditional probability distribution of X given Y — 2 
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5.46. Determine the value of c that makes the function 
f(x,y) = ce ^*^?" a joint probability density function over 
the range 0 < x and x < y. 


5-47. Continuation. of Exercise 5-46. Determine the 
following: 

(a) PX< Ys 2) b) Pil <2) 

(c) P(Y > 3) (d) PX€«2,Y«2) 

(e) E(X) (f) EY) 

5-48. Continuation. of Exercise 5-46. Determine the 
following: 


(a) Marginal probability distribution of X 

(b) Conditional probability distribution of Y given X = 1 

(c) E(Y|X = 1) 

(d) P(Y < 2|X = 1) 

(e) Conditional probability distribution of X given Y = 2 
5-49. Two methods of measuring surface smoothness are 
used to evaluate a paper product. The measurements are 
recorded as deviations from the nominal surface smoothness 
in coded units. The joint probability distribution of the 
two measurements is a uniform distribution over the re- 
gion 0 <x < 4, 0< y, andx— 1 «y « x + 1. That is, 
fxy(x, y) = c for x and y in the region. Determine the value for 
c such that fy, (x, y) is a joint probability density function. 


5-50. Continuation of Exercise 5-49. Determine the 


following: 

(a) P(X < 0.5, Y < 0.5) (b) P(X « 0.5) 

(c) E(X) (d) E(Y) 

5-51. Continuation of Exercise 5-49. Determine the follow- 


ing: 

(a) Marginal probability distribution of X 

(b) Conditional probability distribution of Y given X = 1 

(c) E(Y|X — 1) 

(d) P(Y < 0.5|X = 1) 

5-52. The time between surface finish problems in a galva- 

nizing process is exponentially distributed with a mean of 

40 hours. A single plant operates three galvanizing lines that 

are assumed to operate independently. 

(a) What is the probability that none of the lines experiences 
a surface finish problem in 40 hours of operation? 

(b) What is the probability that all three lines experience a sur- 
face finish problem between 20 and 40 hours of operation? 

(c) Why is the joint probability density function not needed to 
answer the previous questions? 


5-55. A popular clothing manufacturer receives Internet 

orders via two different routing systems. The time between 

orders for each routing system in a typical day is known to be 

exponentially distributed with a mean of 3.2 minutes. Both 

systems operate independently. 

(a) What is the probability that no orders will be received in a 
5 minute period? In a 10 minute period? 

(b) What is the probability that both systems receive two 
orders between 10 and 15 minutes after the site is offi- 
cially open for business? 


(c) Why is the joint probability distribution not needed to 
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(a) Graph fyjx(y) = xe” for y > 0 for several values of x. 


answer the previous questions? Determine 
5.54. The conditional probability distribution of Y given (b) P(Y < 2|X=2) (c) E(Y|X = 2) 
X= xis fyjx(y) = xe ? for y > 0 and the marginal probabil- (d) E(Y|X = x) (e) fxy (x, y) 
ity distribution of X is a continuous uniform distribution over (f) fy(y) 


0 to 10. 
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As for discrete random variables, in some cases, more than two continuous random variables 
are defined in a random experiment. 
EXAMPLE 5-22 Many dimensions of a machined part are routinely measured during production. Let the ran- 
dom variables, X, X5, X3, and X, denote the lengths of four dimensions of a part. Then, at least 
four random variables are of interest in this study. 


The joint probability distribution of continuous random variables, X1, X», X5 ... , Xp 


can be specified by providing a method of calculating the probability that X1, X2, X5, ... , Xp 
assume a value in a region R of p-dimensional space. A joint probability density function 
fx x... X, (X1, X2, ... , Xp) is used to determine the probability that (X1, X5, X5, ... , Xp) assume a 


value in a region R by the multiple integral of fy. y... y. (X1, X5, ... , xy) over the region R. 


Definition 
A joint probability density function for the continuous random variables Xj, X5, 


X5, ... , Xp, denoted as fy, x... x, (X1, X2 +++» Xp), satisfies the following properties: 


(D fx xx 00s X» x) m0 


oo 


(2) | | S | fis een) d Tb Aq AX, = | 


LLL =C2 


(3) For any region B of p-dimensional space 


IA Gee aS) €B] = || ms prc ANON ans, (5-22) 


B 





Typically, fy, y... y, (X1, X2, ..., Xp) is defined over all of p-dimensional space by assum- 
ing that fx x... x Q1 X2, ... , X) = 0 for all points for which fx x... x 01 X2, 256 Xp) is not 
specified. 

EXAMPLE 5-23 In an electronic assembly, let the random variables Xj, X5, X4, X4 denote the lifetimes of four 


components in hours. Suppose that the joint probability density function of these variables is 


= —2 ,—0.001x, — 0.002x,— 0.001 5x3 — 0.003x 
P xxx s X2 X3, x4) Los 9 xX 10 e 1 2 3 4 


for x = 0, x» = 0, x3 = 0, x4 = 0 


What is the probability that the device operates for more than 1000 hours without any failures? 
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EXAMPLE 5-24 


Definition 





The requested probability is P(X, > 1000, X, > 1000, X4 > 1000, X, > 1000), which 
equals the multiple integral of fy, x, x, y, (x1, X25 x3, x4) over the region x, > 1000, x, > 1000, 
X4 > 1000, x, > 1000. The joint probability density function can be written as a product of 
exponential functions, and each integral is the simple integral of an exponential function. 
Therefore, 


P(X, > 1000, X, > 1000, X; > 1000, X, > 1000) = e ! ^ 137? = 0.00055 


Suppose that the joint probability density function of several continuous random vari- 
ables is a constant, say c over a region R (and zero elsewhere). In this special case, 


| | Y PET Xy) dx, dx; ... dx, = c X (volume of region R) = 1 


by property (2) of Equation 5-22. Therefore, c = 1/volume (R). Furthermore, by property (3) 
of Equation 5-22. 


Q3», a 5 dx, dx; ... dx, = c X volume (BAR) 


| 
——À 
—À 
—— 
= 
Os 


volume (B N R) 


volume (R) 


When the joint probability density function is constant, the probability that the random vari- 
ables assume a value in the region B is just the ratio of the volume of the region B N R to the 
volume of the region R for which the probability is positive. 


Suppose the joint probability density function of the continuous random variables X and Y is 
constant over the region x^ + y? < 4. Determine the probability that X^ + Y? < 1. 

The region that receives positive probability is a circle of radius 2. Therefore, the area of 
this region is 47. The area of the region x^ + y? < 1 is m. Consequently, the requested prob- 
ability is w/(47) = 1/4. 


If the joint probability density function of continuous random variables Xj, X5, ... , X, 
iS fy, x... x, (X1, X2 ..., xj) the marginal probability density function of X; is 


| | ju prn adog x) dx dx» qus dx; dX; 44 ioni dx, (5-23) 


Xi 


where R,, denotes the set of all points in the range of Xj, X5, ..., Xp for which 
X; = Ao 


Mean and 
Variance from 
Joint 
Distribution 


Distribution of 
a Subset of 
Random 
Variables 
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As for two random variables, a probability involving only one random variable, say, for 
example P(a < X; < b), can be determined from the marginal probability distribution of X; 
or from the joint probability distribution of X1, X5, ... , Xp. That is, 


Pla < X; < b) = P(—o < X, < o,..., -O < X; , « 0, qa « X; « b, 


uL Lo (TE Os es —% < X, < %) 


9 


Furthermore, E(X;) and V(X;), fori = 1, 2, ..., p, can be determined from the marginal prob- 
ability distribution of X; or from the joint probability distribution of X, X5, ... , X, as follows. 


(o0) 


E(X;) = | | ses nf xfi =) dx, dxo a GX, 


= 00 


(5-24) 


Por I)" fos x... x Q0 X2, +255 Xp) dx, dx» ... dx, 


The probability distribution of a subset of variables such as Xi, X5, ... , Xj, k < p, can be 
obtained from the joint probability distribution of X1, X2, X5, ... , X, as follows. 


If the joint probability density function of continuous random variables X;, X5, ... , X, 
iS fy, x... x, (X1, X2, ... , Xp), the probability density function of X;, X), ... , X, k < p, 


IS 


dieere lere CONTE 


= | | iris [eaka E 4.5 ... dx, (9-73 
R 


X1X2... Xk 


where A, |, denotes the set of all points in the range of Xj, X5, ... , X, for which 
Xj = Xp A = Ko ovo eG — Xx. 


Conditional Probability Distribution 
Conditional probability distributions can be developed for multiple continuous random vari- 
ables by an extension of the ideas used for two continuous random variables. 


r ( ) IX, x x x xD X2, X3, X4, Xs) 
Gs X15 X2, X3) = 
Vo Xs [xa xs N^ l> 42 fx x. Xs) 


for fy, x, (xa, xs) > 0. 
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Definition 


EXAMPLE 5-25 


EXAMPLE 5-26 


The concept of independence can be extended to multiple continuous random variables. 


Continuous random variables X;, X5, ... , X, are independent if and only if 


fx x... x XQ +++, 2) z Jx Yx oo) zm fx x) for all UU ex s Xp (5-26) 





Similar to the result for only two random variables, independence implies that Equation 5-26 
holds for all xi, x», ... , Xp. If we find one point for which the equality fails, X1, X», ... , X, are 
not independent. It is left as an exercise to show that if X1, Xz, ... , X, are independent, 


P(X, E A, X; E Ubi oss gcn € A,) = P(X, € A DPX Edo edo eu) 


for any regions 4j, Ap, ... , A, in the range of Xj, X5, ... , Xp, respectively. 

In Chapter 3, we showed that a negative binomial random variable with parameters p and r 
can be represented as a sum of r geometric random variables Xj, X5, ... , X,. Each geometric 
random variable represents the additional trials required to obtain the next success. Because 
the trials in a binomial experiment are independent, Xj, X2, ..., X, are independent random 
variables. 


Suppose X, X», and X; represent the thickness in micrometers of a substrate, an active layer, 
and a coating layer of a chemical product. Assume that Xj, X5, and X4 are independent 
and normally distributed with w, = 10000, p = 1000, u; = 80,0, = 250, 0» = 20, and 
04, = 4, respectively. The specifications for the thickness of the substrate, active layer, and 
coating layer are 9200 < x, « 10800, 950 < x, < 1050, and 75 < x4 « 85, respectively. 
What proportion of chemical products meets all thickness specifications? Which one of the 
three thicknesses has the least probability of meeting specifications? 

The requested probability is P(9200 < X, < 10800, 950 < X, < 1050, 75 < X; < 85. 
Because the random variables are independent, 


P(9200 < X, < 10800, 950 < X; < 1050, 75 < X; < 85) 
= P(9200 < X, < 10800)P(950 < X, < 1050)P(75 < X, < 85) 


After standardizing, the above equals 
P3922 50759 2E 2:9 955 $925) PL-125 = Z = 125) 


where Z 1s a standard normal random variable. From the table of the standard normal distri- 
bution, the above equals 


(0.99862)(0.98758)(0.78870) — 0.7778 


The thickness of the coating layer has the least probability of meeting specifications. 
Consequently, a priority should be to reduce variability in this part of the process. 
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EXERCISES FOR SECTION 5-4 


5-55. Suppose the random variables X, Y, and Z have the joint 

probability density function f(x, y, z) = 8xyz for0 <x < 1, 

0<y<1,and0<z< 1. Determine the following: 

(a) P(X < 0.5) (b) P(X < 0.5, Y < 0.5) 

(e) PiZ= 2) (d) P(X«05orZ = 2) 

(e) E(X) 

5-56. Continuation of Exercise 5-55. Determine the following: 

(a) P(X < 0.5|Y = 0.5) 

(b) P(X < 0.5, Y < 0.5|Z = 0.8) 

5-57. Continuation of Exercise 5-55. Determine the following: 

(a) Conditional probability distribution of X given that Y — 
0.5 and Z = 0.8 

(b) P(X < 0.5|Y = 0.5, Z = 0.8) 

5-58. Suppose the random variables X, Y, and Z have 

the joint probability density function fyyz(x, y, z) = c over 

the cylinder x? + 5? < 4 and 0 « z « 4. Determine the 

following. 

(a) The constant c so that fyyz(x, y, Z) is a probability density 


function 
(b) P(X? + Y? « 2) 
(c) AZ « 2) 
(d) E(X) 
5.59, Continuation of Exercise 5-58. Determine the 
following: 
(a) P(X«1|Y-1) (b) P(X? + Y? « 1IZ- 1) 
5-60. Continuation of Exercise 5-58. Determine the condi- 


tional probability distribution of Z given that X — 1 and 
Y — |. 

5-61. Determine the value of c that makes fyy7(x, y, Z) = c 
a joint probability density function over the region x > 0, 
yo") Vanda ye = 1, 

5-62. 
(a) P 
(b) P 
(c) P 
(d) E(X) 

5-63. Continuation of Exercise 5-61. Determine the following: 
(a) Marginal distribution of X 

(b) Joint distribution of X and Y 


Continuation of Exercise 5-61. Determine the following: 
AX € 05.1 € 05,7 « 0.5) 
A «05,1 0.) 
X « 0.5) 
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(c) Conditional probability distribution of X given that Y — 
0.5 and Z = 0.5 

(d) Conditional probability distribution of X given that 
Y — 0.5 

5-64. The yield in pounds from a day's production is nor- 

mally distributed with a mean of 1500 pounds and standard 

deviation of 100 pounds. Assume that the yields on different 

days are independent random variables. 

(a) What 1s the probability that the production yield exceeds 
1400 pounds on each of five days next week? 

(b) What is the probability that the production yield exceeds 
1400 pounds on at least four of the five days next week? 


5-65. The weights of adobe bricks used for construction are 

normally distributed with a mean of 3 pounds and a standard 

deviation of 0.25 pound. Assume that the weights of the bricks 

are independent and that a random sample of 20 bricks is 

selected. 

(a) What is the probability that all the bricks in the sample 
exceed 2.75 pounds? 

(b) What is the probability that the heaviest brick in the sam- 
ple exceeds 3.75 pounds? 


5-66. A manufacturer of electroluminescent lamps knows 

that the amount of luminescent ink deposited on one of 

its products is normally distributed with a mean of 1.2 

grams and a standard deviation of 0.03 grams. Any lamp 

with less than 1.14 grams of luminescent ink will fail 

to meet customer’s specifications. A random sample of 

25 lamps is collected and the mass of luminescent ink on 

each is measured. 

(a) What is the probability that at least 1 lamp fails to meet 
specifications? 

(b) What is the probability that 5 lamps or fewer fail to meet 
specifications? 

(c) What is the probability that all lamps conform to specifi- 
cations? 

(d) Why is the joint probability distribution of the 25 lamps 
not needed to answer the previous questions? 


When two or more random variables are defined on a probability space, it is useful to describe 
how they vary together; that is, it is useful to measure the relationship between the variables. 
A common measure of the relationship between two random variables is the covariance. To 
define the covariance, we need to describe the expected value of a function of two random 
variables h(X, Y). The definition simply extends that used for a function of a single random 


variable. 
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Definition 


EXAMPLE 5-27 


Definition 


Figure 5-12 Joint 
distribution of X and Y 
for Example 5-27. 


SDVAwY fry) —X Y discrete 
EAX Y] 2 V op 


|| h(x, y)fxy(x, y) d«dy X, Y continuous 


R 





That is, E[A(X, Y)] can be thought of as the weighted average of A(x, y) for each point in the 
range of (X, Y). The value of E[A(X, Y)] represents the average value of A(X, Y) that 1s expected 
in a long sequence of repeated trials of the random experiment. 


For the joint probability distribution of the two random variables in Fig. 5-12, calculate 
E[(X — px)(¥ — py)]. 


The result is obtained by multiplying x — py times y — py, times fyy(x, y) for each point 
in the range of (X, Y). First, wy and py are determined from Equation 5-3 as 
wy = 1X034+3 X 0.7 =2.4 
and 
uwy=1X034+2X04+4+3 X 0.3 = 2.0 
Therefore, 
E(X — up) — pp] = (1 — 2.4)(1 — 2.0) x 0.1 
+ (1 — 2.4)(2 — 2.0) X 0.2 + (3 — 2.4)(1 — 2.0) x 02 
TOO = 24)2 = 2.0) X02 413 =24)6 — 2.0) xX 03 =02 


The covariance is defined for both continuous and discrete random variables by the same formula. 


The covariance between the random variables X and Y, denoted as cov(X, Y) or oyy, is 


Oxy = El(X — py)(Y — py)] = E(XY) — myhy (5-28) 





y 
3 e 0.3 
2 e 0.2 e 0.2 
e O.1 e 0.2 
1 2 3 x 
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If the points in the joint probability distribution of X and Y that receive positive probabil- 
ity tend to fall along a line of positive (or negative) slope, oy, is positive (or negative). If the 
points tend to fall along a line of positive slope, X tends to be greater than jy when Y is greater 
than py. Therefore, the product of the two terms x — py and y — py tends to be positive. 
However, if the points tend to fall along a line of negative slope, x — wy tends to be positive 
when y — py is negative, and vice versa. Therefore, the product of x — wy and y — uy tends 
to be negative. In this sense, the covariance between X and Y describes the variation between 
the two random variables. Figure 5-13 shows examples of pairs of random variables with 
positive, negative, and zero covariance. 

Covariance 1s a measure of linear relationship between the random variables. If the re- 
lationship between the random variables is nonlinear, the covariance might not be sensitive to 
the relationship. This is illustrated in Fig. 5-13(d). The only points with nonzero probability 
are the points on the circle. There is an identifiable relationship between the variables. Still, 
the covariance is zero. 

The equality of the two expressions for covariance in Equation 5-28 1s shown for contin- 
uous random variables as follows. By writing the expectations as integrals, 


BUY = w= wad] = | | C= mad = dft) de 


= | | [xy — uyy — xpy + uyur] fyy(x, y) dxdy 


—00 —o 


y 
e e 
e (J e 
e © e e y 
e e e e > ° ° 
e * e e * " " 
e e e e " s * 
e e 
x x 
(a) Positive covariance (b) Zero covariance 
y All points are of Y 
i " equal probability 


(c) Negative covariance (d) Zero covariance 


Figure 5-13 Joint probability distributions and the sign of covariance between X and Y. 
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EXAMPLE 5-28 


Definition 


NOW 


| | xy fry(x, y) dxdy = wy | | yfxy Go y) dxdy | = uy 
Therefore, 


E[(X — my) — py)] = Xyfxy(x, y) dxdy — yy — pyby + wyby 


XyfxyGo y) dxdy — muyuy = E(XY) — toy 


a cc. eee. 
| — 8 | — 8 


In Example 5-1, the random variables X and Y are the number of acceptable and suspect bits 
among four bits received during a digital communication, respectively. Is the covariance 
between X and Y positive or negative? 

Because X and Y are the number of acceptable and suspect bits out of the four received, 
X + Y x 4. If X is near 4, Y must be near 0. Therefore, X and Y have a negative covariance. 
This can be verified from the joint probability distribution in Fig. 5-1. 


There is another measure of the relationship between two random variables that is often 
easier to interpret than the covariance. 


The correlation between random variables X and Y, denoted as pyy, is 


| cov(X,Y) — ayy 
PY VXV) | exer 





Because oy > 0 and oy > 0, if the covariance between X and Y is positive, negative, or zero, 
the correlation between X and Y is positive, negative, or zero, respectively. The following 
result can be shown. 


For any two random variables X and Y 


= St l 





The correlation just scales the covariance by the standard deviation of each variable. 
Consequently, the correlation is a dimensionless quantity that can be used to compare the 
linear relationships between pairs of variables in different units. 


EXAMPLE 5-29 


EXAMPLE 5-30 
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If the points in the joint probability distribution of X and Y that receive positive probabil- 
ity tend to fall along a line of positive (or negative) slope, pyy is near +1 (or —1). If pyy 
equals +1 or —1, it can be shown that the points in the joint probability distribution that 
receive positive probability fall exactly along a straight line. Two random variables with 
nonzero correlation are said to be correlated. Similar to covariance, the correlation 1s a meas- 
ure of the linear relationship between random variables. 


For the discrete random variables X and Y with the joint distribution shown in Fig. 5-14, 
determine o y, and pyy. 


The calculations for E(XY), E(X), and V(X) are as follows. 


E(XY) 20X0xX0.2- 1X 1xX0.1-1X2x0.1-*2X1x01 
+2X2xX014+3X3X04=45 
E(X)=0024+1xX02+2x02+3x04=1.8 
V(X) = (0 — 1.8} x 0.2 + (1 — 1.8) X 0.2 + (2 — 1.8) x 02 
+ (3 — 1.8) x 0.4 = 1.36 


Because the marginal probability distribution of Y is the same as for X, E(Y) = 1.8 and 
V(Y) = 1.36. Consequently, 


oyy = E(XY) — E(X)E(Y) = 4.5 — (1.8)(1.8) = 1.26 


Furthermore, 


PY ^ excy  (W136(V136) 


Suppose that the random variable X has the following distribution: P(X = 1) = 0.2, 
P(X = 2) = 0.6, P(X = 3) = 0.2. Let Y = 2X + 5. That is, P(Y = 7) = 0.2, P(Y = 9) = 0.6, 
P(Y = 11) = 02. Determine the correlation between X and Y. Refer to Fig. 5-15. 


Because X and Y are linearly related, p = 1. This can be verified by direct calculations: 
Try it. 


For independent random variables, we do not expect any relationship in their joint prob- 
ability distribution. The following result is left as an exercise. 


y y 
3 e 0.4 11 e 0.2 
2 e 0.1 eO.1 9 e 0.6 

1 e 0.1 eO. 7 e 0.2 

p=1 
0.2 

Oe 

O 1 2 3 x 1 2 3 x 


Figure 5-14 Joint distribution for Figure 5-15 Joint distribution for 
Example 5-29. Example 5-30. 
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If X and Y are independent random variables, 


Oxy = pyy = 9 





EXAMPLE 5-31 For the two random variables in Fig. 5-16, show that oy, = 0. 


The two random variables in this example are continuous random variables. In this case 
E(XY ) is defined as the double integral over the range of (X, Y). That is, 


42 472 4 
E(XY) = x = i PN — a 2| 3 i 
yfxy(x, y)dxdy T xy dx | dy ie | 9 x79 
0 
0 0 0 0 0 





4 
4 


- xl y[s/3]dy = L| 7/3 | =4[64/3] = 32/0 
16 )| 6 

















6 
0 
Also, 
4 2 A[2 4 
l 2 l al 
xfyy(x, y)dxdy = — x dx |dy = — | | XABI |dy 
16 16 0 
0 0 0 Lo 0 
= j| [8/3] = 1 [16/2] = 4/3 
16^ ^], 6 " 
4 2 4 
l 2 I DETA 
yfyy(x, y)dxdy = — | y xdx |dy = — | y| x°/2| |dy 
16 16 0 
0 0 0 0 
2} aai 1 
= — 3 = —164/3| = 8/3 
167 3| | 7 glows! = V 
y 
4 
3 
fy (5) = de xy 
2 
1 
O 1 2 x 


Figure 5-16 Random variables 
with zero covariance from Example 
5-31. 


Thus, 
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E(XY) — E(X)E(Y) = 32/9 — (4/3)(8/3) = 0 


It can be shown that these two random variables are independent. You can check that 
fxr, y) = fx) fy() for all x and y. 


However, if the correlation between two random variables is zero, we cannot immediately 
conclude that the random variables are independent. Figure 5-13(d) provides an example. 


EXERCISES FOR SECTION 5-5 


5-67. Determine the covariance and correlation for the 
following joint probability distribution: 


x l l 2 4 
3 4 3 6 


y 
tev, y) 1/8 1/4 1/2 1/8 


5-68. Determine the covariance and correlation for the 
following joint probability distribution: 

x =| —0.5 0.5 l 

y =l —] 1 2 

faxy) 1/8 1/4 1/2 1/8 


5-69. Determine the value for c and the covariance and 
correlation for the joint probability mass function fyy(x, y) = 
c(x + y)forx = 1,2,3 andy = 1,2,3. 

5-70. Determine the covariance and correlation for the joint 
probability distribution shown in Fig. 5-4(a) and described in 
Example 5-8. 


5-71. Determine the covariance and correlation for X, and 
X, in the joint distribution of the multinomial random vari- 
ables X;, X; and X; in with p, = p; = p; = l and n = 3. What 
can you conclude about the sign of the correlation between 
two random variables in a multinomial distribution? 

5-72. Determine the value for c and the covariance and cor- 
relation for the joint probability density function fyy(x, y) = 
cxy over the range 0 < x < 3 and 0 < y < x. 
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5-73. Determine the value for c and the covariance and cor- 
relation for the joint probability density function fyy(x, y) = c 
over the range 0 < x «5,0«y,andx—- 1«y«x- l1. 


5-74. Determine the covariance and correlation for the joint 
probability density function fyy (x, y) = 6 x 10 $e 9:001:70.002y 
over the range 0 < x and x < y from Example 5-15. 


5-15. Determine the covariance and correlation for the joint 
probability density function fyy(x, y) = e * ” over the range 
0 « x and 0 «€ y. 

5-76. Suppose that the correlation between X and Y is p. For 
constants a, b, c, and d, what 1s the correlation between the 
random variables U = aX + b and V = cY + d? 


5-77. The joint probability distribution is 


x =] 0 0 l 
y 0 =] l 0 
fxr, y) 1/4 1/4 1/4 1/4 


Show that the correlation between X and Y is zero, but X and Y 
are not independent. 

5-78. Suppose X and Y are independent continuous random 
variables. Show that o y, = 0. 


An extension of a normal distribution to two random variables is an important bivariate prob- 


ability distribution. 


EXAMPLE 5-32 


At the start of this chapter, the length of different dimensions of an injection-molded part was 


presented as an example of two random variables. Each length might be modeled by a normal 
distribution. However, because the measurements are from the same part, the random 
variables are typically not independent. A probability distribution for two normal random vari- 
ables that are not independent is important in many applications. As stated at the start of the 
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chapter, if the specifications for X and Y are 2.95 to 3.05 and 7.60 to 7.80 millimeters, respec- 
tively, we might be interested in the probability that a part satisfies both specifications; that 1s, 
P(2.95 < X < 3.05, 7.60 < Y < 7.80). 


Definition 
The probability density function of a bivariate normal distribution is 


| 
fxy(x. y; Oy, Oy, Wy, Py, p) m Imm p? exp is 


zz 
pus 
2710 y Oy aie 


for —o» < x < oo and —o < y < ©, with parameters oy > 0,0, > 0, —99 < uy < co, 
88 ie BS EO. ANG] e = 





The result that fyy(x, y; Oy, Ty, Wy, Wy, p) integrates to 1 is left as an exercise. Also, the bivari- 
ate normal probability density function is positive over the entire plane of real numbers. 

Two examples of bivariate normal distributions are illustrated in Fig. 5-17 along with 
corresponding contour plots. Each curve on the contour plots is a set of points for which the 
probability density function is constant. As seen in the contour plots, the bivariate normal 
probability density function is constant on ellipses in the (x, y) plane. (We can consider a circle 
to be a special case of an ellipse.) The center of each ellipse is at the point (wy, wy). If p > 0 
(p < 0), the major axis of each ellipse has positive (negative) slope, respectively. If p = 0, the 
major axis of the ellipse is aligned with either the x or y coordinate axis. 


—0.5(x? + y?) 


EXAMPLE 5-33 The joint probability density function fyy(x, y) = is a special case of a bivariate 


-r E 
V2 
normal distribution with oy = 1, oy = 1, py = 0, uy = 0, and p = 0. This probability density 
function is illustrated in Fig. 5-18. Notice that the contour plot consists of concentric circles about 
the origin. 


By completing the square in the exponent, the following results can be shown. The details 
are left as an exercise. 
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Figure 5-17 Examples of bivariate normal distributions. 
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fxy o y) 





X 
Figure 5-18 Bivariate normal probability density Figure 5-19 Marginal probability 
function with oy = 1, oy = 1, p = 0, uy = 0, and density functions of a bivariate 

Wy = 0. normal distribution. 


If X and Y have a bivariate normal distribution with joint probability density fyy(x, y; 


Oy, Oy, Wy, Wy, p), the marginal probability distributions of X and Y are normal 
with means wy and jp. y and standard deviations oy and oy, respectively. (5-33) 





Figure 5-19 illustrates that the marginal probability distributions of X and Y are normal. 
Furthermore, as the notation suggests, p represents the correlation between X and Y. The 
following result 1s left as an exercise. 


If X and Y have a bivariate normal distribution with joint probability density function 


fxy GG y; Ty, Ty, Wy, Wy, p), the correlation between X and Y is p. (5-34) 





The contour plots in Fig. 5-17 illustrate that as p moves from zero (left graph) to 0.9 (right 
graph), the ellipses narrow around the major axis. The probability is more concentrated about 
a line in the (x, y) plane and graphically displays greater correlation between the variables. If 
p = —1 or +1, all the probability is concentrated on a line in the (x, y) plane. That is, the 
probability that X and Y assume a value that is not on the line is zero. In this case, the bivari- 
ate normal probability density 1s not defined. 

In general, zero correlation does not imply independence. But in the special case that X 
and Y have a bivariate normal distribution, if p — 0, X and Y are independent. The details are 
left as an exercise. 


If X and Y have a bivariate normal distribution with p — 0, X and Y are independent. 


(5-35) 
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CHAPTER 5 JOINT PROBABILITY DISTRIBUTIONS 


An important use of the bivariate normal distribution 1s to calculate probabilities involving 
two correlated normal random variables. 


EXAMPLE 5-34 


Suppose that the X and Y dimensions of an injection-molded part have a bivariate normal 


distribution with oy = 0.04, oy = 0.08. wy = 3.00. wy = 7.70, and p = 0.8. Then, the prob- 
ability that a part satisfies both specifications is 


P(2.95 « X « 3.05, 7.60 « Y « 7.80) 


This probability can be obtained by integrating fyy(x, y; Oy, Oy, Wy Wy, p) over the region 
2.95 « x « 3.05 and 7.60 « y « 7.80, as shown in Fig. 5-7. Unfortunately, there is often no 
closed-form solution to probabilities involving bivariate normal distributions. In this case, the 
integration must be done numerically. 


EXERCISES FOR SECTION 5-6 


5-79. Let X and Y represent concentration and viscosity of a 
chemical product. Suppose X and Y have a bivariate normal 
distribution with oy = 4, oy = 1, wy = 2, and py = 1. Draw 
a rough contour plot of the joint probability density function 
for each of the following values for p: 


(a) p = 0 (b) p= 0.8 
(c) p = —0.8 
5-80. Let X and Y represent two dimensions of an injec- 


tion molded part. Suppose X and Y have a bivariate normal 
distribution with oy = 0.04, oy 0.08, py = 3.00, 
wy = 7.70, and py = 0. Determine P(2.95 « X < 3.05, 
7.60 < Y < 7.80). 


5-81. In the manufacture of electroluminescent lamps, 
several different layers of ink are deposited onto a plastic 
substrate. The thickness of these layers is critical if specifi- 
cations regarding the final color and intensity of light of 
the lamp are to be met. Let X and Y denote the thickness 
of two different layers of ink. It is known that X is nor- 
mally distributed with a mean of 0.1 millimeter and a 
standard deviation of 0.00031 millimeter, and Y is also 
normally distributed with a mean of 0.23 millimeter and a 
standard deviation of 0.00017 millimeter. The value of p for 
these variables is equal to zero. Specifications call for a 
lamp to have a thickness of the ink corresponding to X in 
the range of 0.099535 to 0.100465 millimeters and Y in 
the range of 0.22966 to 0.23034 millimeters. What 1s the 
probability that a randomly selected lamp will conform to 
specifications? 


5-82. Suppose that X and Y have a bivariate normal distri- 

bution with joint probability density function fy, (x, y; oy, Oy, 

Mos Hy, p). 

(a) Show that the conditional distribution of Y, given that 
X = x is normal. 

(b) Determine E(Y|X = x). 

(c) Determine V(Y|X = x). 

5-83. If X and Y have a bivariate normal distribution with 

p = 0, show that X and Y are independent. 


5-84. Show that the probability density function fyy(x, y; 
Oy, Oy, Wy, Wy, p) Of a bivariate normal distribution integrates 
to one. [Hint: Complete the square in the exponent and use the 
fact that the integral of a normal probability density function 
for a single variable 1s 1.] 


5-85. IfX and Y have a bivariate normal distribution with 
joint probability density fyy (x, y; Ty, Oy, Wy, Wy, p), show 
that the marginal probability distribution of X is normal 
with mean py and standard deviation oy. [Hint: Complete 
the square in the exponent and use the fact that the integral 
of a normal probability density function for a single variable 
is 1.] 

5-86. f X and Y have a bivariate normal distribution with 
joint probability density fyy(x, y; Oy, Ty, Wy, Wy, p), show that 
the correlation between X and Y is p. [Hint: Complete the 
square in the exponent]. 


5-7 LINEAR COMBINATIONS OF RANDOM VARIABLES 


A random variable is sometimes defined as a function of one or more random variables. 
The CD material presents methods to determine the distributions of general functions of 
random variables. Furthermore, moment-generating functions are introduced on the CD 


Definition 


Mean of a 
Linear 
Combination 


5-7 LINEAR COMBINATIONS OF RANDOM VARIABLES 181 


and used to determine the distribution of a sum of random variables. In this section, results 
for linear functions are highlighted because of their importance in the remainder of the 
book. References are made to the CD material as needed. For example, if the random vari- 
ables X, and X, denote the length and width, respectively, of a manufactured part, Y = 2X, 
+ 2X, 1s a random variable that represents the perimeter of the part. As another example, 
recall that the negative binomial random variable was represented as the sum of several 
geometric random variables. 

In this section, we develop results for random variables that are linear combinations of 
random variables. 


Given random variables X;, X, ... , X, and constants c;, c», ..., 


Y= CLXi E C5 X» JF 0o99 ote C 


is a linear combination of X4, X5, ..., Xp. 





Now, E(Y) can be found from the joint probability distribution of Xj, X5, ... , X, as follows. 
Assume X, X5, ... , X, are continuous random variables. An analogous calculation can be used 
for discrete random variables. 


oO 


E(Y) = | | - | (Cry "Pt opere Cp Xp )fx, x... X, 1; Diet) (Da OE vacio 
ji | X1 for x... x 0s X25 +++ 5 Xp) dx, dx... dx, 

raff | X2 f X... X (Xb X2 +++ » Xp) AX, dx, ... dx, + , 

ZI | Xp for a... X, X15 325 +++ Xp) dx, dx» ... dx, 

By using Equation 5-24 for each of the terms in this expression, we obtain the following. 


I= Ci Xi = C5 X» sus c 


I0. — eap) ue ee bar 2° or E) 





Furthermore, it is left as an exercise to show the following. 
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Variance of a 
Linear 
Combination 


EXAMPLE 5-35 


EXAMPLE 5-36 


If X1, X>,..., X, are random variables, and Y = cX; + cX, t + c,X,, then in 
general 


WO eG) ar AEG) ee ap GIG) ae AOS, OS, ces ove 20) (9-98) 


<) 
If Xj, X, ... , X, are independent, 


V(Y) = eiV(QX3) + e$V(X2) t + GV(X,) 





Note that the result for the variance in Equation 5-39 requires the random variables to be 
independent. To see why the independence is important, consider the following simple exam- 
ple. Let X, denote any random variable and define X, = — X,. Clearly, X, and X, are not inde- 
pendent. In fact, pyy = —1. Now, Y = X, + X; is 0 with probability 1. Therefore, (Y) = 0, 
regardless of the variances of X, and X. 


In Chapter 3, we found that if Y is a negative binomial random variable with parameters p and 
r, Y — X, +X +: + X, where each X, is a geometric random variable with parameter 
p and they are independent. Therefore, E(X;) = 1/p and E(X) = (1 — p)/p*. From Equation 
5-37, E(Y) = r/p and from Equation 5-39, V(Y) = r(1 — p)/p*. 


An approach similar to the one applied in the above example can be used to verify the 
formulas for the mean and variance of an Erlang random variable in Chapter 4. 


Suppose the random variables X, and X, denote the length and width, respectively, of a man- 
ufactured part. Assume E(X,) = 2 centimeters with standard deviation 0.1 centimeter and 
E(X,) = 5 centimeters with standard deviation 0.2 centimeter. Also, assume that the covari- 
ance between X, and X, is —0.005. Then, Y = 2X, + 2X, is a random variable that represents 
the perimeter of the part. From Equation 5-36, 


E(Y) = 2(2) + 2(5) = 14 centimeters 
and from Equation 5-38 


V(Y) = 2°(0.1*) + 27(0.27) + 2 X 2 x 2(—0.005) 
= 0.04 + 0.16 — 0.04 = 0.16 centimeters squared 


Therefore, the standard deviation of Y is 0.167 = 0.4 centimeters. 


The particular linear combination that represents the average of p random variables, with 
identical means and variances, is used quite often in the subsequent chapters. We highlight the 
results for this special case. 


5-7 LINEAR COMBINATIONS OF RANDOM VARIABLES 183 





Mean and 
Variance of an IfX—(X +X% +-+ A SOG) — quota d — Jb a 
Average 
E(X) = p 
if Xi, 25, ... , X, are also independent with V(X;) = Ooms — al ae D 
The conclusion for V(X ) is obtained as follows. Using Equation 5-39, with c; — 1/p and 
W(X) = c^, yields 
V(X) = (Mpya* + = + (1/pyo* = 07/p 
S e 
p terms 
Another useful result concerning linear combinations of random variables is a reproduc- 
tive property that holds for independent, normal random variables. 
Reproductive 
Property of the If Xi, X5, ..., X, are independent, normal random variables with E(X) = p; and 
_ Normal M ILIUM 
Distribution 


Jews ae Cpe aT pp 


is a normal random variable with 


JO) = C)M4 ae C22 ae ale Coby 


VY) = cio? + os +- + 





The mean and variance of Y follow from Equations 5-37 and 5-39. The fact that Y has a nor- 
mal distribution can be obtained from moment-generating functions discussed in Section 5-9 
in the CD material. 


EXAMPLE 5-37 Let the random variables X, and X, denote the length and width, respectively, of a manufac- 
tured part. Assume that X, is normal with E(X,) = 2 centimeters and standard deviation 
0.1 centimeter and that X, is normal with E(X,) = 5 centimeters and standard deviation 0.2 
centimeter. Also, assume that X, and X, are independent. Determine the probability that the 
perimeter exceeds 14.5 centimeters. 
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EXAMPLE 5-38 


Then, Y = 2X, + 2X, is a normal random variable that represents the perimeter of the 
part. We obtain, E(Y) = 14 centimeters and the variance of Y is 


V(Y) = 4 X 0.1? + 4 x 02? = 0.0416 
Now, 


P(Y > 14.5) = P[(Y — py)/oy > (14.5 — 14)/V0.0416] 
= P(Z > 1.12) = 0.13 


Soft-drink cans are filled by an automated filling machine. The mean fill volume is 12.1 fluid 
ounces, and the standard deviation 1s 0.1 fluid ounce. Assume that the fill volumes of the cans 
are independent, normal random variables. What 1s the probability that the average volume of 


10 cans selected from this process is less than 12 fluid ounces? 


Let X,, X,,.. 


., Xio denote the fill volumes of the 10 cans. The average fill volume 


(denoted as X) is a normal random variable with 


= — 01 
E(X) = 12.1 and V(X) = 0 — 0.001 
Consequently, 
= X- py 12 = 121 
P(X < 12) - P —ez VOo 


EXERCISES FOR SECTION 5-7? 


5-87. If X and Y are independent, normal random variables 
with E(XY)=0, Vix)=4, E(Y) = 10, and V(Y) = 9. 
Determine the following: 
(a) HX + 3Y) 

(c) P(2X + 3Y < 30) (d) P(2X + 3Y < 40) 

5-88. Suppose that the random variable X represents the 
length of a punched part in centimeters. Let Y be the length 
of the part in millimeters. If E(X) = 5 and V(X) = 0.25, what 
are the mean and variance of Y? 


(b) V2X + 3Y) 


5-89. A plastic casing for a magnetic disk is composed of 

two halves. The thickness of each half is normally distributed 

with a mean of 2 millimeters and a standard deviation of 

0.1 millimeter and the halves are independent. 

(a) Determine the mean and standard deviation of the total 
thickness of the two halves. 

(b) What is the probability that the total thickness exceeds 
4.3 millimeters? 


5-90. Inthe manufacture of electroluminescent lamps, sev- 
eral different layers of ink are deposited onto a plastic sub- 
strate. The thickness of these layers is critical if specifications 
regarding the final color and intensity of light of the lamp are 


= P(Z < —3.16) = 0.00079 


to be met. Let X and Y denote the thickness of two different 

layers of ink. It is known that X is normally distributed with a 

mean of 0.1 millimeter and a standard deviation of 0.00031 

millimeter and Y is also normally distributed with a mean of 

0.23 millimeter and a standard deviation of 0.00017 millime- 

ter. Assume that these variables are independent. 

(a) If a particular lamp is made up of these two inks only, 
what is the probability that the total ink thickness is less 
than 0.2337 millimeter? 

(b) A lamp with a total ink thickness exceeding 0.2405 mil- 
limeters lacks the uniformity of color demanded by the 
customer. Find the probability that a randomly selected 
lamp fails to meet customer specifications. 


5-9]. The width of a casing for a door is normally distrib- 

uted with a mean of 24 inches and a standard deviation of 

1/8 inch. The width of a door is normally distributed with a 

mean of 23 and 7/8 inches and a standard deviation of 1/16 

inch. Assume independence. 

(a) Determine the mean and standard deviation of the differ- 
ence between the width of the casing and the width of the 
door. 
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Figure 5-20 Figure for the 
U-shaped component. 


(b) What is the probability that the width of the casing minus 
the width of the door exceeds 1/4 inch? 

(c) What is the probability that the door does not fit in the 
casing? 

5-92. A U-shaped component is to be formed from the three 

parts A, B, and C. The picture is shown in Fig. 5-20. The length 

of A 1s normally distributed with a mean of 10 millimeters and 

a standard deviation of 0.1 millimeter. The thickness of parts B 

and C is normally distributed with a mean of 2 millimeters and 

a standard deviation of 0.05 millimeter. Assume all dimensions 

are independent. 

(a) Determine the mean and standard deviation of the length 
of the gap D. 

(b) What is the probability that the gap D is less than 5.9 mil- 
limeters? 


5-93.  Soft-drink cans are filled by an automated filling ma- 

chine and the standard deviation is 0.5 fluid ounce. Assume 

that the fill volumes of the cans are independent, normal ran- 
dom variables. 

(a) What 1s the standard deviation of the average fill volume 
of 100 cans? 

(b) If the mean fill volume is 12.1 ounces, what is the proba- 
bility that the average fill volume of the 100 cans is below 
12 fluid ounces? 

(c) What should the mean fill volume equal so that the proba- 
bility that the average of 100 cans is below 12 fluid ounces 
is 0.005? 

(d) If the mean fill volume is 12.1 fluid ounces, what should 
the standard deviation of fill volume equal so that the 
probability that the average of 100 cans is below 12 fluid 
ounces is 0.005? 

(e) Determine the number of cans that need to be measured 
such that the probability that the average fill volume is 
less than 12 fluid ounces is 0.01. 


5-94. The photoresist thickness in semiconductor manufac- 

turing has a mean of 10 micrometers and a standard deviation of 

1 micrometer. Assume that the thickness is normally distributed 

and that the thicknesses of different wafers are independent. 

(a) Determine the probability that the average thickness of 10 
wafers is either greater than 11 or less than 9 micrometers. 


(b) Determine the number of wafers that needs to be meas- 
ured such that the probability that the average thickness 
exceeds 11 micrometers is 0.01. 

(c) If the mean thickness is 10 micrometers, what should the 
standard deviation of thickness equal so that the probabil- 
ity that the average of 10 wafers is either greater than 11 or 
less than 9 micrometers is 0.001? 

5-95. Assume that the weights of individuals are independ- 

ent and normally distributed with a mean of 160 pounds and a 

standard deviation of 30 pounds. Suppose that 25 people 

squeeze into an elevator that is designed to hold 4300 pounds. 

(a) What is the probability that the load (total weight) exceeds 
the design limit? 

(b) What design limit is exceeded by 25 occupants with prob- 
ability 0.0001? 


5-8 FUNCTIONS OF RANDOM 
VARIABLES (CD ONLY) 


5-9 MOMENT GENERATING 
FUNCTION (CD ONLY) 


5-10 CHEBYSHEV’S INEQUALITY 
(CD ONLY) 
Supplemental Exercises 
5-96. Show that the following function satisfies the proper- 


ties of a joint probability mass function: 


x y F(x, y) 
0 0 1/4 
0 l 1/8 
l 0 1/8 
l l 1/4 
2 2 1/4 


5-97. Continuation of Exercise 5-96. Determine the follow- 
ing probabilities: 

(a P(X«0.5,Y < 1.5) (b) P(X= 1) 

(c) P(X « 1.5) (d) P(X > 0.5, Y < 1.5) 

(e) Determine E(X), E(Y), V(X), and V(Y). 

5-98. Continuation of Exercise 5-96. Determine the following: 
(a) Marginal probability distribution of the random variable X 
(b) Conditional probability distribution of Y given that X — 1 
(c) E(Y|X = 1) 

(d) Are X and Y independent? Why or why not? 

(e) Calculate the correlation between X and Y. 


5-99, The percentage of people given an antirheumatoid 
medication who suffer severe, moderate, or minor side effects 
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are 10, 20, and 70%, respectively. Assume that people react 

independently and that 20 people are given the medication. 

Determine the following: 

(a) The probability that 2, 4, and 14 people will suffer severe, 
moderate, or minor side effects, respectively 

(b) The probability that no one will suffer severe side effects 

(c) The mean and variance of the number of people that will 
suffer severe side effects 

(d) What is the conditional probability distribution of the 
number of people who suffer severe side effects given that 
19 suffer minor side effects? 

(e) What is the conditional mean of the number of people who 
suffer severe side effects given that 19 suffer minor side 
effects? 


5-100. The backoff torque required to remove bolts in a 

steel plate 1s rated as high, moderate, or low. Historically, the 

probability of a high, moderate, or low rating is 0.6, 0.3, or 

0.1, respectively. Suppose that 20 bolts are evaluated and that 

the torque ratings are independent. 

(a) What is the probability that 12, 6, and 2 bolts are rated as 
high, moderate, and low, respectively? 

(b) What is the marginal distribution of the number of bolts 
rated low? 

(c) What is the expected number of bolts rated low? 

(d) What is the probability that the number of bolts rated low 
is greater than two? 


5-101. Continuation of Exercise 5-100 

(a) What is the conditional distribution of the number of bolts 
rated low given that 16 bolts are rated high? 

(b) What is the conditional expected number of bolts rated 
low given that 16 bolts are rated high? 

(c) Are the numbers of bolts rated high and low independent 
random variables? 


5-102. To evaluate the technical support from a computer 

manufacturer, the number of rings before a call is answered by 

a service representative is tracked. Historically, 70% of the 

calls are answered in two rings or less, 25% are answered in 

three or four rings, and the remaining calls require five rings 
or more. Suppose you call this manufacturer 10 times and 
assume that the calls are independent. 

(a) What is the probability that eight calls are answered in two 
rings or less, one call is answered in three or four rings, 
and one call requires five rings or more? 

(b) What is the probability that all 10 calls are answered in 
four rings or less? 

(c) What is the expected number of calls answered in four 
rings or less? 

5-103. Continuation of Exercise 5-102 

(a) What is the conditional distribution of the number of calls 
requiring five rings or more given that eight calls are 
answered in two rings or less? 

(b) What is the conditional expected number of calls requir- 
ing five rings or more given that eight calls are answered 
in two rings or less? 


(c) Are the number of calls answered in two rings or less and 
the number of calls requiring five rings or more independ- 
ent random variables? 


5-104. Determine the value of c such that the function 
f(x, y) = ex’y for 0 <x <3 and 0 < y < 2 satisfies the 
properties of a joint probability density function. 


5-105. 

following: 

(a) P(X « 1, Y « I) 

(Jury 25)] (0 Pye 2.1 a Fe 15 

(e) E(X) (f) E(Y) 

5-106. Continuation of Exercise 5-104. 

(a) Determine the marginal probability distribution of the 
random variable X. 

(b) Determine the conditional probability distribution of Y 
given that X — 1. 

(c) Determine the conditional probability distribution of X 
given that Y — 1. 


5-107. The joint distribution of the continuous random 

variables X, Y, and Z is constant over the region x^ + y? < 1, 

0 « z « 4. 

(a) Determine P(X? + Y^ < 0.5) 

(b) Determine P(X? + Y? < 0.5,Z « 2) 

(c) What is the joint conditional probability density function 
of X and Y given that Z — 1? 

(d) What is the marginal probability density function of X? 


5-108. Continuation of Exercise 5-107. 

(a) Determine the conditional mean of Z given that X = 0 and 
Y — 0. 

(b) In general, determine the conditional mean of Z given that 
X — x and Y= y. 

5-109. Suppose that X and Y are independent, continuous 

uniform random variables for 0 < x < 1 and 0 < y < 1. Use 

the joint probability density function to determine the proba- 

bility that | X — Y| < 0.5. 

5-110. The lifetimes of six major components in a copier are 

independent exponential random variables with means of 8000, 

10,000, 10,000, 20,000, 20,000, and 25,000 hours, respectively. 

(a) What is the probability that the lifetimes of all the compo- 
nents exceed 5000 hours? 

(b) What is the probability that at least one component life- 
time exceeds 25,000 hours? 


5-111.  Contamination problems in semiconductor manu- 

facturing can result in a functional defect, a minor defect, or 

no defect in the final product. Suppose that 20, 50, and 30% of 

the contamination problems result in functional, minor, and no 

defects, respectively. Assume that the effects of 10 contamina- 

tion problems are independent. 

(a) What is the probability that the 10 contamination problems 
result in two functional defects and five minor defects? 

(b) What is the distribution of the number of contamination 
problems that result in no defects? 


Continuation of Exercise 5-104. Determine the 


(b) P(X < 2.5) 
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(c) What is the expected number of contamination problems 
that result in no defects? 


5-112. The weight of adobe bricks for construction is 

normally distributed with a mean of 3 pounds and a standard 

deviation of 0.25 pound. Assume that the weights of the bricks 

are independent and that a random sample of 25 bricks is 

chosen. 

(a) What is the probability that the mean weight of the sample 
is less than 2.95 pounds? 

(b) What value will the mean weight exceed with probability 
0.99? 


5-113.  Thelength and width of panels used for interior doors 

(in inches) are denoted as X and Y, respectively. Suppose that X 

and Y are independent, continuous uniform random variables for 

17.75 < x < 18.25 and 4.75 < y < 5.25, respectively. 

(a) By integrating the joint probability density function over 
the appropriate region, determine the probability that the 
area of a panel exceeds 90 squared inches. 

(b) What is the probability that the perimeter of a panel 
exceeds 46 inches? 


5-114. The weight of a small candy is normally distributed 

with a mean of 0.1 ounce and a standard deviation of 0.01 

ounce. Suppose that 16 candies are placed in a package and 

that the weights are independent. 

(a) What are the mean and variance of package net weight? 

(b) What is the probability that the net weight of a package is 
less than 1.6 ounces? 

(c) If 17 candies are placed in each package, what is the 
probability that the net weight of a package is less than 
1.6 ounces? 


5-115. The time for an automated system in a warehouse to 

locate a part is normally distributed with a mean of 45 seconds 

and a standard deviation of 30 seconds. Suppose that inde- 

pendent requests are made for 10 parts. 

(a) What 1s the probability that the average time to locate 10 
parts exceeds 60 seconds? 

(b) What is the probability that the total time to locate 10 
parts exceeds 600 seconds? 


5-116. A mechanical assembly used in an automobile en- 
gine contains four major components. The weights of the 
components are independent and normally distributed with 
the following means and standard deviations (in ounces): 


Standard 
Component Mean Deviation 
Left case 4 0.4 
Right case 5.5 0.5 
Bearing assembly 10 0.2 
Bolt assembly 8 Ds 


(a) What 1s the probability that the weight of an assembly 
exceeds 29.5 ounces? 

(b) What is the probability that the mean weight of eight 
independent assemblies exceeds 29 ounces? 


5-117. Suppose X and Y have a bivariate normal distribution 
with oy = 4,0y = l, Ly = 4, |y = 4,andp — —0.2.Draw 
a rough contour plot of the joint probability density function. 


1 =] 
5-118. If fyy(x, y) = Ton oP lo [(x — 1) 


- L6(x - y -2 + (y — 2p} 
determine E(X), E(Y), V(X), V(Y), and p by recorganizing the 
parameters in the joint probability density function. 

5-119. The permeability of a membrane used as a moisture 

barrier in a biological application depends on the thickness of 

two integrated layers. The layers are normally distributed with 

means of 0.5 and 1 millimeters, respectively. The standard 

deviations of layer thickness are 0.1 and 0.2 millimeters, 

respectively. The correlation between layers is 0.7. 

(a) Determine the mean and variance of the total thickness of 
the two layers. 

(b) What is the probability that the total thickness 1s less than 
] millimeter? 

(c) Let X, and X, denote the thickness of layers 1 and 2, re- 
spectively. A measure of performance of the membrane is 
a function 2X, + 3X, of the thickness. Determine the 
mean and variance of this performance measure. 


5-120. The permeability of a membrane used as a moisture 
barrier in a biological application depends on the thickness of 
three integrated layers. Layers 1, 2, and 3 are normally dis- 
tributed with means of 0.5, 1, and 1.5 millimeters, respec- 
tively. The standard deviations of layer thickness are 0.1, 0.2, 
and 0.3, respectively. Also, the correlation between layers 1 
and 2 is 0.7, between layers 2 and 3 is 0.5, and between layers 
] and 3 1s 0.3. 
(a) Determine the mean and variance of the total thickness of 
the three layers. 
(b) What is the probability that the total thickness is less than 
1.5 millimeters? 


5-121. A small company is to decide what investments to 
use for cash generated from operations. Each investment has a 
mean and standard deviation associated with the percentage 
gain. The first security has a mean percentage gain of 5% with 
a standard deviation of 2%, and the second security provides 
the same mean of 5% with a standard deviation of 4%. The 
securities have a correlation of —0.5, so there is a negative 
correlation between the percentage returns. If the company 
invests two million dollars with half in each security, what is 
the mean and standard deviation of the percentage return? 
Compare the standard deviation of this strategy to one that 
invests the two million dollars into the first security only. 
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MIND-EXPANDING EXERCISES 


5-122. Show that if X;, X5,..., X, are independent, 5-124. Suppose that the joint probability function of 
continuous random variables, P(X, € A,, X% € A»,..., the continuous random variables X and Y is constant on 
X,€ A) = P(X € A)PQG € Aa)... P(X, € A,) forany therectangle 0 < x « a, 0 « y « b. Show that X and Y 


regions A,, 45, ..., A, in the range of Xj, 35, ..., X, are independent. 


respectively. 5-125. Suppose that the range of the continuous 
5-123. Show that if Xj, X,,..., X, are independent variables X and Y is 0 < x < a and 0 < y < b. Also 
random variables and Y = c,Xj + c,X) + ++: + cX,, suppose that the joint probability density function 
fxy Go y) ^ g(x)A(y), where g(x) is a function only of 

Dh 2 "m 2 
WY) = eiV(X) + e3V(X5) + + eX) x and h(y) is a function only of y. Show that X and Y 


You can assume that the random variables are continuous. — 21€ independent. 


IMPORTANT TERMS AND CONCEPTS 


In the E-book, click on any Correlation Multinomial Moment generating 
term or concept below to ^ Covariance distribution function 
go to that subject. Independence Reproductive property Uniqueness property of 
Bivariate normal Joint probability density of the normal distri- moment generating 
distribution function bution function 
Conditional mean Joint probability mass Chebyshev's inequality 
Conditional probability function CU MATERIAL 
density function Linear combinations of Convolution 
Conditional probability random variables Functions of random 
mass function Marginal probability variables 
Conditional variance distribution J acobian of a transfor- 
mation 


Contour plots 


5-8 FUNCTIONS OF RANDOM VARIABLES (CD ONLY) 


EXAMPLE S5-1 


In many situations in statistics, 1t is necessary to derive the probability distribution of a func- 
tion of one or more random variables. In this section, we present some results that are helpful 
in solving this problem. 

Suppose that X is a discrete random variable with probability distribution f(x). Let Y = A(X) 
be a function of X that defines a one-to-one transformation between the values of X and Y, and we 
wish to find the probability distribution of Y. By a one-to-one transformation, we mean that each 
value x is related to one and only one value of y = A(x) and that each value of y is related to one 
and only one value of x, say, x = u(y), where u(y) is found by solving y = h(x) for x in terms of y. 

Now, the random variable Y takes on the value y when X takes on the value u(y). 
Therefore, the probability distribution of Y is 


fry) = PY = y) = PIX = u(y)] = fxlu(y)] 


We may state this result as follows. 


Suppose that X is a discrete random variable with probability distribution f(x). Let 
Y = h(X) define a one-to-one transformation between the values of X and Y so that 
the equation y = h(x) can be solved uniquely for x in terms of y. Let this solution be 
x = u(y). Then the probability distribution of the random variable Y is 


Fry) = fxlu(y)] (S5-1) 





Let X be a geometric random variable with probability distribution 


fx) = p — p), x12. 


Find the probability distribution of Y = X°. 
Since X = 0, the transformation is one to one; that is, y = x^ and x = Vy. Therefore, 
Equation S5-1 indicates that the distribution of the random variable Y is 


frly) - Ky) = p(1— p),  y—14,9,16,... 


Now suppose that we have two discrete random variables X, and X, with joint probability 
distribution fy y.(x1, x5) and we wish to find the joint probability distribution fy, y, (1, 2) of 
two new random variables Y, = A,(X;, X) and Y, = A(X., X;). We assume that the functions 
h, and h, define a one-to-one transformation between (x,, x2) and (y1, y;). Solving the equa- 
tions y, = h(x, xj) and y, = h,(x,, x;) simultaneously, we obtain the unique solution 
X, = u(y y;) and x, = wj(y;, yz). Therefore, the random variables Y, and Y, take on the 
values y, and y, when X, takes on the value u,( y,, y2) and X, takes the value u(y, y2). The 
joint probability distribution of Y, and Y, is 


fry (y y2) = PY = yy, Yo = yj) 
= PLX, = u(y, y2), Xo = uN, yj] 
= flx u(y, y2), u( Y1, y2)] 
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5-2 


EXAMPLE $5-2 


We will also state this result as follows. 


Suppose that X, and X, are discrete random variables with joint probability distribu- 
tion fy. x (x1, X2), and let Y, = Ai(X;, X2) and Y; = A;(X;, X2) define one-to-one trans- 
formations between the points (x,, x2) and (y,, y2) so that the equations y, = h,(x,, x;) 
and y, = h(x), x5) can be solved uniquely for x, and x, in terms of y, and y». Let this 
solution be x, = u(y, y;) and x; = w(y,, y2). Then the joint probability distribution 
of Y, and Y, is 


fry. ¥2) = fx, xui Ya), u( Y1, Y2) | (S32) 





A very important application of Equation S5-2 is in finding the distribution of a random vari- 
able Y, that is a function of two other random variables X, and X. That is, let Y, = A(X, X) 
where X, and X; are discrete random variables with joint distribution fy, y (x1, x2). We want to 
find the probability distribution of Yj, say, fy. (y;). To do this, we define a second function Y; = 
h,(X,, X) so that the one-to-one correspondence between the points (x4, x2) and (y;, y2) 1s main- 
tained, and we use the result in Equation S5-2 to find the joint probability distribution of Y, and 
Y». Then the distribution of Y, alone is found by summing over the y, values in this joint distribu- 
tion. That is, fy (yı) is just the marginal probability distribution of Y;, or 


fv) = X vy y2) 
y2 


Consider the case where X, and X, are independent Poisson random variables with parameters 
^, and M, respectively. We will find the distribution of the random variable Y, = X, + X. 
The joint distribution of X, and X, 1s 


Fy xx p X2) = fx (x ) fx (x2) 
g un e 9A» 


X x»! 








7 g rop E E 
~ a * x, = 0,1,..., xX, = 0,1,... 
Iesv 


because X, and X, are independent. Now to use Equation S5-2 we need to define a second func- 
tion Y, = /(X;, X). Let this function be Y, =_X,. Now the solutions for x, and x, are x, = y, — y; 
and x, = y». Thus, from Equation S5-2 the joint probability distribution of Y, and Y, is 


e Q2, n -Y2AE: 


fani y) = o P lA Dx OU ees d 


(yi — ya)! yo! 


Because x, = 0, the transformation x, = y, — y; requires that x, = y, must always be less than 
or equal to y,. Thus, the values of y; are 0, 1,..., y,, and the marginal probability distribution 
of Y, 1s obtained as follows: 


yı e ^ *À3)A {1 OA» 


yı 
fy) = 2 fae») H 2. (yi — yo)! ys! 


1 = 
yx 0 (1 — 2)! 2! Vit e 








g 0939 às y APTA etta) n fy (Vi-Y2)y y» 
PEAL Ar » Àj À? 


The summation in this last expression is the binomial expansion of (A, + A5)", so 


e Ait Ad) | oe A>)" 


fv») = y! 


à yı = 0,1,... 


We recognize this as a Poisson distribution with parameter N; + ^2. Therefore, we have shown 
that the sum of two independent Poisson random variables with parameters A, and ^, has a 
Poisson distribution with parameter N; + A». 


We now consider the situation where the random variables are continuous. Let Y — A(X), 
with X continuous and the transformation is one to one. 


Suppose that X is a continuous random variable with probability distribution f(x). 
The function Y = A(X) is a one-to-one transformation between the values of Y and X 
so that the equation y = h(x) can be uniquely solved for x in terms of y. Let this 
solution be x = u(y). The probability distribution of Y is 


fv») = fx[u] || (S5-3) 


where J = u'(y) is called the Jacobian of the transformation and the absolute value 
of J 1s used. 





Equation S5-3 is shown as follows. Let the function y = A(x) be an increasing function of x. 
Now 


If we change the variable of integration from x to y by using x = u(y), we obtain dx = u(y) dy 
and then 


a 


py a)- | AWO à 


— 00 


Since the integral gives the probability that Y = a for all values of a contained in the feasible 
set of values for y, fy| u(y) ju'( y) must be the probability density of Y. Therefore, the proba- 
bility distribution of Y is 


fry) = fluCy) lu) = flu(»))J 


If the function y = A(x) is a decreasing function of x, a similar argument holds. 


5.4 


EXAMPLE $5-3 


EXAMPLE $5-4 


Let X be a continuous random variable with probability distribution 
x 
Ka) = g QO=x<4 


Find the probability distribution of Y = A(X) = 2X + 4. 

Note that y = h(x) = 2x + 4 is an increasing function of x. The inverse solution is x = 
u(y) = (y — 4)/2, and from this we find the Jacobian to be J = u'(y) = dx/dy = 1/2. 
Therefore, from S5-3 the probability distribution of Y is 


— 4/2 (1 — 4 
nue E B G- ; 4sys 12 


We now consider the case where X, and X, are continuous random variables and we wish 
to find the joint probability distribution of Y, = h,(X,, X;) and Y; = /; (X, X2) where the trans- 
formation is one to one. The application of this will typically be in finding the probability dis- 
tribution of Y, = A,(X;, X), analogous to the discrete case discussed above. We will need the 
following result. 


Suppose that X; and X, are continuous random variables with joint probability distri- 
bution fy x.(x;, x2), and let Y; = A,(X1, X5) and Y; = /;(X;, X2) define a one-to-one 
transformation between the points (x,, x2) and (y,, y2). Let the equations y, = h,(x,, 
X5) and y; = /5(x;, x;) be uniquely solved for x, and x, in terms of y, and y; as x, = 
U1y,, y?) and x; = wy, y2). Then the joint probability of Y, and Y, is 


fry y = fzx yis y) (Y1; y2)] | J (S5-4) 


where J is the Jacobian and is given by the following determinant: 


0x /dy1, 0x1/9y» 
0x5/ yi, OX2/dV> 


and the absolute value of the determinant 1s used. 





This result can be used to find fy y,( y, y), the joint probability distribution of Y, and Y;. Then 
the probability distribution of Y, 1s 


fy n) = | fyn Qn 2) dy» 
That is, fy (yı) is the marginal probability distribution of Y;. 


Suppose that X; and X, are independent exponential random variables with fy (xı) = Doo 
and fy. (x2) = 2e ^. Find the probability distribution of Y = X1/X». 


5-5 


The joint probability distribution of X, and X, is 
fx x (xi x2) — 4e 201 *x) Xi > 0, Xo > 0 
because X, and X, are independent. Let Y, = A,(X;, X5) = Xj/X; and Y, = hj(X,, X) = 
X, + X. The inverse solutions of y, = x;/x; and y; = x, + x; are x = y1 y2/(1 + yı) and 
X? = yo/(1 + yi), and it follows that 


salah et 
Oy is +y? dy» (1 + yj) 
0X» 


Therefore 


32. O N 
Je dtar (=y) O> h 
E —d | aa 


(tx (+y) 
and from Equation 85-4 the joint probability distribution of Y, and Y, is 


frn i y») = fex [Mv y2) (yy »2)] |J] 


= 4e 2D ty)ty( ty) — — 
(1 + yi) 








= 4e ?! yo/(1 + yj) 


for y, > 0, y, > 0. We need to find the distribution of Y, = X,/X2. This is the marginal prob- 
ability distribution of Y,, or 


fy (v1) = | fry o y) dy» 
0 


= | 4e ?* [y)/(1 + y)*] dy; 
0 


1 
a >0 
(1c y) A 


An important application of Equation S5-4 is to obtain the distribution of the sum of two 
independent random variables X, and X. Let Y, = X, + X; and let Y; = X. The inverse so- 
lutions are x, = y, — y; and x; = y». Therefore, 


OX OX 

= = | ——— = oce] 
Oy, dy» 

Ox Ox 

2 ee 0 2 1 


dy) dy» 


and |J] = 1. From Equation S5-4, the joint probability density function of Y, and Y, is 


fy y Qi y2) = fx — y3)f x. y2) 


Therefore, the marginal probability density function of Y, 1s 


fv) E | fos aa V2) Fx,(¥2) dy» 


The notation is simpler if the variable of integration y, is replaced with x and y, is replaced 
with y. Then the following result is obtained. 


Convolution 
of X, and X, If X, and X; are independent random variables with probability density functions 
fx, (xı) and fy (x2), respectively, the probability density function of Y = X, + X; is 


ipo | 12.055: On (S5-5) 


— oo 





The probability density function of Y in Equation S5-5 is referred to as the convolution of the 
probability density functions for X, and_X,. This concept is commonly used for transforma- 
tions (such as Fourier transformations) in mathematics. This integral may be evaluated nu- 
merically to obtain the probability density function of Y, even for complex probability density 
functions for X, and X;. A similar result can be obtained for discrete random variables with the 
integral replaced with a sum. 

In some problems involving transformations, we need to find the probability distribution 
of the random variable Y = h(X) when X is a continuous random variable, but the transforma- 
tion is not one to one. The following result is helpful. 


Suppose that X is a continuous random variable with probability distribution f(x), 
and Y = A(X)isa transformation that is not one to one. If the interval over which X 
is defined can be partitioned into m mutually exclusive disjoint sets such that each of 
the inverse functions x, = u(y), x; = w(y),..., x, = u,(y) of y = u(x) is one to 
one, the probability distribution of Y 1s 


LoVe > fluo) e (85-6) 


where J; = u(y), i = 1, 2, ..., m and the absolute values are used. 





To illustrate how this equation is used, suppose that X is a normal random variable with 
mean p and variance o°, and we wish to show that the distribution of Y = (X — w)’/o* isa 
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chi-squared distribution with one degree of freedom. Let Z = (X — w)/o, and Y = Z°. The 
probability distribution of Z is the standard normal; that is, 


-zn =0 <7 = ig 


The inverse solutions of y = z? are z = + Vy, so the transformation is not one to one. Define 


zi = — Vy and z = +Vyso that J; 


= —(1/2)/Vy. and Jy = (1/2)/Vy. Then by Equation 


S5-6, the probability distribution of Y is 











1 =| 1 1 
fr) V 2T 2 Vy NAT t 2 Vy 
l 1 
= /2-1 ,—-y/2 
Aya? £ > 70 
Now it can be shown that Vn. = F (1/2), so we may write f(y) as 
l 1/2-1 ,-y/2 
po. y= 0 


fry) = nar (2) 


which is the chi-squared distribution with 1 degree of freedom. 


EXERCISES FOR SECTION 5-8 


S5-1. Suppose that X is a random variable with probability 
distribution 

fx(x) = 1/4, x = 1,2,3,4 
Find the probability distribution of the random Y = 2X + 1. 
S5-2. Let X be a binomial random variable with p = 0.25 
and n = 3. Find the probability distribution of the random 
variable Y = X°’. 


S5-3. Suppose that X is a continuous random variable with 
probability distribution 


(a) Find the probability distribution of the random variable 
Y — 2X + 10. 
(b) Find the expected value of Y. 


85-4. Suppose that X has a uniform probability distribution 


Show that the probability distribution of the random variable 
Y — —2 In X is chi-squared with two degrees of freedom. 


85-5. A current of / amperes flows through a resistance of R 
ohms according to the probability distribution 


Suppose that the resistance is also a random variable with 
probability distribution 


0zrzl 


Assume that / and R are independent. 

(a) Find the probability distribution for the power (in watts) 
P= PR. 

(b) Find E(P). 

S5-6. A random variable X has the following probability 

distribution: 


hka) =e, +20 

(a) Find the probability distribution for Y = X?. 
(b) Find the probability distribution for Y = X^. 
(c) Find the probability distribution for Y — In X. 


5-8 


S5-7. The velocity of a particle in a gas is a random variable 
V with probability distribution 


fy(v) = ave ™ v>0 

where b is a constant that depends on the temperature of the 

gas and the mass of the particle. 

(a) Find the value of the constant a. 

(b) The kinetic energy of the particle is W = mV?/2. Find the 
probability distribution of W. 


S5-8. Suppose that X has the probability distribution 


Find the probability distribution of the random variable Y = e*. 


S5-9. Prove that Equation S5-3 holds when y = A(x) is a 
decreasing function of x. 


S5-10. The random variable X has the probability distribution 


Q=x=4 


x(x) = z, 


Find the probability distribution of Y = (X — 2%. 
S5-11. Consider a rectangle with sides of length S, and S», 
where S, and S, are independent random variables. The prob- 


ability distributions of S, and S, are 


Ss, (s1) = 251, 


0s5,51 


and 


Js, (52) = : 


0zxs,-4A 


(a) Find the joint distribution of the area of the rectangle 4 = 
S, S and the random variable Y = S. 

(b) Find the probability distribution of the area A of the rec- 
tangle. 


85-12. Suppose we have a simple electrical circuit in 
which Ohm's law V — IR holds. We are interested in the 
probability distribution of the resistance R given that V and 
I are independent random variables with the following dis- 
tributions: 


and 


Find the probability distribution of R. 


5-9 MOMENT GENERATING FUNCTIONS (CD ONLY) 


Suppose that X is a random variable with mean u. Throughout this book we have used the idea of 
the expected value of the random variable X, and in fact E(X) = u. Now suppose that we are in- 
terested in the expected value of a particular function of X, say, g(X) = X”. The expected value of 
this function, or £[g(X)] = E(X”), is called the rth moment about the origin of the random variable 


X, which we will denote by w,.. 


Definition 


The rth moment about the origin of the random variable X is 


p, = E(X") = 


X discrete 


>, Vf); 


| x'f(x)dx, | X continuous 


— oo 





Notice that the first moment about the origin is just the mean, that is, E(X) = p = p. 
Furthermore, since the second moment about the origin is E(X*) = ww, we can write the vari- 
ance of a random variable in terms of origin moments as follows: 


o^ = EX?) - [EQOY = po — w^ 
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The moments of a random variable can often be determined directly from the definition 
in Equation S5-7, but there is an alternative procedure that is frequently useful that makes use 
of a special function. 


Definition 


The moment generating function of the random variable X 1s the expected value of 
e'* and is denoted by M, (f). That is, 


3 9 WO X discrete 


My(f) = E(e?) = (S5-8) 


| e f(x) dx, | X continuous 


—oo 





The moment generating function M,(f) will exist only if the sum or integral in the above def- 
inition converges. If the moment generating function of a random variable does exist, it can be 
used to obtain all the origin moments of the random variable. 


Let X be a random variable with moment generating function M,(t). Then 


Z d” My(t) 





Assuming that we can differentiate inside the summation and integral signs, 


te Jo. X discrete 
rmo (EP 


dt" 


| x'e"f(x)dx, | X continuous 


— OO 


Now if we set ¢ = 0 in this expression, we find that 


d'My(t) 
dt" 








t=0 A 


EXAMPLE S5-5 Suppose that X has a binomial distribution, that is 


fix) = (") pra —py'*, x-20,L...,n 


Determine the moment generating function and use it to verify that the mean and variance of 
the binomial random variable are p. = np and o? = np(1 — p). 
From the definition of a moment generating function, we have 


Mx(t) = 5 d (") p-p "= X (") (pet — py"? 


x=0 


This last summation is the binomial expansion of [pæ + (1 — p)]", so 


My(t) = [pe + (1 — p)" 


Taking the first and second derivatives, we obtain 





dM y(t 
Mxy(t) = B = npe'|l + p(e' — 1)|"! 
and 
d^My(t 
My(t) = m = npe'(1 — p + npe|l + p(e — 1)" 


If we set £ = 0 in My(t), we obtain 


Mx(t);zo = bt = p = np 


which is the mean of the binomial random variable X. Now if we set t = 0 in My(t), 


Mx(t) |,-o = m2 = np(1 — p + np) 
Therefore, the variance of the binomial random variable is 


o? = ph — y? = np(1 — p + np) — (np? = np — np? = np(1 — p) 


EXAMPLE S5-6 Find the moment generating function of the normal random variable and use it to show that 
the mean and variance of this random variable are u and o°, respectively. 
The moment generating function 1s 


a l g e-u) gy 


e o V2m. i 





My(t) = 


e Xt tox + pw") /(20°) gy 





o V2m. 


{ps jas 


If we complete the square in the exponent, we have 
x^ — 2(p. + to*)x + p = [x — (u + to?) — 2uto^ — Po" 
and then 


— {Lx — (p+ t0°)P — 2040? — Fo*y/207) Jy 





l 
= tra = g Ape] 
e | o Vm e X 


— oO 


Properties of 
Moment 
Generating 
Functions 


Let u = [x — (uw + to?)]/o. Then dx = o du, and this last expression above becomes 


My) =e er | ug ends 
T 


Now the integral is just the total area under a standard normal density, which is 1, so the mo- 
ment generating function of a normal random variable is 


M,(t) = gto /2 


Differentiating this function twice with respect to ¢ and setting t = 0 in the result, we find 





Therefore, the variance of the normal random variable is 


c^-Wu-wW-octwW-yw-ac 


2 

Moment generating functions have many important and useful properties. One of the 
most important of these 1s the uniqueness property. That is, the moment generating function 
of a random variable is unique when it exists, so if we have two random variables X and Y, say, 
with moment generating functions M(t) and My(t), then if My(t) = My(t) for all values of t, 
both X and Y have the same probability distribution. Some of the other useful properties of the 
moment generating function are summarized as follows. 


If X 1s a random variable and a 1s a constant, then 


O TOE 
(2) Max(t) = My(at) 


If Xi, X>,..., X, are independent random variables with moment generating functions 
My (t), Mx (t), ..., My (t), respectively, and if Y = X, + X, + --- + X, then the mo- 
ment generating function of Y is 


(3) My(t) = My(t) + My(t) * ^ * My (2) (S5-10) 


n 





Property (1) follows from My,,(t) = E[e"*9] = e“E(e%) = e“M,(t). Property (2) 
follows from M,y(r) = E[e'^9] = E[e^*] = M,(at). Consider property (3) for the case 
where the X’s are continuous random variables: 


My/(t) - Efe”) = E[e XX] 


= | | u | gent) f X2,- , Xn) LO aia OX, 


EXAMPLE $5-7 


Since the X's are independent, 


J ilaa) S fx) * fx 2) did o) 
and one may write 
My(t) — | e" fy (x1) dxi | e fy (x2) dxa °°" | e"" fy (Xp) dx, 


= My (t) + Mx(t) +--+ * Mx) 


For the case when the X’s are discrete, we would use the same approach replacing integrals 
with summations. 

Equation S5-10 is particularly useful. In many situations we need to find the distribution 
of the sum of two or more independent random variables, and often this result makes the prob- 
lem very easy. This is illustrated in the following example. 


Suppose that X, and X, are two independent Poisson random variables with parameters A, and 
A>, respectively. Find the probability distribution of Y = X, +X. 
The moment generating function of a Poisson random variable with parameter A is 


My(t) = e 79 


so the moment generating functions of X; and X, are My (t) = g(€ 77 and My(t) = edie 1) 
respectively. Using Equation S5-10, we find that the moment generating function of 
Y= X T X> 1S 


My(t) = My (t)My(t) = e Dgse7 = genre n 


which is recognized as the moment generating function of a Poisson random variable with pa- 
rameter A, + A,. Therefore, we have shown that the sum of two independent Poisson random 
variables with parameters A, and A, is a Poisson random variable with parameters equal to the 
sum of the two parameters A, + A). 


EXERCISES FOR SECTION 5-9 


S5-13. A random variable X has the discrete uniform distri- 


bution 


(b) Use M(t) to find the mean and variance of X. 
85-14. A random variable X has the Poisson distribution 


—rA\x 
jo 


x =0,1,... 
x! 


(a) Show that the moment generating function is 


M= enn 


(b) Use My(t) to find the mean and variance of the Poisson 
random variable. 


S5-15. The geometric random variable X has probability 
distribution 


Jo eese 09 


(a) Show that the moment generating function is 


ae 
|.1l- (1- pie! 





M(t) 


(b) Use M,(t) to find the mean and variance of X. 


85-16. The chi-squared random variable with k degrees of 
freedom has moment generating function M,(f) = (1 — 2f) "?. 
Suppose that X, and X, are independent chi-squared random 
variables with k, and k, degrees of freedom, respectively. 
What is the distribution of Y = X, + Xj? 


S5-17. A continuous random variable X has the following 
probability distribution: 


Jo) e Axe ~, x90 

(a) Find the moment generating function for X. 

(b) Find the mean and variance of X. 

85-18. The continuous uniform random variable X has den- 
sity function 





(a) Show that the moment generating function is 


el? _ el 


My(t) = (B — a) 


(b) Use My(®) to find the mean and variance of X. 
85-19. A random variable X has the exponential distribution 
x0 


Je Ag t 


(a) Show that the moment generating function of X is 


ues (i E ty" 


(b) Find the mean and variance of X. 
85-20. Arandom variable X has the gamma distribution 


5-13 


/G) = ges Aa tes 


x20 


(a) Show that the moment generating function of X is 


M,(t) = (i » 3E 


(b) Find the mean and variance of X. 


S5-21. Let X;,X,,..., X, be independent exponential ran- 

dom variables with parameter A. 

(a) Find the moment generating function of Y = X, + 
Ag dem eX. 

(b) What is the distribution of the random variable Y? 

| Hint: Use the results of Exercise S5-20]. 


S5-22. Suppose that X; has a normal distribution with mean 
p, and variance o^, i = 1,2. Let X, and X, be independent. 
(a) Find the moment generating function of Y = X, +X. 

(b) What is the distribution of the random variable Y? 


85-23. Show that the moment generating function of the 
chi-squared random variable with k degrees of freedom is 
M,(t) = (1 — 21) ^?. Show that the mean and variance of this 
random variable are k and 2k, respectively. 


85-24. Continuation of Exercise S5-20. 

(a) Show that by expanding e“ in a power series and taking 
expectations term by term we may write the moment gen- 
erating function as 


My(t) = E(e“) 
/ / t? 
=] te Pgh Bos ere 


r 


yt 
ES uut TT 
r. 


Thus, the coefficient of ¢’/r! in this expansion is w’, the rth 

origin moment. 

(b) Continuation of Exercise S5-20. Write the power series 
expansion for Mx(t), the gamma random variable. 

(c) Continuation of Exercise S5-20. Find p. and p3 using the 
results of parts (a) and (b). Does this approach give the 
same answers that you found for the mean and variance of 
the gamma random variable in Exercise S5-20? 


5-10 CHEBYSHEV'S INEQUALITY (CD ONLY) 


In Chapter 3 we showed that if X is a normal random variable with mean w and standard 
deviation o, P(u — 1.960 « X < u + 1.960) = 0.95. This result relates the probability of a 
normal random variable to the magnitude of the standard deviation. An interesting, similar re- 
sult that applies to any discrete or continuous random variable was developed by the mathe- 


matician Chebyshev in 1867. 


5.14 


Chebyshev's 
Inequality 


EXAMPLE $5-8 


For any random variable X with mean w and variance o?, 


Fx —w| 2 em) = Ie 


for c » 0. 





This result 1s interpreted as follows. The probability that a random variable differs from its 
mean by at least c standard deviations is less than or equal to 1/c*. Note that the rule is useful 
only for c > 1. 

For example, using c = 2 implies that the probability that any random variable differs 
from its mean by at least two standard deviations is no greater than 1/4. We know that for a 
normal random variable, this probability is less than 0.05. Also, using c — 3 implies that the 
probability that any random variable differs from its mean by at least three standard deviations 
is no greater than 1/9. Chebyshev's inequality provides a relationship between the standard 
deviation and the dispersion of the probability distribution of any random variable. The proof 
is left as an exercise. 

Table S5-1 compares probabilities computed by Chebyshev's rule to probabilities com- 
puted for a normal random variable. 


The process of drilling holes in printed circuit boards produces diameters with a standard 
deviation of 0.01 millimeter. How many diameters must be measured so that the probability is 
at least 8/9 that the average of the measured diameters is within 0.005 of the process mean 
diameter p? 

Let Xj, X, . . . , X, be the random variables that denote the diameters of n holes. The aver- 
age measured diameter is X = (X, + X; + + X,)/n. Assume that the X’s are independent 
random variables. From Equation 5-40, E(X) = p. and V(X) = 0.01?/n. Consequently, the 
standard deviation of X is (0.017/1)?. By applying Chebyshev's inequality to X, 


P(X — u| = c(0.017/n)'^) S 1/c* 
Let c = 3. Then, 

P(X — u| = 3(0.017/n) 7) = 1/9 
Therefore, 


P(| X — u| < 3(0.017/n) 7) = 8/9 


Table S5-1 Percentage of Distribution Greater than c Standard 
Deviations from the Mean 


Chebyshev's Rule Normal 
e for any Probability Distribution Distribution 
1:5 less than 44.4% 13.4% 
2 less than 25.0% 4.6% 
3 less than 11.1% 0.27% 
4 less than 6.3% 0.01% 


5-15 


Thus, the probability that X is within 3(0.017/n)'”” of u is at least 8/9. Finally, n is chosen 
such that 3(0.01?/n)'? = 0.005. That is, 


n= 


3^10.017/0.005*^] = 36 


EXERCISES FOR SECTION 5-10 


S5-25. The photoresist thickness in semiconductor manu- 
facturing has a mean of 10 micrometers and a standard devia- 
tion of 1 micrometer. Bound the probability that the thickness 
is less than 6 or greater than 14 micrometers. 


S5-26. Suppose X has a continuous uniform distribution 
with range 0 « x « 10. Use Chebyshev's rule to bound the 
probability that X differs from its mean by more than two stan- 
dard deviations and compare to the actual probability. 


85-27. Suppose X has an exponential distribution with 
mean 20. Use Chebyshev's rule to bound the probability that 
X differs from its mean by more than two standard deviations 
and by more than three standard deviations and compare to the 
actual probabilities. 


85-28. Suppose X has a Poisson distribution with mean À = 4. 
Use Chebyshev’s rule to bound the probability that X differs from 
its mean by more than two standard deviations and by more than 
three standard deviations and compare to the actual probabilities. 
85-29. Consider the process of drilling holes in printed cir- 
cuits boards. Assume that the standard deviation of the diame- 
ters is 0.01 and that the diameters are independent. Suppose 


that the average of 500 diameters is used to estimate the 

process mean. 

(a) The probability is at least 15/16 that the measured aver- 
age is within some bound of the process mean. What 1s the 
bound? 

(b) If it 1s assumed that the diameters are normally distrib- 
uted, determine the bound such that the probability is 
15/16 that the measured average is closer to the process 
mean than the bound. 


S5-30. Prove Chebyshev's rule from the following steps. 
Define the random variable Y as follows: 


t 
Y 
0 


(a) Determine E(Y) 

(b) Show that (X — wp) = (X — py Y= c^o?Y 

(c) Using part (b), show that E| (X — i^] = co? E[Y] 

(d) Using part (c), complete the derivation of Chebyshev's 
inequality. 


if |X — p| z co 
otherwise 


Random Sampling and 


Data Description 


CHAPTER OUTLINE 


6-1 DATA SUMMARY AND DISPLAY 
62 RANDOM SAMPLING 
6-3 STEM-AND-LEAF DIAGRAMS 


6-4 FREQUENCY DISTRIBUTIONS 
AND HISTOGRAMS 


LEARNING OBJECTIVES 


6-5 
6-6 
6-7 
6-8 


BOX PLOTS 
TIME SEQUENCE PLOTS 
PROBABILITY PLOTS 


MORE ABOUT PROBABILITY 
PLOTTING (CD ONLY) 


After careful study of this chapter you should be able to do the following: 


1. Compute and interpret the sample mean, sample variance, sample standard deviation, sample me- 


dian, and sample range 


2. Explain the concepts of sample mean, sample variance, population mean, and population variance 


3. Construct and interpret visual data displays, including the stem-and-leaf display, the histogram, 


and the box plot 


. Explain the concept of random sampling 


NW NW Bb 


oriented data. 


CD MATERIAL 


. Construct and interpret normal probability plots 


. Explain how to use box plots and other data displays to visually compare two or more samples of data 


. Know how to use simple time series plots to visually display the important features of time- 


8. Interpret probability plots for distributions other than normal. 


Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e-Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e-Text immediately following the section they accompany. 


189 
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6-1 DATA SUMMARY AND DISPLAY 


Definition 


EXAMPLE 6-1 


Well-constructed data summaries and displays are essential to good statistical thinking, be- 
cause they can focus the engineer on important features of the data or provide insight about 
the type of model that should be used in solving the problem. The computer has become an 
important tool in the presentation and analysis of data. While many statistical techniques re- 
quire only a hand-held calculator, much time and effort may be required by this approach, and 
a computer will perform the tasks much more efficiently. 

Most statistical analysis is done using a prewritten library of statistical programs. The 
user enters the data and then selects the types of analysis and output displays that are of 
interest. Statistical software packages are available for both mainframe machines and 
personal computers. We will present examples of output from Minitab (one of the most 
widely-used PC packages), throughout the book. We will not discuss the hands-on use of 
Minitab for entering and editing data or using commands. This information 1s found in the 
software documentation. 

We often find it useful to describe data features numerically. For example, we can char- 
acterize the location or central tendency in the data by the ordinary arithmetic average or 
mean. Because we almost always think of our data as a sample, we will refer to the arithmetic 
mean as the sample mean. 


If the n observations in a sample are denoted by x1, x», ... , x,, the sample mean is 


Nii Ce 2 i 





Let’s consider the eight observations collected from the prototype engine connectors from 
Chapter 1. The eight observations are x, = 12.6, x; = 12.9, x4 = 13.4, x, = 12.3, x; = 13.6, 
Xs = 13.5, x, = 12.6, and x, = 13.1. The sample mean is 


8 
Xj 
XI d* 35 9e ass pe. B >, 4246 FLI ae TEST 
n 8 8 





=| 
| 


104 
uw. ^ 13.0 pounds 


A physical interpretation of the sample mean as a measure of location is shown in the dot 
diagram of the pull-off force data. See Figure 6-1. Notice that the sample mean x — 13.0 can be 
thought of as a “balance point.” That is, if each observation represents 1 pound of mass placed 
at the point on the x-axis, a fulcrum located at x would exactly balance this system of weights. 


The sample mean is the average value of all the observations in the data set. Usually, 
these data are a sample of observations that have been selected from some larger population 
of observations. Here the population might consist of all the connectors that will be manufac- 
tured and sold to customers. Recall that this type of population is called a conceptual or 


Figure 6-1 The 
sample mean as a 
balance point for a 
system of weights. 


Definition 
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Pull-off force 


hypothetical population, because it does not physically exist. Sometimes there is an actual 
physical population, such as a lot of silicon wafers produced in a semiconductor factory. 

In previous chapters we have introduced the mean of a probability distribution, denoted 
u. If we think of a probability distribution as a model for the population, one way to think 
of the mean is as the average of all the measurements in the population. For a finite popula- 
tion with N measurements, the mean 1s 





(6-2) 


The sample mean, x, is a reasonable estimate of the population mean, p.. Therefore, the engi- 
neer designing the connector using a 3/32-inch wall thickness would conclude, on the basis 
of the data, that an estimate of the mean pull-off force is 13.0 pounds. 

Although the sample mean is useful, it does not convey all of the information about a 
sample of data. The variability or scatter in the data may be described by the sample variance 
or the sample standard deviation. 


If x1, X2, ..., x, isa sample of n observations, the sample variance is 


(6-3) 


The sample standard deviation, s, 1s the positive square root of the sample variance. 


The units of measurements for the sample variance are the square of the original units of 
the variable. Thus, if x is measured in pounds, the units for the sample variance are (pounds). 
The standard deviation has the desirable property of measuring variability in the original units 
of the variable of interest, x. 


How Does the Sample Variance Measure Variability? 

To see how the sample variance measures dispersion or variability, refer to Fig. 6-2, which 
shows the deviations x; — x for the connector pull-off force data. The greater the amount of 
variability in the pull-off force data, the larger in absolute magnitude some of the deviations 
x; — x Will be. Since the deviations x; — x always sum to zero, we must use a measure of vari- 
ability that changes the negative deviations to nonnegative quantities. Squaring the deviations 
is the approach used in the sample variance. Consequently, if s? is small, there is relatively 
little variability in the data, but if s? is large, the variability is relatively large. 
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Figure 6-2 How the 

sample variance meas- 
ures variability through 
the deviations x; — x. DN CER "gr 
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EXAMPLE 6-2 Table 6-1 displays the quantities needed for calculating the sample variance and sample 
standard deviation for the pull-off force data. These data are plotted in Fig. 6-2. The 
numerator of s? is 


8 
(x; — xy = 1.60 
=] 


so the sample variance is 
, 1.60 1.60 


EE LS NEN SUN 2 
guo T j 0.2286 (pounds) 


and the sample standard deviation is 


s = V0.2286 = 0.48 pounds 


Computation of s? 

The computation of s^ requires calculation of x, n subtractions, and n squaring and adding op- 
erations. If the original observations or the deviations x; — x are not integers, the deviations 
x; — x may be tedious to work with, and several decimals may have to be carried to ensure 


Table 6-1 Calculation of Terms for the Sample Variance and Sample 
Standard Deviation 


i X; DEG (x; — x) 
1 12.6 —0.4 0.16 
2 12.9 —0.1 0.01 
3 13.4 0.4 0.16 
4 12.3 —0.7 0.49 
5 13.6 0.6 0.36 
6 13.5 0.5 0.25 
7 12.6 —0.4 0.16 
8 13.1 0.1 0.01 


104.0 0.0 1.60 


EXAMPLE 6-3 
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numerical accuracy. A more efficient computational formula for the sample variance is 
obtained as follows: 


n n 


n n 
> (x; — xy (x? + x° + 2xxj) Na + nx? — 2x Xx 
PEN Ab =l — a MEE... NN 


n—1 n—1 a=] 


~ 


and since x = (1/n) S;_, x; this last equation reduces to 


ae EE (6-4) 


Note that Equation 6-4 requires squaring each individual x,, then squaring the sum of the x;, 
subtracting ( $x;)^/n from > x?, and finally dividing by n — 1. Sometimes this is called the 
shortcut method for calculating s^ (or s). 


We will calculate the sample variance and standard deviation using the shortcut method, 
Equation 6-4. The formula gives 





H 8 1.60 
2 — — Z TT SOS 2 
S n1 7 7 0.2286 (pounds) 


and 
s = V0.2286 = 0.48 pounds 


These results agree exactly with those obtained previously. 


Analogous to the sample variance s^, the variability in the population is defined by the 
population variance (o°). As in earlier chapters, the positive square root of o^, or o, will 
denote the population standard deviation. When the population is finite and consists of N 
values, we may define the population variance as 


d 2 
= (x; — M) (6-5) 


We observed previously that the sample mean could be used as an estimate of the population 
mean. Similarly, the sample variance is an estimate of the population variance. In Chapter 7, 
we will discuss estimation of parameters more formally. 

Note that the divisor for the sample variance is the sample size minus one (n — 1), while 
for the population variance it 1s the population size N. If we knew the true value of the popu- 
lation mean p, we could find the sample variance as the average squared deviation of the sam- 
ple observations about w. In practice, the value of u is almost never known, and so the sum of 


194 CHAPTER 6 RANDOM SAMPLING AND DATA DESCRIPTION 


the squared deviations about the sample average x must be used instead. However, the obser- 
vations x; tend to be closer to their average, x, than to the population mean, u. Therefore, to 
compensate for this we use n — 1 as the divisor rather than n. If we used n as the divisor in the 
sample variance, we would obtain a measure of variability that is, on the average, consistently 
smaller than the true population variance o^. 

Another way to think about this is to consider the sample variance s? as being based on 
n — 1 degrees of freedom. The term degrees of freedom results from the fact that the n devi- 
ations x, — X, X; — X,...,X, — x always sum to zero, and so specifying the values of any 
n — 1 of these quantities automatically determines the remaining one. This was illustrated in 
Table 6-1. Thus, only n — 1 of the n deviations, x; — x, are freely determined. 

In addition to the sample variance and sample standard deviation, the sample range, or 
the difference between the largest and smallest observations, is a useful measure of variabil- 
ity. The sample range is defined as follows. 


Definition 


If the n observations in a sample are denoted by x4, x», ... , Xn, the sample range is 


r — max(x;) — min(x;) 


(6-6) 





For the pull-off force data, the sample range isr = 13.6 — 12.3 = 1.3. Generally, as the vari- 
ability in sample data increases, the sample range increases. 

The sample range is easy to calculate, but it ignores all of the information in the sample 
data between the largest and smallest values. For example, the two samples 1, 3, 5, 8, and 9 
and 1, 5, 5, 5, and 9, both have the same range (r = 8). However, the standard deviation of the 
first sample is sı = 3.35, while the standard deviation of the second sample is s) = 2.83. The 
variability is actually less in the second sample. 

Sometimes, when the sample size is small, say n « 8 or 10, the information loss associ- 
ated with the range is not too serious. For example, the range is used widely in statistical qual- 
ity control where sample sizes of 4 or 5 are fairly common. We will discuss some of these 
applications in Chapter 16. 


EXERCISES FOR SECTIONS 6-1 AND 6-2 


and 6890. Calculate the sample mean and sample standard de- 
viation. Construct a dot diagram of the data. 


6-1. Eight measurements were made on the inside diameter 
of forged piston rings used in an automobile engine. The data 
(in millimeters) are 74.001, 74.003, 74.015, 74.000, 74.005, 
74.002, 74.005, and 74.004. Calculate the sample mean and 
sample standard deviation, construct a dot diagram, and com- 


6-4. An article in the Journal of Structural Engineering 
(Vol. 115, 1989) describes an experiment to test the yield 
strength of circular tubes with caps welded to the ends. The 


ment on the data. 


6-2. In Applied Life Data Analysis (Wiley, 1982), Wayne 
Nelson presents the breakdown time of an insulating fluid be- 
tween electrodes at 34 kV. The times, in minutes, are as fol- 
lows: 0.19, 0.78, 0.96, 1.31, 2.78, 3.16, 4.15, 4.67, 4.85, 6.50, 
7.35, 8.01, 8.27, 12,06, 31.75, 32.52, 33.91, 36.71, and 72.89. 
Calculate the sample mean and sample standard deviation. 


6-3. The January 1990 issue of Arizona Trend contains a 
supplement describing the 12 “best” golf courses in the state. 
The yardages (lengths) of these courses are as follows: 6981, 
7099, 6930, 6992, 7518, 7100, 6935, 7518, 7013, 6800, 7041, 


first yields (in kN) are 96, 96, 102, 102, 102, 104, 104, 108, 
126, 126, 128, 128, 140, 156, 160, 160, 164, and 170. 
Calculate the sample mean and sample standard deviation. 
Construct a dot diagram of the data. 


6-5. An article in Human Factors (June 1989) presented 
data on visual accommodation (a function of eye movement) 
when recognizing a speckle pattern on a high-resolution CRT 
screen. The data are as follows: 36.45, 67.90, 38.77, 42.18, 
26.72, 50.77, 39.30, and 49.71. Calculate the sample mean 
and sample standard deviation. Construct a dot diagram of the 
data. 


6-6. The following data are direct solar intensity measure- 
ments (watts/m?) on different days at a location in southern 
Spain: 562, 869, 708, 775, 775, 704, 809, 856, 655, 806, 
878, 909, 918, 558, 768, 870, 918, 940, 946, 661, 820, 898, 
935, 052, 057, 693, 835, 905, 939, 955, 060, 498, 653, 730, 
and 753. Calculate the sample mean and sample standard 
deviation. 


6-7. The April 22, 1991 issue of Aviation Week and Space 
Technology reports that during Operation Desert Storm, U.S. 
Air Force F-117A pilots flew 1270 combat sorties for a total of 
6905 hours. What is the mean duration of an F-117A mission 
during this operation? Why is the parameter you have calcu- 
lated a population mean? 


6-8. Preventing fatigue crack propagation in aircraft struc- 

tures is an important element of aircraft safety. An engineering 

study to investigate fatigue crack in n = 9 cyclically loaded 

wing boxes reported the following crack lengths (in mm): 

2.13, 2.96, 3.02, 1.82, 1.15, 1.37, 2.04, 2.47, 2.60. 

(a) Calculate the sample mean. 

(b) Calculate the sample variance and sample standard 
deviation. 

(c) Prepare a dot diagram of the data. 


6-9. Consider the solar intensity data in Exercise 6-6. 
Prepare a dot diagram of this data. Indicate where the sample 
mean falls on this diagram. Give a practical interpretation of 
the sample mean. 


6-10. Exercise 6-5 describes data from an article in Human 

Factors on visual accommodation from an experiment involv- 

ing a high-resolution CRT screen. 

(a) Construct a dot diagram of this data. 

(b) Data from a second experiment using a low-resolution 
screen were also reported in the article. They are 8.85, 
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35.80, 26.53, 64.63, 9.00, 15.38, 8.14, and 8.24. Prepare a 
dot diagram for this second sample and compare it to the 
one for the first sample. What can you conclude about 
CRT resolution in this situation? 


6-11. The pH of a solution is measured eight times by one 

operator using the same instrument. She obtains the following 

data: 7.15, 7.20, 7.18, 7.19, 7.21, 7.20, 7.16, and 7.18. 

(a) Calculate the sample mean. 

(b) Calculate the sample variance and sample standard 
deviation. 

(c) Whatare the major sources of variability in this experiment? 


6-12. An article in the Journal of Aircraft (1988) describes 
the computation of drag coefficients for the NASA 0012 air- 
foil. Different computational algorithms were used at 
M. = 0.7 with the following results (drag coefficients are in 
units of drag counts; that is, one count is equivalent to a drag 
coefficient of 0.0001): 79, 100, 74, 83, 81, 85, 82, 80, and 84. 
Compute the sample mean, sample variance, and sample stan- 
dard deviation, and construct a dot diagram. 


6-13. The following data are the joint temperatures of the 
O-rings (°F) for each test firing or actual launch of the space 
shuttle rocket motor (from Presidential Commission on the 

Space Shuttle Challenger Accident, Vol. 1, pp. 129—131): 

84, 49, 61, 40, 83, 67, 45, 66, 70, 69, 80, 58, 68, 60, 67, 72, 

73, 70, 57, 63, 70, 78, 52, 67, 53, 67, 75, 61, 70, 81, 76, 79, 

75,76, 58,31. 

(a) Compute the sample mean and sample standard deviation. 

(b) Construct a dot diagram of the temperature data. 

(c) Set aside the smallest observation (31°F) and recompute 
the quantities in part (a). Comment on your findings. 
How “different” are the other temperatures from this 
last value? 


Definition 


In most statistics problems, we work with a sample of observations selected from the popula- 
tion that we are interested in studying. Figure 6-3 illustrates the relationship between the pop- 
ulation and the sample. We have informally discussed these concepts before; however, we 
now give the formal definitions of some of these terms. 


A population consists of the totality of the observations with which we are concerned. 





In any particular problem, the population may be small, large but finite, or infinite. The 
number of observations in the population is called the size of the population. For example, the 
number of underfilled bottles produced on one day by a soft-drink company is a population of 
finite size. The observations obtained by measuring the carbon monoxide level every day is a 
population of infinite size. We often use a probability distribution as a model for a popula- 
tion. For example, a structural engineer might consider the population of tensile strengths of a 
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Figure 6-3 Relation- 
ship between a popula- 
tion and a sample. 


Definition 


Population 


TN 


Oo 


Sample (x, Xo, x,..., x,) 


BN x, sample average 
s, sample standard 


deviation 
Histogram 
x E 
À 
x | x 
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chassis structural element to be normally distributed with mean p and variance o?. We could 
refer to this as a normal population or a normally distributed population. 

In most situations, it is impossible or impractical to observe the entire population. For ex- 
ample, we could not test the tensile strength of all the chassis structural elements because it 
would be too time consuming and expensive. Furthermore, some (perhaps many) of these 
structural elements do not yet exist at the time a decision 1s to be made, so to a large extent, 
we must view the population as conceptual. Therefore, we depend on a subset of observations 
from the population to help make decisions about the population. 


A sample is a subset of observations selected from a population. 





For statistical methods to be valid, the sample must be representative of the population. It 
is often tempting to select the observations that are most convenient as the sample or to exer- 
cise judgment in sample selection. These procedures can frequently introduce bias into the 
sample, and as a result the parameter of interest will be consistently underestimated (or over- 
estimated) by such a sample. Furthermore, the behavior of a judgment sample cannot be statis- 
tically described. To avoid these difficulties, it is desirable to select a random sample as the 
result of some chance mechanism. Consequently, the selection of a sample is a random exper- 
iment and each observation in the sample is the observed value of a random variable. The 
observations in the population determine the probability distribution of the random variable. 

To define a random sample, let X be a random variable that represents the result of one se- 
lection of an observation from the population. Let f(x) denote the probability density function 
of X. Suppose that each observation in the sample is obtained independently, under unchanging 
conditions. That is, the observations for the sample are obtained by observing X independently 
under unchanging conditions, say, n times. Let X; denote the random variable that represents 
the ith replicate. Then, Xj, X5, ..., X, 1s a random sample and the numerical values obtained 
are denoted as x4, x5, ... , x,. The random variables in a random sample are independent with 
the same probability distribution f(x) because of the identical conditions under which each 
observation is obtained. That is, the marginal probability density function of X, X5, ... , X, 1s 
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f(x), f(X2), ... , (x4), respectively, and by independence the joint probability density function 
of the random sample is fy x, .. x (x1, X25 ..., Xn) = f(x) f(%2) ...f(x,). 


Definition 
The random variables Xj, X5, ..., X, are a random sample of size n if (a) the X;s are 


independent random variables, and (b) every X; has the same probability distribution. 





To illustrate this definition, suppose that we are investigating the effective service life of 
an electronic component used in a cardiac pacemaker and that component life is normally dis- 
tributed. Then we would expect each of the observations on component life Xj, X5, ... , X, in 
a random sample of n components to be independent random variables with exactly the same 
normal distribution. After the data are collected, the numerical values of the observed life- 
times are denoted as X4, X2, «« , Xp 

The primary purpose in taking a random sample is to obtain information about the unknown 
population parameters. Suppose, for example, that we wish to reach a conclusion about the pro- 
portion of people in the United States who prefer a particular brand of soft drink. Let p represent 
the unknown value of this proportion. It is impractical to question every individual in the popula- 
tion to determine the true value of p. In order to make an inference regarding the true proportion 
p, a more reasonable procedure would be to select a random sample (of an appropriate size) and 
use the observed proportionp of people in this sample favoring the brand of soft drink. 

The sample proportion, p is computed by dividing the number of individuals in the sam- 
ple who prefer the brand of soft drink by the total sample size n. Thus, p is a function of the 
observed values in the random sample. Since many random samples are possible from a pop- 
ulation, the value of p will vary from sample to sample. That is, p is a random variable. Such 
a random variable is called a statistic. 


Definition 


A statistic 1s any function of the observations in a random sample. 





We have encountered statistics before. For example, if X, X5, ..., X, 1s a random sample of 
size n, the sample mean X, the sample variance S^, and the sample standard deviation S are 
statistics. 

Although numerical summary statistics are very useful, graphical displays of sample data 
are a very powerful and extremely useful way to visually examine the data. We now present a few 
of the techniques that are most relevant to engineering applications of probability and statistics. 


6-3 STEM-AND-LEAF DIAGRAMS 


The dot diagram is a useful data display for small samples, up to (say) about 20 observations. 
However, when the number of observations is moderately large, other graphical displays may 
be more useful. 

For example, consider the data in Table 6-2. These data are the compressive strengths in 
pounds per square inch (psi) of 80 specimens of a new aluminum-lithium alloy undergoing eval- 
uation as a possible material for aircraft structural elements. The data were recorded in the order 
of testing, and in this format they do not convey much information about compressive strength. 
Questions such as “What percent of the specimens fail below 120 psi?" are not easy to answer. 
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Steps for 
Constructing a Stem- 
and-Leaf Diagram 


EXAMPLE 6-4 


Table 6-2 Compressive Strength (in psi) of 80 Aluminum-Lithium Alloy Specimens 


105 221 183 186 121 181 180 143 
97 154 153 174 120 168 167 141 
245 228 174 199 181 158 176 110 
163 131 154 115 160 208 158 133 
207 180 190 193 194 133 156 123 
134 178 76 167 184 135 229 146 
218 157 101 171 165 172 158 169 
199 151 142 163 145 171 148 158 
160 175 149 87 160 237 150 135 


196 201 200 176 150 170 118 149 


Because there are many observations, constructing a dot diagram of these data would be rela- 
tively inefficient; more effective displays are available for large data sets. 

A stem-and-leaf diagram is a good way to obtain an informative visual display of a data 
set x1, X2, ..., X,, Where each number x; consists of at least two digits. To construct a stem- 
and-leaf diagram, use the following steps. 


Divide each number x; into two parts: a stem, consisting of one or more of the 
leading digits and a leaf, consisting of the remaining digit. 


List the stem values in a vertical column. 


Record the leaf for each observation beside its stem. 


Write the units for stems and leaves on the display. 





To illustrate, if the data consist of percent defective information between 0 and 100 on 
lots of semiconductor wafers, we can divide the value 76 into the stem 7 and the leaf 6. In gen- 
eral, we should choose relatively few stems in comparison with the number of observations. 
It is usually best to choose between 5 and 20 stems. 


To illustrate the construction of a stem-and-leaf diagram, consider the alloy compressive 
strength data in Table 6-2. We will select as stem values the numbers 7, 8, 9, ..., 24. The 
resulting stem-and-leaf diagram is presented in Fig. 6-4. The last column in the diagram is a 
frequency count of the number of leaves associated with each stem. Inspection of this display 
immediately reveals that most of the compressive strengths lie between 110 and 200 psi and 
that a central value is somewhere between 150 and 160 psi. Furthermore, the strengths are dis- 
tributed approximately symmetrically about the central value. The stem-and-leaf diagram 
enables us to determine quickly some important features of the data that were not immediately 
obvious in the original display in Table 6-2. 


In some data sets, it may be desirable to provide more classes or stems. One way to do this 
would be to modify the original stems as follows: Divide the stem 5 (say) into two new stems, 
5L and 5U. The stem 5L has leaves 0, 1, 2, 3, and 4, and stem 5U has leaves 5, 6, 7, 8, and 9. 
This will double the number of original stems. We could increase the number of original stems 
by four by defining five new stems: 5z with leaves 0 and 1, 5t (for twos and three) with leaves 
2 and 3, 5f (for fours and fives) with leaves 4 and 5, 5s (for six and seven) with leaves 6 and 
7, and 5e with leaves 8 and 9. 
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Stem Leaf Frequency 
7 6 
8 7 | 
9 d 1 
10 51 2 
11 580 3 
12 103 3 
13 413535 6 
14 29583169 8 
15 471340886808 12 
16 3073050879 10 
17 8544162106 10 
18 0361410 7 
19 960934 6 
20 7108 d 
21 8 l! 
Figure 6-4 Stem- 25 189 3 
and-leaf diagram for 33 7 1 
the compressive 24 5 1 
strength data in Table 
6-2. Stem : Tens and hundreds digits (psi); Leaf: Ones digits (psi) 


EXAMPLE 6-5 Figure 6-5 illustrates the stem-and-leaf diagram for 25 observations on batch yields from a 
chemical process. In Fig. 6-5(a) we have used 6, 7, 8, and 9 as the stems. This results in too 
few stems, and the stem-and-leaf diagram does not provide much information about the data. 
In Fig. 6-5(b) we have divided each stem into two parts, resulting in a display that more 





Stem Leaf Stem Leaf Stem Leaf 
6 134556 6L 134 
7 011357889 6U 556 
8 1344788 7L 0113 
9 233 TU 57889 
(a) 8L 1344 
8U 788 
9L 23 
9U 5 
(b) Ts 7 
Te 889 
8z l 
at 3 
8f 44 
8s 7 
8e 88 
9z 
Figure 6-5 Stem- ot 23 
and-leaf displays for 9f 5 
Example 6-5. Stem: 9s 
Tens digits. Leaf: 9e 


Ones digits. (c) 


200 


Figure 6-6 A stem- 
and-leaf diagram from 
Minitab. 


CHAPTER 6 RANDOM SAMPLING AND DATA DESCRIPTION 


Character Stem-and-Leaf Display 


Stem-and-leaf of Strength 
N=80 Leaf Unit = 1.0 


l 7 6 
2 8 f 

3 9 7 

5 10 15 

8 11 058 

11 12 013 

17 13 133455 

25 14 12356899 

37 15 001344678888 
(10) — 16 0003357789 
33 17 0112445668 
23 18 0011346 

16 19 034699 

10 20 0178 

6 21 8 

5 22 189 

2 23 7 

1 24 5 


adequately displays the data. Figure 6-5(c) illustrates a stem-and-leaf display with each stem 
divided into five parts. There are too many stems in this plot, resulting in a display that does 
not tell us much about the shape of the data. 


Figure 6-6 shows a stem-and-leaf display of the compressive strength data in Table 6-2 
produced by Minitab. The software uses the same stems as in Fig. 6-4. Note also that the com- 
puter orders the leaves from smallest to largest on each stem. This form of the plot is usually 
called an ordered stem-and-leaf diagram. This is not usually done when the plot is con- 
structed manually because it can be time consuming. The computer adds a column to the left of 
the stems that provides a count of the observations at and above each stem in the upper half of 
the display and a count of the observations at and below each stem in the lower half of the dis- 
play. At the middle stem of 16, the column indicates the number of observations at this stem. 

The ordered stem-and-leaf display makes it relatively easy to find data features such as per- 
centiles, quartiles, and the median. The sample median is a measure of central tendency that 
divides the data into two equal parts, half below the median and half above. If the number of 
observations is even, the median is halfway between the two central values. From Fig. 6-6 
we find the 40th and 41st values of strength as 160 and 163, so the median is 
(160 + 163)/2 = 161.5. If the number of observations is odd, the median is the central value. 
The sample mode is the most frequently occurring data value. Figure 6-6 indicates that the mode 
is 158; this value occurs four times, and no other value occurs as frequently in the sample. 

We can also divide data into more than two parts. When an ordered set of data 1s divided 
into four equal parts, the division points are called quartiles. The first or lower quartile, q,, is 
a value that has approximately 25% of the observations below it and approximately 75% of 
the observations above. The second quartile, q), has approximately 50% of the observations 
below its value. The second quartile is exactly equal to the median. The third or upper quar- 
tile, q4, has approximately 75% of the observations below its value. As in the case of the 
median, the quartiles may not be unique. The compressive strength data in Fig. 6-6 contains 
n = 80 observations. Minitab software calculates the first and third quartiles as the (n + 1)/4 
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Table 6-3 Summary Statistics for the Compressive Strength Data from Minitab 


Variable N Mean Median StDev SE Mean 
80 162.66 161.50 395. 77 3.78 
Min Max QI Q3 
76.00 245.00 143.50 181.00 


and 3(m + 1)/A ordered observations and interpolates as needed. For example, 
(80 + 1)/4 = 20.25 and 3(80 + 1)/4 = 60.75. Therefore, Minitab interpolates between the 
20th and 21st ordered observation to obtain q} = 143.50 and between the 60th and 61st ob- 
servation to obtain q} = 181.00. In general, the 100Kth percentile is a data value such that ap- 
proximately 100k% of the observations are at or below this value and approximately 
100(1 — k)% of them are above it. Finally, we may use the interquartile range, defined as 
IQR = q} — q,, as a measure of variability. The interquartile range is less sensitive to the ex- 
treme values in the sample than is the ordinary sample range. 

Many statistics software packages provide data summaries that include these quantities. 
The output obtained for the compressive strength data in Table 6-2 from Minitab is shown in 
Table 6-3. 


EXERCISES FOR SECTION 6-3 


6-14. An article in Technometrics (Vol. 19, 1977, p. 425) (b) Does it appear likely that a coupon will “survive” beyond 
presents the following data on the motor fuel octane ratings of 2000 cycles? Justify your answer. 


several blends of gasoline: 


88.5 
94.7 
84.3 
90.1 
89.0 
89.8 
91.6 
90.3 
90.0 
91.5 
89.9 


Construct a stem-and-leaf display for these data. 
6-15. The following data are the numbers of cycles to fail- 


98.8 
88.3 
90.4 
91.2 
90.6 
92.2 
87.7 
91.1 
86.7 
93.4 
96.1 


89.6 
90.4 
91.6 
90.7 
88.6 
88.3 
94.2 
85.3 
90.1 
89.3 
91.1 


6-16. The percentage of cotton in material used to manufac- 
ture men’s shirts follows. Construct a stem-and-leaf display 


92.2 92.7 88.4  À 87.5 90.9 for the data. 


83.4 $87.9 926 87.8 89.9 


910 930 0937 883 918 34.2 37.8 33.6 32.6 33.8 35.8 34.7 34.6 
382 944 965 892 3897 33.1 366 347 33.1 342 37.6 33.6 33.6 
885 904 843 923 922 345 235.4 35.0 346 334 373 32.5 34.1 
933 912 932 889 35.6 346 354 35.9 347 346 34.1 34.7 
374 867 886 898 363 33.8 362 34.7 346 35.5 35.1 35.7 
911 942 887 927 35.1 37.1 368 33.6 352 32.8 368 36.8 
905 908 92.7 933 347 340 35.1 32.9 35.0 32.1 2379 34.3 
100.3 901 $93 867 33.6 341 35.3 233.5 349 2345 364 32.7 
87.6 91.8 91.0 91.0 6-17. The following data represent the yield on 90 consecu- 


tive batches of ceramic substrate to which a metal coating has 
been applied by a vapor-deposition process. Construct a stem- 
and-leaf display for these data. 


ure of aluminum test coupons subjected to repeated alternat- 


ing stress at 21,000 psi, 18 cycles per second: 


1115 
1310 
1540 
1502 
1258 
1315 
1085 

798 
1020 


865 
2130 
1421 
1109 
1481 
1567 
1883 
1203 
1270 


1015 
845 
1674 
1016 
1102 
1605 
706 
2215 
785 


94.1 86.1 953 849 888 846 944 84.1 
93.2 904 941 783 864 83.6 96.1 83.7 
90.6 89.1 978 89.6 85.1 85.4 980 82.9 
91.4 87.3 93.1 90.3 840 897 85.4 87.3 
88.2 841 864 93.1 093.7 87.6 866 86.4 
86.1 90.1 876 946 387.7 85.1 91.7 84.5 
95.1 95.2 941 963 906 896 $75 
90.0 86.1 92.1 94.7 894 90.0 $842 
924 943 964 91.1 886 90.1 85.1 
87.3 93.2 882 924 841 943 90.5 
86.6 86.7 864 90.6 826 97.3 95.6 


885 1594 1000 1416 1501 
1223 2023 1820 1560 1238 
375 1315 1940 1055 990 
2265 1269 1120 1764 1468 
1910 1260 910 1330 1512 
1018 1888 1730 1608 1750 
1452 1782 1102 1535 1642 
1890 1522 1578 1781 
2100 1792 758 1750 


(a) Construct a stem-and-leaf display for these data. 91.2 83.0 85.0 89.1 83.1 96.8 883 


a ( (UN 


@ 
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6-18. Find the median and the quartiles for the motor fuel 
octane data in Exercise 6-14. 

6-19. Find the median and the quartiles for the failure data 
in Exercise 6-15. 

6-20. Find the median, mode, and sample average of the 
data in Exercise 6-16. Explain how these three measures of 
location describe different features in the data. 

6-21. Find the median and the quartiles for the yield data in 
Exercise 6-19. 

6-22. The female students in an undergraduate engineering 
core course at ASU self-reported their heights to the nearest 
inch. The data are 


62 64 66 67 


68 64 66 68 
69 65 70 65 


65 68 61 65 67 65 64 63 67 
69 65 67 62 66 68 67 66 65 
67 68 65 63 64 67 67 


(a) Calculate the 
height. 

(b) Construct a stem-and-leaf diagram for the height data and 
comment on any important features that you notice. 

(c) What is the median height of this group of female engi- 
neering students? 

6-23. The shear strengths of 100 spot welds in a titantum 

alloy follow. Construct a stem-and-leaf diagram for the weld 

strength data and comment on any important features that you 

notice. 


sample mean and standard deviation of 
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distance test 1s conducted by hitting balls with a driver swung 
by a mechanical device nicknamed “Iron Byron” after the 
legendary great Byron Nelson, whose swing the machine is said 
to emulate. Following are 100 distances (in yards) achieved by 
a particular brand of golf ball in the overall distance test. 
Construct a stem-and-leaf diagram for this data and comment 
on any important features that you notice. Compute the sample 
mean, sample standard deviation, and the sample median. 


261.3 
258.1 
254.2 
257.1 
254,9 
255.9 
241.4 
244.3 
241.2 
254.4 
256.8 
295.9 
255.0 


259.4 
2770.5 
230.7 
272.6 
274.0 
260.7 
260.6 
212.2 
251 
232.1 
273.0 
266.6 
2123.2 


265.7 
252.1 
29352 
255.7 
264.5 
245.5 
280.3 
248.3 
267.0 
Z5 
240.8 
250.2 
251.4 


2770.6 
268.9 
263.5 
262.2 
244.8 
249.6 
212.7 
278.7 
273.4 
242.9 
276.6 
255.8 
276.1 


274.2 
267.4 
244.5 
252.0 
264.0 
237.8 
261.0 
236.0 
247.7 
273.6 
264.5 
285.3 
ATT.S 


261.4 
253.6 
251.8 
280.3 
268.3 
278.5 
260.0 
211.2 
254.8 
256.1 
264.5 
255.4 
266.8 


254.5 
234.3 
239.5 
274.9 
272.1 
213. 
279.3 
279.8 
272.8 
251.6 
226.8 
240.5 
268.5 


283.7 
263.2 
2545 
233.7 
260.2 
263.7 
252.1 
245.6 
270.5 


6-26. A semiconductor manufacturer produces devices used 


5408 
5420 
5407 
5463 
5399 
5445 
5381 
5401 
5458 
5383 


6-24. 


5431 
5429 
5469 
5408 
5391 
5436 
5425 
5411 
5485 
5401 


5475 
5401 
5416 
5481 
5477 
5454 
5388 
3399 
5431 
5407 


5442 
5446 
5377 
5453 
5447 
5453 
5388 
5431 
5416 
5385 


5376 
5487 
5454 
5422 
5329 
5428 
5378 
5440 
5431 
5440 


5388 
5416 
5375 
5354 
5473 
5418 
5481 
5413 
5390 
5422 


5459 
3202 
5409 
5421 
5423 
5465 
5387 
5406 
3999 
5448 


5422 
5357 
5459 
5406 
5441 
5427 
5440 
5342 
5435 
5366 


5416 
5388 
5445 
5444 
5412 
5421 
5482 
5452 
5387 
5430 


5435 
5457 
5429 
5466 
5384 
5396 
5406 
5420 
5462 
5418 


An important quality characteristic of water is the con- 
centration of suspended solid material. Following are 60 meas- 


as central processing units in personal computers. The speed of 
the device (in megahertz) is important because it determines the 
price that the manufacturer can charge for the devices. The fol- 
lowing table contains measurements on 120 devices. Construct a 
stem-and-leaf diagram for this data and comment on any impor- 
tant features that you notice. Compute the sample mean, sample 
standard deviation, and the sample median. What percentage of 
the devices has a speed exceeding 700 megahertz? 


680 669 719 699 670 710 722 663 658 634 720 690 


677 
649 
681 
652 


669 
675 
679 
720 
121 
652 
723 


700 
701 
691 
660 
653 
664 
710 


718 
721 
683 
695 
637 
702 
680 


690 
683 
705 
701 
660 
661 
684 


681 
735 
746 
724 
693 
720 
705 


702 
688 
706 
668 
679 
695 
681 


696 
763 
649 
698 
682 
670 
748 


692 
672 
668 
668 
724 
656 
697 


690 
698 
672 
660 
642 
718 
703 


694 
659 
690 
680 
704 
660 
660 


660 
704 
724 
799 
695 
648 
722 


urements on suspended solids from a certain lake. Construct a 
stem-and-leaf diagram for this data and comment on any im- 
portant features that you notice. Compute the sample mean, 
sample standard deviation, and the sample median. 


6-25. The United States Golf Association tests golf balls to 
ensure that they conform to the rules of golf. Balls are tested for 
weight, diameter, roundness, and overall distance. The overall 


644 
715 


683 
665 


685 
663 


695 
676 


678 
665 


674 
675 


656 
655 


667 
659 


683 
720 


691 
675 


680 
697 


6-27. A group of wine enthusiasts taste-tested a pinot noir 
wine from Oregon. The evaluation was to grade the wine on a 
0 to 100 point scale. The results follow: 


94 90 92 91 09] 86 89 91 91 90 
900 93 87 90 9] 92 89 86 89 90 
88 95 91 88 89 92 87 89 95 92 
85 91 85 89 88 84 85 90 090 83 


(a) Construct a stem-and-leaf diagram for this data and com- 
ment on any important features that you notice. 
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(b) Compute the sample mean, sample standard deviation, 
and the sample median. 

(c) A wine rated above 90 is considered truly exceptional. 
What proportion of the taste-tasters considered this partic- 
ular pinot noir truly exceptional? 

6-28. In their book /ntroduction to Linear Regression 

Analysis (3rd edition, Wiley, 2001) Montgomery, Peck, and 

Vining present measurements on NbOCI, concentration from 

a tube-flow reactor experiment. The data, in gram — mole per 

liter X 10 5, are as follows: 


450 450 473 507 457 452 453 1215 1256 


1145 1085 1066 1111 1364 1254 1396 1575 1617 
1735. 2753. 3186 3227 3469 1911 2588 2635 2725 


(a) Construct a stem-and-leaf diagram for this data and com- 
ment on any important features that you notice. 
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(b) Compute the sample mean, sample standard deviation, 
and the sample median. 


6-29. A Comparative Stem-and-Leaf Diagram. In 
Exercise 6-22, we presented height data that was self-reported 
by female undergraduate engineering students in a core course 
at ASU. In the same class, the male students self-reported their 
heights as follows: 


69 67 69 70 65 68 69 70 71 69 66 67 69 75 68 67 68 
69 70 7] 72 68 69 69 70 71 68 72 69 69 68 69 73 70 
73 68 69 71 67 68 65 68 68 69 70 74 71 69 70 69 


(a) Construct a comparative stem-and-leaf diagram by listing 
the stems in the center of the display and then placing the 
female leaves on the left and the male leaves on the right. 

(b) Comment on any important features that you notice in this 
display. 


6-4 FREQUENCY DISTRIBUTIONS AND HISTOGRAMS 


A frequency distribution is a more compact summary of data than a stem-and-leaf diagram. 
To construct a frequency distribution, we must divide the range of the data into intervals, which 
are usually called class intervals, cells, or bins. If possible, the bins should be of equal width 
in order to enhance the visual information in the frequency distribution. Some judgment must 
be used in selecting the number of bins so that a reasonable display can be developed. The num- 
ber of bins depends on the number of observations and the amount of scatter or dispersion in 
the data. A frequency distribution that uses either too few or too many bins will not be inform- 
ative. We usually find that between 5 and 20 bins is satisfactory in most cases and that the num- 
ber of bins should increase with n. Choosing the number of bins approximately equal to the 
square root of the number of observations often works well in practice. 

A frequency distribution for the comprehensive strength data in Table 6-2 is shown in 
Table 6-4. Since the data set contains 80 observations, and since V80 = 9, we suspect that 
about eight to nine bins will provide a satisfactory frequency distribution. The largest and 
smallest data values are 245 and 76, respectively, so the bins must cover a range of at least 
245 — 76 — 169 units on the psi scale. If we want the lower limit for the first bin to begin 
slightly below the smallest data value and the upper limit for the last bin to be slightly above 
the largest data value, we might start the frequency distribution at 70 and end it at 250. This is 
an interval or range of 180 psi units. Nine bins, each of width 20 psi, give a reasonable 
frequency distribution, so the frequency distribution in Table 6-4 1s based on nine bins. 

The second row of Table 6-4 contains a relative frequency distribution. The relative 
frequencies are found by dividing the observed frequency in each bin by the total number of 


Table 6-4 Frequency Distribution for the Compressive Strength Data in Table 6-2 


Class 


oss 90 90 sse OO 130 0 sse 00 = 0 


Ty OSs 0 30 0S = 50 


Frequency 2 3 6 14 22 17 10 4 2 
Relative 

frequency 0.0250 0.0375 0.0750 0.1750 0.2750 0.2125 0.1250 0.0500 0.0250 
Cumulative 

relative 

frequency 0.0250 0.0625 0.1375 0.3125 0.5875 0.8000 0.9250 0.9750 1.0000 
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Constructing a 
Histogram (Equal 
Bin Widths) 


Figure 6-7  Histogram 
of compressive strength 
for 80 aluminum- 
lithium alloy 
specimens. 


observations. The last row of Table 6-4 expresses the relative frequencies on a cumulative ba- 
sis. Frequency distributions are often easier to interpret than tables of data. For example, from 
Table 6-4 it is very easy to see that most of the specimens have compressive strengths between 
130 and 190 psi and that 97.5 percent of the specimens fail below 230 psi. 

The histogram is a visual display of the frequency distribution. The stages for construct- 
ing a histogram follow. 


Label the bin (class interval) boundaries on a horizontal scale. 
Mark and label the vertical scale with the frequencies or the relative 


frequencies. 
Above each bin, draw a rectangle where height is equal to the frequency (or rel- 
ative frequency) corresponding to that bin. 





Figure 6-7 is the histogram for the compression strength data. The histogram, like the stem- 
and-leaf diagram, provides a visual impression of the shape of the distribution of the meas- 
urements and information about the central tendency and scatter or dispersion in the data. 
Notice the symmetric, bell-shaped distribution of the strength measurements in Fig. 6-7. This 
display often gives insight about possible choices of probability distribution to use as a model 
for the population. For example, here we would likely conclude that the normal distribution 
is a reasonable model for the population of compression strength measurements. 

Sometimes a histogram with unequal bin widths will be employed. For example, if the 
data have several extreme observations or outliers, using a few equal-width bins will result 
in nearly all observations falling in just of few of the bins. Using many equal-width bins will 
result in many bins with zero frequency. A better choice is to use shorter intervals in the region 
where most of the data falls and a few wide intervals near the extreme observations. When the 
bins are of unequal width, the rectangle's area (not its height) should be proportional to the 
bin frequency. This implies that the rectangle height should be 


bin frequency 


Rectangle height — "-— 
$ E bin width 

In passing from either the original data or stem-and-leaf diagram to a frequency distribu- 
tion or histogram, we have lost some information because we no longer have the individual 
observations. However, this information loss is often small compared with the conciseness 
and ease of interpretation gained in using the frequency distribution and histogram. 
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Strength Strength 
Figure 6-8 A histogram of the compressive strength Figure 6-9 A histogram of the compressive strength 
data from Minitab with 17 bins. data from Minitab with nine bins. 


Figure 6-10 A 
cumulative distribution 
plot of the compressive 
strength data from 
Minitab. 


Figure 6-8 shows a histogram of the compressive strength data from Minitab. The “de- 
fault" settings were used in this histogram, leading to 17 bins. We have noted that histograms 
may be relatively sensitive to the number of bins and their width. For small data sets, his- 
tograms may change dramatically in appearance if the number and/or width of the bins 
changes. Histograms are more stable for larger data sets, preferably of size 75 to 100 or more. 
Figure 6-9 shows the Minitab histogram for the compressive strength data with nine bins. This 
is similar to the original histogram shown in Fig. 6-7. Since the number of observations is 
moderately large (n = 80), the choice of the number of bins is not especially important, and 
both Figs. 6-8 and 6-9 convey similar information. 

Figure 6-10 shows a variation of the histogram available in Minitab, the cumulative fre- 
quency plot. In this plot, the height of each bar is the total number of observations that are less 
than or equal to the upper limit of the bin. Cumulative distributions are also useful in data in- 
terpretation; for example, we can read directly from Fig. 6-10 that there are approximately 70 
observations less than or equal to 200 psi. 

When the sample size is large, the histogram can provide a reasonably reliable indicator of 
the general shape of the distribution or population of measurements from which the sample 
was drawn. Figure 6-11 presents three cases. The median is denoted as x. Generally, if the data 
are symmetric, as in Fig. 6-11(5), the mean and median coincide. If, in addition, the data have 
only one mode (we say the data are unimodal), the mean, median, and mode all coincide. If the 
data are skewed (asymmetric, with a long tail to one side), as in Fig. 6-11(a) and (c), the mean, 
median, and mode do not coincide. Usually, we find that mode < median < mean if the 
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Figure 6-11 
Histograms for sym- 
metric and skewed dis- 
tributions. 


EXAMPLE 6-6 


~ ~ — 


ZUGE x X X 
x 
Negative or left skew Symmetric Positive or right skew 
(a) (b) (c) 


distribution is skewed to the right, whereas mode > median > mean if the distribution is 
skewed to the left. 

Frequency distributions and histograms can also be used with qualitative or categorical 
data. In some applications there will be a natural ordering of the categories (such as freshman, 
sophomore, junior, and senior), whereas in others the order of the categories will be arbitrary 
(such as male and female). When using categorical data, the bins should have equal width. 


Figure 6-12 presents the production of transport aircraft by the Boeing Company in 1985. No- 
tice that the 737 was the most popular model, followed by the 757, 747, 767, and 707. 


A chart of occurrences by category (in which the categories are ordered by the number of 
occurrences) is sometimes referred to as a Pareto chart. See Exercise 6-41. 

In this section we have concentrated on descriptive methods for the situation in which each 
observation in a data set is a single number or belongs to one category. In many cases, we work 
with data in which each observation consists of several measurements. For example, in a gasoline 
mileage study, each observation might consist of a measurement of miles per gallon, the size of 
the engine in the vehicle, engine horsepower, vehicle weight, and vehicle length. This is an ex- 
ample of multivariate data. In later chapters, we will discuss analyzing this type of data. 


EXERCISES FOR SECTION 6-4 


6-30. Construct a frequency distribution and histogram for 


the motor fuel octane data from Exercise 6-14. Use eight bins. the cotton content data in Exercise 6-16. 


6-31. Construct a frequency distribution and histogram us- 6-33. Construct a frequency distribution and histogram for 
ing the failure data from Exercise 6-15. the yield data in Exercise 6-17. 


Figure 6-12 

Airplane production in 
1985. (Source: Boeing 
Company.) 
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6-34. Construct a frequency distribution and histogram with 
16 bins for the motor fuel octane data in Exercise 6-14. Compare 
its shape with that of the histogram with eight bins from Exercise 
6-30. Do both histograms display similar information? 


6-35. Construct a histogram for the female student height 
data in Exercise 6-22. 

6-36. Construct a histogram with 10 bins for the spot weld 
shear strength data in Exercise 6-23. Comment on the shape of 
the histogram. Does it convey the same information as the 
stem-and-leaf display? 

6-37. Construct a histogram for the water quality data in 
Exercise 6-24. Comment on the shape of the histogram. Does 
it convey the same information as the stem-and-leaf display? 
6-38. Construct a histogram with 10 bins for the overall dis- 
tance data in Exercise 6-25. Comment on the shape of the his- 
togram. Does it convey the same information as the stem-and- 
leaf display? 

6-39. Construct a histogram for the semiconductor speed 
data in Exercise 6-26. Comment on the shape of the his- 
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togram. Does it convey the same information as the stem-and- 
leaf display? 

6-40. Construct a histogram for the pinot noir wine rating data 
in Exercise 6-27. Comment on the shape of the histogram. Does 
it convey the same information as the stem-and-leaf display? 


6-41. The Pareto Chart. An important variation of a his- 
togram for categorical data is the Pareto chart. This chart is 
widely used in quality improvement efforts, and the categories 
usually represent different types of defects, failure modes, or 
product/process problems. The categories are ordered so that 
the category with the largest frequency is on the left, followed 
by the category with the second largest frequency and so forth. 
These charts are named after the Italian economist V. Pareto, 
and they usually exhibit “Pareto’s law”; that is, most of the de- 
fects can be accounted for by only a few categories. Suppose 
that the following information on structural defects in auto- 
mobile doors is obtained: dents, 4; pits, 4; parts assembled out 
of sequence, 6; parts undertrimmed, 21; missing holes/slots, 8; 
parts not lubricated, 5; parts out of contour, 30; and parts not 
deburred, 3. Construct and interpret a Pareto chart. 


The stem-and-leaf display and the histogram provide general visual impressions about a data 
set, while numerical quantities such as x or s provide information about only one feature of 
the data. The box plot is a graphical display that simultaneously describes several important 
features of a data set, such as center, spread, departure from symmetry, and identification of 


unusual observations or outliers. 


A box plot displays the three quartiles, the minimum, and the maximum of the data on a rec- 
tangular box, aligned either horizontally or vertically. The box encloses the interquartile range with 
the left (or lower) edge at the first quartile, g,, and the right (or upper) edge at the third quartile, q3. 
A line is drawn through the box at the second quartile (which is the 50th percentile or the median), 
q, = x. A line, or whisker, extends from each end of the box. The lower whisker is a line from the 
first quartile to the smallest data point within 1.5 interquartile ranges from the first quartile. The 
upper whisker is a line from the third quartile to the largest data point within 1.5 interquartile 
ranges from the third quartile. Data farther from the box than the whiskers are plotted as individ- 
ual points. A point beyond a whisker, but less than 3 interquartile ranges from the box edge, is 
called an outlier. A point more than 3 interquartile ranges from the box edge is called an extreme 
outlier. See Fig. 6-13. Occasionally, different symbols, such as open and filled circles, are used to 
identify the two types of outliers. Sometimes box plots are called box-and-whisker plots. 


Whisker extends to 


smallest data point within 
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Figure 6-13  Descrip- 
tion of a box plot. 
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Figure 6-14 Box plot for compressive 


strength data in Table 6-2. 
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Figure 6-15 Comparative box plots of a 
quality index at three plants. 


Figure 6-14 presents the box plot from Minitab for the alloy compressive strength data 
shown in Table 6-2. This box plot indicates that the distribution of compressive strengths is 
fairly symmetric around the central value, because the left and right whiskers and the lengths 
of the left and right boxes around the median are about the same. There are also two mild out- 


liers on either end of the data. 


Box plots are very useful in graphical comparisons among data sets, because they have 
high visual impact and are easy to understand. For example, Fig. 6-15 shows the comparative 
box plots for a manufacturing quality index on semiconductor devices at three manufacturing 
plants. Inspection of this display reveals that there is too much variability at plant 2 and that 
plants 2 and 3 need to raise their quality index performance. 


EXERCISES FOR SECTION 6-5 


6-42. Exercise 6-13 presented the joint temperatures of 

the O-rings (°F) for each test firing or actual launch of the 

space shuttle rocket motor. In that exercise you were asked 
to find the sample mean and sample standard deviation of 
temperature. 

(a) Find the upper and lower quartiles of temperature. 

(b) Find the median. 

(c) Set aside the smallest observation (31°F) and recompute 
the quantities in parts (a) and (b). Comment on your find- 
ings. How “different” are the other temperatures from this 
smallest value? 

(d) Construct a box plot of the data and comment on the pos- 
sible presence of outliers. 

6-43. An article in the Transactions of the Institution of 

Chemical Engineers (Vol. 34, 1956, pp. 280-293) reported 

data from an experiment investigating the effect of several 


process variables on the vapor phase oxidation of naphtha- 
lene. A sample of the percentage mole conversion of naphtha- 
lene to maleic anhydride follows: 4.2, 4.7, 4.7, 5.0, 3.8, 3.6, 
3.0, 5.1, 3.1, 3.8, 4.8, 4.0, 5.2, 4.3, 2.8, 2.0, 2.8, 3.3, 4.8, 5.0. 
(a) Calculate the sample mean. 

(b) Calculate the sample variance and sample standard 

deviation. 
(c) Construct a box plot of the data. 


6-44. The “cold start ignition time” of an automobile engine 
is being investigated by a gasoline manufacturer. The follow- 
ing times (in seconds) were obtained for a test vehicle: 1.75, 
1.92, 2.602, 2.35; 3.09. 3.15, 2.53, 1.91. 

(a) Calculate the sample mean and sample standard deviation. 
(b) Construct a box plot of the data. 


6-45. The nine measurements that follow are furnace tem- 
peratures recorded on successive batches in a semiconductor 


@ 


@ 


manufacturing process (units are °F): 953, 950, 948, 955, 951, 

949, 957, 954, 955. 

(a) Calculate the sample mean, sample variance, and standard 
deviation. 

(b) Find the median. How much could the largest temperature 
measurement increase without changing the median value? 

(c) Construct a box plot of the data. 


6-46. Exercise 6-12 presents drag coefficients for the 
NASA 0012 airfoil. You were asked to calculate the sample 
mean, sample variance, and sample standard deviation of 
those coefficients. 

(a) Find the upper and lower quartiles of the drag coefficients. 
(b) Construct a box plot of the data. 

(c) Set aside the largest observation (100) and rework parts a 

and b. Comment on your findings. 


6-47. The following data are the temperatures of effluent at 
discharge from a sewage treatment facility on consecutive 
days: 


43 47 24 48 52 50 46 49 
45 52 46 Sit 44 49 46 51 
49 45 44 50 48 50 49 50 


(a) Calculate the sample mean and median. 

(b) Calculate the sample variance and sample standard 
deviation. 

(c) Construct a box plot of the data and comment on the in- 
formation in this display. 


6-48. Reconsider the golf course yardage data in Exercise 6-3. 
Construct a box plot of the yardages and write an interpreta- 
tion of the plot. 


6-49. Reconsider the motor fuel octane rating data in 
Exercise 6-14. Construct a box plot of the yardages and write 
an interpretation of the plot. How does the box plot compare 
in interpretive value to the original stem-and-leaf diagram in 
Exercise 6-14? 
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6-50. Reconsider the spot weld shear strength data in 
Exercise 6-23. Construct a box plot of the strengths and write 
an interpretation of the plot. How does the box plot compare 
in interpretive value to the original stem-and-leaf diagram in 
Exercise 6-23? 


6-51. Reconsider the female engineering student height 
data in Exercise 6-22. Construct a box plot of the heights and 
write an interpretation of the plot. How does the box plot com- 
pare in interpretive value to the original stem-and-leaf dia- 
gram in Exercise 6-22? 


6-52. Reconsider the water quality data in Exercise 6-24. 
Construct a box plot of the concentrations and write an interpre- 
tation of the plot. How does the box plot compare in interpretive 
value to the original stem-and-leaf diagram in Exercise 6-24? 


6-53. Reconsider the golf ball overall distance data in 
Exercise 6-25. Construct a box plot of the yardage distance 
and write an interpretation of the plot. How does the box plot 
compare in interpretive value to the original stem-and-leaf di- 
agram in Exercise 6-25? 


6-54. Reconsider the wine rating data in Exercise 6-27. 
Construct a box plot of the wine ratings and write an interpreta- 
tion of the plot. How does the box plot compare in interpretive 
value to the original stem-and-leaf diagram in Exercise 6-27? 


6-55. Use the data on heights of female and male engineer- 
ing students from Exercises 6-22 and 6-29 to construct 
comparative box plots. Write an interpretation of the informa- 
tion that you see in these plots. 


6-56. In Exercise 6-44, data was presented on the cold start 
ignition time of a particular gasoline used in a test vehicle. A 
second formulation of the gasoline was tested in the same ve- 
hicle, with the following times (in seconds): 1.83, 1.99, 3.13, 
3.29, 2.65, 2.87, 3.40, 2.46, 1.89, and 3.35. Use this new data 
along with the cold start times reported in Exercise 6-44 to 
construct comparative box plots. Write an interpretation of the 
information that you see in these plots. 


The graphical displays that we have considered thus far such as histograms, stem-and-leaf 
plots, and box plots are very useful visual methods for showing the variability in data. 
However, we noted in Section 1-2.2 that time is an important factor that contributes to vari- 
ability in data, and those graphical methods do not take this into account. A time series or 
time sequence is a data set in which the observations are recorded in the order in which they 
occur. A time series plot is a graph in which the vertical axis denotes the observed value of 
the variable (say x) and the horizontal axis denotes the time (which could be minutes, days, 
years, etc.) When measurements are plotted as a time series, we often see trends, cycles, or 
other broad features of the data that could not be seen otherwise. 
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Sales, x 


1982 1983 1984 1985 1986 19871988 1989 1990 1991 Years 1 2 3 4 1 2 3 4 1 2 3 AQuarters 


1989 1990 1991 
(a) (b) 


Figure 6-16 Company sales by year (a) and by quarter (5). 


For example, consider Fig. 6-16(a), which presents a time series plot of the annual sales 
of a company for the last 10 years. The general impression from this display is that sales show 
an upward trend. There is some variability about this trend, with some years’ sales increasing 
over those of the last year and some years’ sales decreasing. Figure 6-16(5) shows the last 
three years of sales reported by quarter. This plot clearly shows that the annual sales in this 
business exhibit a cyclic variability by quarter, with the first- and second-quarter sales being 
generally greater than sales during the third and fourth quarters. 

Sometimes it can be very helpful to combine a time series plot with some of the other 
graphical displays that we have considered previously. J. Stuart Hunter (The American 
Statistician, Vol. 42, 1988, p. 54) has suggested combining the stem-and-leaf plot with a time 
series plot to form a digidot plot. 

Figure 6-17 shows a digidot plot for the observations on compressive strength from 
Table 6-2, assuming that these observations are recorded in the order in which they 


Leaf Stem Time series plot 
5 | 24 
7 |23 
189 | 22 
8 | 21 
7108 | 20 
960934 | 19 
0361410 | 18 


8544162106 | 17 
3073050979 | 16 
471340886808 | 15 
29583169 | 14 





413535 | 13 
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15 | 10 
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Figure 6-17 A digidot plot of the compressive strength data in Table 6-2. 
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occurred. This plot effectively displays the overall variability in the compressive strength 
data and simultaneously shows the variability in these measurements over time. The general 
impression is that compressive strength varies around the mean value of 162.67, and there 
is no strong obvious pattern in this variability over time. 

The digidot plot in Fig. 6-18 tells a different story. This plot summarizes 30 observations 
on concentration of the output product from a chemical process, where the observations are 
recorded at one-hour time intervals. This plot indicates that during the first 20 hours of oper- 
ation this process produced concentrations generally above 85 grams per liter, but that 
following sample 20, something may have occurred in the process that results in lower con- 
centrations. If this variability in output product concentration can be reduced, operation of this 


process can be improved. 


EXERCISES FOR SECTION 6-6 


6-57. The College of Engineering and Applied Science at 
Arizona State University had a VAX computer system. 
Response times for 20 consecutive jobs were recorded and are 
as follows: (read across) 


(b) Specifications on product viscosity are at 48 + 2. What 

conclusions can you make about process performance? 
6-59. The pull-off force for a connector is measured in a 
laboratory test. Data for 40 test specimens follow (read down, 


53 101 59 12.2 112 124 9.2 then left to right). 
5.0 5.8 32 8.5 T2 3.9 10.5 24] 203 201 25] 236 190 
9.5 6.2 10.0 4.7 6.4 8.1 258 195 195 238 245 175 
Construct and interpret a time series plot of these data. 237 249 255 210 209 178 
6-58. The following data are the viscosity measurements ad a A 198 212 175 
for a chemical product observed hourly (read down, then left 194 194 235 199 185 190 
= 248 209 249 213 218 
— 47.9 48.6 48.0 48.1 43.0 43.2 
47.9 48.8 47.5 48.0 42.9 4386 (a) Construct a time series plot of the data. 
48.6 48.1 48.6 48.3 43.6 432 (b) Construct and interpret either a digidot plot or a stem-and- 
480 483 480 432 433 435 eo 
48.4 472 47.9 43.0 43.0 43.0 6-60. In their book Time Series Analysis, Forecasting, and 
m 48.9 48.3 43.5 42.8 l Control (Prentice Hall, 1994), G. E. P. Box, G. M. Jenkins, 
48.0 48.6 AGE 431 431 and G. C. Reinsel present chemical process concentration 


(a) Construct and interpret either a digidot plot or a separate 
stem-and-leaf and time series plot of these data. 


readings made every two hours. Some of these data follow 
(read down, then left to right). 
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Table 6-5 United Kingdom Passenger Airline Miles Flown 





Month 1964 1965 1966 45 1968 1969 T2 
Jan. 7.269 8.350 8.186 8.334 8.639 9.49] 10.840 
Feb. 6.775 7.829 7.444 7.899 8.772 8.919 10.436 
Mar. 7.819 8.829 8.484 9.994 10.894 11.607 13.589 
Apr. 8.371 9.948 9.864 10.078 10.455 8.852 13.402 
May 9.069 10.638 10.252 10.801 11.179 12.537 13.103 
June 10.248 11.253 12.232 12.953 10.588 14.759 14.933 
July 11.030 11.424 11.637 12:222 10.794 13.667 14.147 
Aug. 10.882 11.391 11.577 12.246 12.770 13.731 14.057 
Sept. 10.333 10.665 12.417 13.281 13.812 15.110 16.234 
Oct 9.109 9.396 9.637 10.366 10.857 12.185 12.389 
Nov. 7.685 RII 8.094 8.730 9.290 10.645 11.594 
Dec 7.682 7.933 9.280 9.614 10.925 12.161 12.772 

17.0 16.7 17.1 17.5 17.6 41 10 16 8 62 94 

16.6 17.4 17.4 18.1 155 21 8 7 13 98 96 

16.3 17.2 17.4 17.5 16.5 16 2 4 57 124 i 

16.1 17.4 17.5 17.4 17.8 6 0 2 122 96 59 

Ej 17.1 17.4 17.4 17.4 17.3 4 i 8 138 66 d4 

—Z 16.9 17.0 17.6 17.1 17.3 7 5 17 103 64 47 

16.8 17.3 17.4 17.6 17.1 14 12 36 86 54 30 

17.4 17.2 17.3 17.7 17.4 34 14 50 63 39 16 

17.1 17.4 17.0 17.4 16.9 45 35 62 37 2l 7 

17.0 16.8 17.8 17.8 17.3 43 46 67 24 T 37 

48 41 71 11 4 74 
6-61. Construct and interpret either a digidot plot or a stem- 42 30 48 15 23 
and-leaf plot of these data. The 100 annual Wolfer sunspot 28 24 28 40 55 


numbers from 1770 to 1869 follow. (For an interesting analy- 


——— 





101 
82 
66 
35 


sis and interpretation of these numbers, see the book by 6-62. 


In their book Forecasting and Time Series Analysis, 


"Á Box, Jenkins, and Reinsel referenced in Exercise 6-60. Their 2nd edition (McGraw-Hill, 1990), D. C. Montgomery, L. A. 
analysis requires some advanced knowledge of statistics and 


statistical model building.) (read down, then left to right) 

(a) Construct a time series plot of these data. 

(b) Construct and interpret either a digidot plot or a stem-and- 
leaf plot of these data. 


Johnson, and J. S. Gardiner analyze the data in Table 6-5, 

which are the monthly total passenger airline miles flown in 

the United Kingdom, 1964—1970 (in millions of miles). 

(a) Draw a time series plot of the data and comment on any 
features of the data that are apparent. 

(b) Construct and interpret either a digidot plot or a stem-and- 


31 154 38 83 90 leaf plot of these data. 
7 125 25 132 67 

20 85 10 Isl 60 

92 68 24 118 47 


6-7 PROBABILITY PLOTS 


How do we know if a particular probability distribution is a reasonable model for data? 
Sometimes, this is an important question because many of the statistical techniques 
presented in subsequent chapters are based on an assumption that the population distribution 
is of a specific type. Thus, we can think of determining whether data come from a specific 


EXAMPLE 6-7 
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probability distribution as verifying assumptions. In other cases, the form ofthe distribution 
can give insight into the underlying physical mechanism generating the data. For example, in 
reliability engineering, verifying that time-to-failure data come from an exponential distri- 
bution identifies the failure mechanism in the sense that the failure rate is constant with 
respect to time. 

Some of the visual displays we have used earlier, such as the histogram, can provide 
insight about the form of the underlying distribution. However, histograms are usually not 
really reliable indicators of the distribution form unless the sample size is very large. 
Probability plotting is a graphical method for determining whether sample data conform 
to a hypothesized distribution based on a subjective visual examination of the data. The gen- 
eral procedure is very simple and can be performed quickly. It is also more reliable than the 
histogram for small to moderate size samples. Probability plotting typically uses special 
graph paper, known as probability paper, that has been designed for the hypothesized 
distribution. Probability paper 1s widely available for the normal, lognormal, Weibull, and 
various chi-square and gamma distributions. We focus primarily on normal probability plots 
because many statistical techniques are appropriate only when the population is (at least ap- 
proximately) normal. 

To construct a probability plot, the observations in the sample are first ranked from 
smallest to largest. That is, the sample x1, x5, ... , x, 1s arranged as xq, X2) ... 5 xq, where 
xa) is the smallest observation, xo) is the second smallest observation, and so forth, with xe) 
the largest. The ordered observations x, ; are then plotted against their observed cumulative 
frequency (j — 0.5)/n on the appropriate probability paper. If the hypothesized distribution 
adequately describes the data, the plotted points will fall approximately along a straight line; 
if the plotted points deviate significantly from a straight line, the hypothesized model is not 
appropriate. Usually, the determination of whether or not the data plot as a straight line is 
subjective. The procedure is illustrated in the following example. 


Ten observations on the effective service life in minutes of batteries used in a portable 
personal computer are as follows: 176, 191, 214, 220, 205, 192, 201, 190, 183, 185. We 
hypothesize that battery life is adequately modeled by a normal distribution. To use probabil- 
ity plotting to investigate this hypothesis, first arrange the observations in ascending order and 
calculate their cumulative frequencies (j — 0.5)/10 as shown in Table 6-6. 

The pairs of values xj and (j — 0.5)/10 are now plotted on normal probability paper. 
This plot is shown in Fig. 6-19. Most normal probability paper plots 100(j — 0.5)/n on the left 
vertical scale and 100[ 1 — (j — 0.5)/n] on the right vertical scale, with the variable value plot- 
ted on the horizontal scale. A straight line, chosen subjectively, has been drawn through the plot- 
ted points. In drawing the straight line, you should be influenced more by the points near the 
middle of the plot than by the extreme points. A good rule of thumb is to draw the line approxi- 
mately between the 25th and 75th percentile points. This 1s how the line in Fig. 6-19 was deter- 
mined. In assessing the “closeness” of the points to the straight line, imagine a “fat pencil" lying 
along the line. If all the points are covered by this imaginary pencil, a normal distribution ade- 
quately describes the data. Since the points in Fig. 6-19 would pass the “fat pencil" test, we con- 
clude that the normal distribution is an appropriate model. 


A normal probability plot can also be constructed on ordinary graph paper by plotting 
the standardized normal scores z, against x, ;, where the standardized normal scores satisfy 
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100( j - 0.5)/n 
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i Table 6-6 Calculation for Constracting a Normal 
l Probability Plot 
T J X(j) (j = 0.5)/10 Zi 
a 1 176 0.05 — 1.64 
1 j 183 0.15 —1.04 
bi 3 185 0.25 —0.67 
= 4 190 0.35 —0.39 
= 5 191 0.45 =0.13 
6 192 0.55 0.13 
zs 7 201 0.65 0.39 
99.9 8 205 0.75 0.67 
ur 210 220 9 214 0.85 1.04 
10 220 0.95 1.64 


Figure 6-19 Normal probability plot for battery life. 


Figure 6-20 


probability plot 
obtained from 


standardized normal 


SCOres. 


Normal 


For example, if (j — 0.5)/n = 0.05, d(z;) = 0.05 implies that z; = —1.64. To illustrate, 
consider the data from Example 6-4. In the last column of Table 6-6 we show the standarized 
normal scores. Figure 6-20 presents the plot of z; versus x(;, This normal probability plot is 
equivalent to the one in Fig. 6-19. 

We have constructed our probability plots with the probability scale (or the z-scale) on the 
vertical axis. Some computer packages “flip” the axis and put the probability scale on the hor- 
izontal axis. 

The normal probability plot can be useful in identifying distributions that are symmetric 
but that have tails that are “heavier” or "lighter" than the normal. They can also be useful in 
identifying skewed distributions. When a sample is selected from a light-tailed distribution 
(such as the uniform distribution), the smallest and largest observations will not be as extreme 
as would be expected in a sample from a normal distribution. Thus if we consider the straight 
line drawn through the observations at the center of the normal probability plot, observations 
on the left side will tend to fall below the line, whereas observations on the right side will tend 
to fall above the line. This will produce an S-shaped normal probability plot such as shown in 
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Figure 6-21 Normal probability plots indicating a nonnormal distrubution. (a) Light-tailed distribution. (b) Heavy-tailed 


distribution. (c) A distribution with positive (or right) skew. 


Fig. 6-21(a). A heavy-tailed distribution will result in data that also produces an S-shaped 
normal probability plot, but now the observations on the left will be above the straight line and 
the observations on the right will lie below the line. See Fig. 6-19(b). A positively skewed dis- 
tribution will tend to produce a pattern such as shown in Fig. 6-19(c), where points on both 
ends of the plot tend to fall below the line, giving a curved shape to the plot. This occurs be- 
cause both the smallest and the largest observations from this type of distribution are larger 
than expected in a sample from a normal distribution. 

Even when the underlying population is exactly normal, the sample data will not plot 
exactly on a straight line. Some judgment and experience are required to evaluate the plot. 
Generally, if the sample size is n « 30, there can be a lot of deviation from linearity in normal 
plots, so in these cases only a very severe departure from linearity should be interpreted as a 
strong indication of nonnormality. As n increases, the linear pattern will tend to become 
stronger, and the normal probability plot will be easier to interpret and more reliable as an 


indicator of the form of the distribution. 


EXERCISES FOR SECTION 6-7 


6-63. Construct a normal probability plot of the piston ring 
diameter data in Exercise 6-1. Does it seem reasonable to 
assume that piston ring diameter is normally distributed? 


6-64. Construct a normal probability plot of the insulating 
fluid breakdown time data in Exercise 6-2. Does it seem 
reasonable to assume that breakdown time is normally 
distributed? 


6-65. Construct a normal probability plot of the visual 
accommodation data in Exercise 6-5. Does it seem reason- 
able to assume that visual accommodation is normally 
distributed? 


ax 6-66. Construct a normal probability plot of the O-ring joint 


SE 


temperature data in Exercise 6-13. Does it seem reasonable to 
assume that O-ring joint temperature is normally distributed? 
Discuss any interesting features that you see on the plot. 


6-67. Construct a normal probability plot of the octane rat- 
ing data in Exercise 6-14. Does it seem reasonable to assume 
that octane rating is normally distributed? 


6-68. Construct a normal probability plot of the cycles to 
failure data in Exercise 6-15. Does it seem reasonable to as- 
sume that cycles to failure is normally distributed? 


6-69. Construct a normal probability plot of the wine qual- 
ity rating data in Exercise 6-27. Does it seem reasonable to 
assume that this variable is normally distributed? 


6-70. Construct a normal probability plot of the sus- 
pended solids concentration data in Exercise 6-24. Does it 
seem reasonable to assume that the concentration of 
suspended solids in water from this particular lake is 
normally distributed? 


w (EN 
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6-71. Construct two normal probability plots for the height 
data in Exercises 6-22 and 6-29. Plot the data for female and 
male students on the same axes. Does height seem to be 
normally distributed for either group of students? If both pop- 
ulations have the same variance, the two normal probability 
plots should have identical slopes. What conclusions would 
you draw about the heights of the two groups of students from 
visual examination of the normal probability plots? 


6-72. Itis possible to obtain a “quick and dirty" estimate of 
the mean of a normal distribution from the fiftieth percentile 
value on a normal probability plot. Provide an argument why 
this is so. It is also possible to obtain an estimate of the stan- 
dard deviation of a normal distribution by subtracting the 
sixty-fourth percentile value from the fiftieth percentile value. 
Provide an argument why this is so. 


6-8 MORE ABOUT PROBABILITY 
PLOTTING (CD ONLY) 
Supplemental Exercises 
6-73. The concentration of a solution is measured six times 


by one operator using the same instrument. She obtains the 

following data: 63.2, 67.1, 65.8, 64.0, 65.1, and 65.3 (grams 

per liter). 

(a) Calculate the sample mean. Suppose that the desirable 
value for this solution has been specified to be 65.0 
grams per liter. Do you think that the sample mean 
value computed here is close enough to the target value 
to accept the solution as conforming to target? Explain 
your reasoning. 

(b) Calculate the sample variance and sample standard 
deviation. 

(c) Suppose that in measuring the concentration, the operator 
must set up an apparatus and use a reagent material. What 
do you think the major sources of variability are in this ex- 
periment? Why is it desirable to have a small variance of 
these measurements? 


6-74. A sample of six resistors yielded the following resist- 
ances (ohms): x, = 45, x, = 38, x4 = 47, x4 = 41, x5 = 35, 
and xg = 43. 

(a) Compute the sample variance and sample standard 
deviation. 

(b) Subtract 35 from each of the original resistance measure- 
ments and compute s^ and s. Compare your results with 
those obtained in part (a) and explain your findings. 

(c) If the resistances were 450, 380, 470, 410, 350, and 430 
ohms, could you use the results of previous parts of this 
problem to find s? and s? 


6-75. Consider the following two samples: 
Sample 1: 10, 9, 8, 7, 8, 6, 10, 6 
Sample 2: 10, 6, 10, 6, 8, 10, 8, 6 
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(a) Calculate the sample range for both samples. Would you 
conclude that both samples exhibit the same variability? 
Explain. 

(b) Calculate the sample standard deviations for both sam- 
ples. Do these quantities indicate that both samples have 
the same variability? Explain. 

(c) Write a short statement contrasting the sample range 
versus the sample standard deviation as a measure of vari- 
ability. 

6-76. An article in Quality Engineering (Vol. 4, 1992, pp. 

487—495) presents viscosity data from a batch chemical 

process. A sample of these data follows: 


13.3 143 149 152 158 142 160 14.0 
14.5 161 13.7 15.2 13.7 169 149 14.4 
15.3 13.1 15.2 15.9 15.1 49 13.6 13.7 
15.3 155 14.5 165 154 152 153 13.8 
14.3 126 15.3 148 141 144 143 15.6 
14.8 146 15.6 15.1 148 152 15.6 14.5 
15.2 143 15.8 170 143 146 16.1 12.8 
14.5 154 13.3 14.9 143 164 13.9 16.1 
14.6 152 14.1 148 164 142 15.2 16.6 
14.1 16.8 15.4 140 169 15.7 144 15.6 


(a) Reading down and left to right, draw a time series plot of 
all the data and comment on any features of the data that 
are revealed by this plot. 

(b) Consider the notion that the first 40 observations were 
generated from a specific process, whereas the last 40 ob- 
servations were generated from a different process. Does 
the plot indicate that the two processes generate similar 
results? 

(c) Compute the sample mean and sample variance of the first 
40 observations; then compute these values for the second 
40 observations. Do these quantities indicate that both 
processes yield the same mean level? The same variabil- 
ity? Explain. 

6-77. Reconsider the data from Exercise 6-76. Prepare 

comparative box plots for two groups of observations: the 

first 40 and the last 40. Comment on the information in the 
box plots. 


6-78. The data shown in Table 6-7 are monthly champagne 

sales in France (1962-1969) in thousands of bottles. 

(a) Construct a time series plot of the data and comment on 
any features of the data that are revealed by this plot. 

(b) Speculate on how you would use a graphical proce- 
dure to forecast monthly champagne sales for the year 
1970. 


6-79. A manufacturer of coil springs is interested in imple- 
menting a quality control system to monitor his production 
process. As part of this quality system, it is decided to record 
the number of nonconforming coil springs in each production 


Table 6-7 Champagne Sales in France 


Month 1962 1963 1964 1965 
Jan. 2.851 2.541 3.113 2:929 
Feb. 2.672 2.475 3.006 3.088 
Mar. 2:155 3.031 4.047 3.718 
Apr. 22 3.266 3:525 4.514 
May 2.946 3.776 3.937 4.520 
June 3.036 3.230 3.986 4.539 
July 2.282 3.028 3.260 3.663 
Aug. 2,217 1:759 1:575 1.643 
Sept 2.922 2:595 3.526 4.739 
Oct. 4.301 4.474 5.211 5.428 
Nov. 5.764 6.838 7.614 8.314 
Dec 7,152 8.357 9.254 10.651 


1966 


3.633 
4.292 
4.154 
4.121 
4.647 
4.753 
3.965 
1.723 
5.048 
6.922 
9.858 
11.331 
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1967 


4.016 
3.957 
4.510 
4.276 
4.968 
4.677 
3.525 
1.821 
25:222 
6.873 
10.803 
13.916 


1968 


2.639 
2.899 
3.370 
3.740 
2:927 
3.986 
4.217 
1.738 
5:221 
6.424 
9.842 
13.076 
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1969 


3.934 
3.162 
4.286 
4.676 
5.010 
4.874 
4.633 
1.659 
2:991 
6.981 
9.851 
12.670 


batch of size 50. During 40 days of production, 40 batches of 
data were collected as follows: 


Read data across. 


9 12 6 9 7 14 12 4 6 7 
8 5 9 7 8 11 3 6 7 7 
11 4 4 8 T 5 - 5 8 
19 19 1$ 12 ll 17 15 17 15» 13 


(a) Construct a stem-and-leaf plot of the data. 

(b) Find the sample average and standard deviation. 

(c) Construct a time series plot of the data. Is there evidence 
that there was an increase or decrease in the average 
number of nonconforming springs made during the 40 
days? Explain. 

6-80. A communication channel is being monitored by 

recording the number of errors in a string of 1000 bits. Data 

for 20 of these strings follow: 


Read data across. 
3 1 0 ] 3 2 4 1 3 1 
1 1 2 3 3 2 0 2 0 1 


(a) Construct a stem-and-leaf plot of the data. 

(b) Find the sample average and standard deviation. 

(c) Construct a time series plot of the data. Is there evidence 
that there was an increase or decrease in the number of 
errors in a string? Explain. 


6-81. Reconsider the data in Exercise 6-76. Construct normal 
probability plots for two groups of the data: the first 40 and the 
last 40 observations. Construct both plots on the same axes. 
What tentative conclusions can you draw? 

6-82. Construct a normal probability plot of the effluent dis- 
charge temperature data from Exercise 6-47. Based on the 
plot, what tentative conclusions can you draw? 

6-83. Construct normal probability plots of the cold start 
ignition time data presented in Exercises 6-44 and 6-56. 


Construct a separate plot for each gasoline formulation, but 
arrange the plots on the same axes. What tentative conclusions 
can you draw? 


6-84. Transformations. In some data sets, a transformation 

by some mathematical function applied to the original data, 

such as V/y or log y, can result in data that are simpler to work 
with statistically than the original data. To illustrate the effect 
of a transformation, consider the following data, which repre- 

sent cycles to failure for a yarn product: 675, 3650, 175, 1150, 

290, 2000, 100, 375. 

(a) Construct a normal probability plot and comment on the 
shape of the data distribution. 

(b) Transform the data using logarithms; that is, let y* (new 
value) = log y (old value). Construct a normal probability 
plot of the transformed data and comment on the effect of 
the transformation. 


6-85. In 1879, A. A. Michelson made 100 determinations of 
the velocity of light in air using a modification of a method 
proposed by the French physicist Foucault. He made the 
measurements in five trials of 20 measurements each. The ob- 
servations (in kilometers per second) follow. Each value has 
299,000 substracted from it. 


Trial 1 
850 900 930 950 980 
1000 930 760 1000 960 
740 1070 850 980 880 
980 650 810 1000 960 
Trial 2 
960 960 880 850 900 
830 810 880 800 760 
940 940 800 880 840 
790 880 830 790 800 


218 


Trial 3 
880 880 720 620 970 
880 850 840 850 840 
880 860 720 860 950 
910 870 840 840 840 
Trial 4 
890 810 800 760 750 
910 890 880 840 850 
810 820 770 740 760 
920 860 720 850 780 
Trial 5 
890 780 760 790 820 
870 810 810 950 810 
840 810 810 810 850 
870 740 940 800 870 


The currently accepted true velocity of light in a vacuum is 
299, 792.5 kilometers per second. Stigler (1977, The Annals of 
Statistics) reports that the “true” value for comparison to these 
measurements is 734.5. Construct comparative box plots of 
these measurements. Does it seem that all five trials are con- 
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sistent with respect to the variability of the measurements? 
Are all five trials centered on the same value? How does each 
group of trials compare to the true value? Could there have 
been "startup" effects in the experiment that Michelson 
performed? Could there have been bias in the measuring 
instrument? 

6-86. In 1789, Henry Cavendish estimated the density of 


the earth by using a torsion balance. His 29 measurements 
follow, expressed as a multiple of the density of water. 


5.50 5.30 5.47 5.10 2.29 3.65 
3.255 5.61 3:75 5.63 3.2 5.44 
5.57 5.36 4.88 5.86 5.34 3,99 
5.34 2/99 3.29 4.07 5.85 5.46 
5.42 5.79 5.62 5.58 5.26 


(a) Calculate the sample mean, sample standard deviation, 
and median of the Cavendish density data. 

(b) Construct a normal probability plot of the data. Comment 
on the plot. Does there seem to be a “low” outlier in the 
data? 

(c) Would the sample median be a better estimate of the 
density of the earth than the sample mean? Why? 


In the E-book, click on any 
term or concept below to 
go to that subject. 

Box plot 

Frequency distribution 
and histogram 
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MIND-EXPANDING EXERCISES 


6-87. Consider the airfoil data in Exercise 6-12. 
Subtract 30 from each value and then multiply the re- 
sulting quantities by 10. Now compute s? for the new 
data. How is this quantity related to s* for the original 
data? Explain why. 

6-88. Consider the quantity }/_, (x; — a)? . For what 
value of a is this quantity minimized? 

6-89 Using the results of Exercise 6-87, which of the 
two quantities Y, (x; — x) and 9j, (x; — py 
will be smaller, provided that x # u? 

6-90. Coding the Data. Let y; = a + bx; i = 
1,2, ...,n, where a and b are nonzero constants. Find the 
relationship between x and y, and between sy and s. 


6-91. Asample oftemperature measurements in a fur- 
nace yielded a sample average (°F) of 835.00 and a sam- 
ple standard deviation of 10.5. Using the results from 
Exercise 6-90, what are the sample average and sample 
standard deviations expressed in ^C? 


6-92. Consider the sample x1, x2, ..., x, with sample 
mean x and sample standard deviation s. Let 
zi = (x; — x)/s,i = 1,2,...,n. What are the values of 
the sample mean and sample standard deviation of the z;? 
6-93. An experiment to investigate the survival time 
in hours of an electronic component consists of placing 
the parts in a test cell and running them for 100 hours 
under elevated temperature conditions. (This is called an 
“accelerated” life test.) Eight components were tested 
with the following resulting failure times: 


75, 63, 100°, 36, 51, 45, 80, 90 


The observation 100* indicates that the unit still func- 
tioned at 100 hours. Is there any meaningful measure of 
location that can be calculated for these data? What is its 
numerical value? 


IMPORTANT TERMS AND CONCEPTS 


Normal probability plot 

Population mean 

Population standard 
deviation 

Population variance 

Random sample 


Median, quartiles and 
percentiles 


6-94. Suppose that we have a sample x4, X2, ..., x, and 
we have calculated x, and s? for the sample. Now an 
(n + l)st observation becomes available. Let x, ; and 
52,4 be the sample mean and sample variance for the 
sample using all n + 1 observations. 

(a) Show how x, + 1 can be computed using x, and x, ,,. 


n(Xn+1 = xs 
Hol 

(c) Use the results of parts (a) and (b) to calculate the 
new sample average and standard deviation for the 
data of Exercise 6-22, when the new observation is 
Xag = 64. 

6-95. The Trimmed Mean. Suppose that the data are 

arranged in increasing order, T% of the observations are 

removed from each end and the sample mean of the re- 
maining numbers is calculated. The resulting quantity is 
called a trimmed mean. The trimmed mean generally 
lies between the sample mean x and the sample median 

x. Why? 

(a) Calculate the 10% trimmed mean for the yield data 
in Exercise 6-17. 

(b) Calculate the 20% trimmed mean for the yield data 
in Exercise 6-17 and compare it with the quantity 
found in part (a). 

(c) Compare the values calculated in parts (a) and (b) 
with the sample mean and median for the yield 
data. Is there much difference in these quantities? 
Why? 

6-96. The Trimmed Mean. Suppose that the sample 

size n is such that the quantity nT/100 is not an integer. 

Develop a procedure for obtaining a trimmed mean in 

this case. 





(b) Show that nS, = (n — 1)s2 TH 
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Sample mean 
Sample standard 
deviation plot 
Sample variance Goodness of fit 
Stem-and-leaf diagram 


Time series plots 
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Exponential probability 


Weibull probability plot 
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Probability plots are extremely useful and are often the first technique used in an effort to 
determine which probability distribution is likely to provide a reasonable model for the data. 

We give a simple illustration of how a normal probability plot can be useful in distin- 
guishing between normal and nonnormal data. Table S6-1 contains 50 observations gener- 
ated at random from an exponential distribution with mean 20 (or A — 0.05). These data 
were generated using the random number generation capability in Minitab. Figure S6-1 
presents a normal probability plot of these data, constructed using Minitab. The observa- 
tions do not even approximately lie along a straight line, giving a clear indication that the 
data do not follow a normal distribution. The strong curvature at both ends of the plot sug- 
gests that the data come from a distribution with right or positive skew. Compare Fig. S6-1 
with Fig. 6-19c. 

Minitab also provides estimates of the mean and standard deviation of the distribution us- 
ing the method of maximum likelihood (abbreviated ML on the graph in Figure S6-1). We 
will discuss maximum likelihood estimation in Chapter 7. For the normal distribution, this is 
the familiar sample mean and sample standard deviation that we first presented in Chapter 1. 
Minitab also presents a quantitative measure of how well the data are described by a normal 
distribution. This goodness-of-fit measure is called the Anderson-Darling statistic (abbrevi- 
ated AD on the Minitab probability plot). The Anderson-Darling statistic 1s based on the prob- 
ability integral transformation 


that can be used to convert the data to a uniform distribution if the hypothesized distribution 
is correct. Thus, if x1, X2, . . . , x, are independent and identically distributed random variables 
whose cumulative distribution function is F(x), then F(x,), F(x;), . . . , F(x,) are independent 
uniform (0, 1) random variables. The Anderson-Darling statistic essentially compares how 
close the F(x,), F(x), . .. , F(x,) values are to values from a uniform (0, 1) distribution. For 


Normal probability plot 
ML estimates 


e ML estimates 
Mean 20.7362 
St. Dev. 19.2616 


Goodness of fit 
AD* 1.904 


Percentage 
O1 
e 





Figure S6-1. Normal 
probability plot (from 
Minitab) of the data 0 50 100 
from Table S6-1. Data 


6-1 


6-2 


Figure S6-2. 
Exponential probabil- 
ity plot (from Minitab) 
of the data from Table 
S6-1. 


Figure S6-3. Weibull 
probability plot (from 
Minitab) of the data 
from Table S6-1. 
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this reason, the Anderson-Darling test is sometimes called a “distance” test. The test is upper- 
tailed; that is, if the computed value exceeds a critical value, the hypothesis of normality is 
rejected. The 5% critical value of the Anderson-Darling statistic 1s 0.752 and the 1% value is 
1.035. Because the Anderson-Darling statistic in Figure S6-1 is 1.904, and this exceeds the 1% 
critical value, we conclude that the assumption of normality would be inappropriate. 

Minitab can construct several other types of probability plots. An exponential probability 
plot of the data in Table S6-1 is shown in Figure S6-2. Notice that the data lies very close to 
the straight line in this plot, implying that the exponential is a good model for the data. 
Minitab also provides an estimate of the mean of the exponential distribution. This estimate is 
just the sample mean. 

Figure S6-3 is a Weibull probability plot of the data from Table S6-1, constructed using 
Minitab. The data lies approximately along a straight line, suggesting that the Weibull 
distribution is also a reasonable model for the data. Notice that Minitab provides maximum 
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Table S6-1 50 Observations Drawn at Random from an Exponential Distribution with 
Mean 20 (A — 0.05) 


1.2934 14.1968 15.2798 16.1154 14.8891 31.6489 

2.4330 13.1818 DLE 39.9558 97.8874 21.1057 
15.8770 9.0942 31.3655 1.3104 12.0008 11.2846 
21.9606 21.1336 26.8364 9.3134 31.0346 29.0222 
47.4481 0.3389 1.0999 19.8350 1.5191 2.5623 
34.8720 39.2494 12.1621 18.8295 35.3307 
27.0908 0.6731 14.2467 49.9397 6.0479 
17.5255 68.7876 57.8919 1.0882 22.4244 

6.8290 37.9023 1.8219 6.4967 12.8239 


likelihood estimates of the shape parameter Q and the scale parameter ò. The shape parameter 
estimate is very close to unity, and we know that a Weibull distribution with B = 1 is the 
exponential distribution. 
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7-2.1 Unbiased Estimators 7-3.3 Bayesian Estimation of 
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Point Estimate 


7-2.5 Bootstrap Estimate of the Standard 
Error (CD Only) 


7-2.6 Mean Square Error of an Estimator 


LEARNING OBJECTIVES 


After careful study of this chapter you should be able to do the following: 
1. Explain the general concepts of estimating the parameters of a population or a probability distribution 
2. Explain important properties of point estimators, including bias, variance, and mean square error 


3. Know how to construct point estimators using the method of moments and the method of maxi- 
mum likelihood 


4. Know how to compute and explain the precision with which a parameter is estimated 
5. Understand the central limit theorem 


6. Explain the important role of the normal distribution as a sampling distribution 


CD MATERIAL 
7. Use bootstrapping to find the standard error of a point estimate 


8. Know how to construct a point estimator using the Bayesian approach 


7-1 INTRODUCTION Z2] 


Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e- Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e-Text immediately following the section they accompany. 


7-1 INTRODUCTION 


Definition 


The field of statistical inference consists of those methods used to make decisions or to draw 
conclusions about a population. These methods utilize the information contained in a sample 
from the population in drawing conclusions. This chapter begins our study of the statistical 
methods used for inference and decision making. 

Statistical inference may be divided into two major areas: parameter estimation and 
hypothesis testing. As an example of a parameter estimation problem, suppose that a structural 
engineer is analyzing the tensile strength of a component used in an automobile chassis. Since 
variability in tensile strength is naturally present between the individual components because of 
differences in raw material batches, manufacturing processes, and measurement procedures (for 
example), the engineer is interested in estimating the mean tensile strength of the components. 
In practice, the engineer will use sample data to compute a number that is in some sense a rea- 
sonable value (or guess) of the true mean. This number is called a point estimate. We will see 
that it is possible to establish the precision of the estimate. 

Now consider a situation in which two different reaction temperatures can be used in a 
chemical process, say f, and tj. The engineer conjectures that ¢, results in higher yields than 
does t,. Statistical hypothesis testing is a framework for solving problems of this type. In this 
case, the hypothesis would be that the mean yield using temperature f, is greater than the mean 
yield using temperature £5. Notice that there is no emphasis on estimating yields; instead, the 
focus is on drawing conclusions about a stated hypothesis. 

Suppose that we want to obtain a point estimate of a population parameter. We know that 
before the data is collected, the observations are considered to be random variables, say 
Xi, X5, ... , X,. Therefore, any function of the observation, or any statistic, is also a random 
variable. For example, the sample mean X and the sample variance S^ are statistics and they 
are also random variables. 

Since a statistic is a random variable, it has a probability distribution. We call the proba- 
bility distribution of a statistic a sampling distribution. The notion of a sampling distribution 
is very important and will be discussed and illustrated later in the chapter. 

When discussing inference problems, it is convenient to have a general symbol to represent 
the parameter of interest. We will use the Greek symbol 0 (theta) to represent the parameter. The 
objective of point estimation is to select a single number, based on sample data, that is the most 
plausible value for 0. A numerical value of a sample statistic will be used as the point estimate. 

In general, if X is a random variable with probability distribution f(x), characterized by 
the unknown parameter 0, and if Xj, X5, ..., X, 1s a random sample of size n from X, the 
statistic Ô = h(X,, Xo, ... , Xa) is called a point estimator of 0. Note that Ó is a random vari- 
able because it is a function of random variables. After the sample has been selected, € takes 
on a particular numerical value 6 called the point estimate of 0. 


A point estimate of some population parameter 0 is a single numerical value 6 of a 


statistic Ô. The statistic © is called the point estimator. 
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As an example, suppose that the random variable X is normally distributed with an un- 
known mean pu. The sample mean is a point estimator of the unknown population mean p. 
That is, (u = X. After the sample has been selected, the numerical value x is the point estimate 
of u. Thus, if x, = 25, x, = 30, x4 = 29, and x, = 31, the point estimate of p is 


=. 495930 r29 F31 
x= cA4&n» 


Similarly, if the population variance o? is also unknown, a point estimator for o? is the sample 
variance $7, and the numerical value s^ = 6.9 calculated from the sample data is called the 
point estimate of o°. 

Estimation problems occur frequently in engineering. We often need to estimate 


e The mean p ofa single population 

e The variance o? (or standard deviation o) of a single population 

e The proportion p of items in a population that belong to a class of interest 
e The difference in means of two populations, p; — p» 


e The difference in two population proportions, p, — p» 
Reasonable point estimates of these parameters are as follows: 


e For p, the estimate is Ñ = x, the sample mean. 


2 


* Foro’, the estimate is 6^ = s^, the sample variance. 


e For p, the estimate is p = x/n, the sample proportion, where x is the number of items 
in a random sample of size n that belong to the class of interest. 


e For u; — p», the estimate is Ñ; — fio = x4 — x5, the difference between the sample 
means of two independent random samples. 


e For p; — p, the estimate is p, — p», the difference between two sample proportions 
computed from two independent random samples. 


We may have several different choices for the point estimator of a parameter. For ex- 
ample, if we wish to estimate the mean of a population, we might consider the sample 
mean, the sample median, or perhaps the average of the smallest and largest observations 
in the sample as point estimators. In order to decide which point estimator of a particular 
parameter is the best one to use, we need to examine their statistical properties and develop 
some criteria for comparing estimators. 


7-2 GENERAL CONCEPTS OF POINT ESTIMATION 
7-2.1 Unbiased Estimators 


An estimator should be “close” in some sense to the true value of the unknown parameter. 
Formally, we say that € is an unbiased estimator of 0 if the expected value of Ô is equal to 0. 
This is equivalent to saying that the mean of the probability distribution of © (or the mean of 
the sampling distribution of Ô) is equal to 0. 


Definition 


EXAMPLE 7-1 
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The point estimator €) is an unbiased estimator for the parameter 0 if 


^ 


E(O) = 0 


If the estimator is not unbiased, then the difference 


E(O) — 0 


is called the bias of the estimator Ô. 





When an estimator is unbiased, the bias is zero; that is, E(O) — 0 = 0. 


Suppose that X is a random variable with mean w and variance o°. Let Xj, X5, ..., X, bea 
random sample of size n from the population represented by X. Show that the sample mean X 
and sample variance S^ are unbiased estimators of u and o7 , respectively. 

First consider the sample mean. In Equation 5.40a in Chapter 5, we showed that E(X) = w. 
Therefore, the sample mean X is an unbiased estimator of the population mean p. 

Now consider the sample variance. We have 


n 


Y a -xy 








n u u 1 H _ 
EY a+- x)= e| x? — nk?) 
1 a ; e 
E >, E(X?) — nE(X?) 
i=l 


n= | 





The last equality follows from Equation 5-37 in Chapter 5. However, since E(X aA = pu +o? 
and E(X?) = u? + o?/n, we have 


E(S^) = P : | > (u^ + o^?) — n(w* + o*/n) 








l 
oa (np? + no^ — np’ — o?) 


Therefore, the sample variance $7 is an unbiased estimator of the population variance o. 


Although $^ is unbiased for o°, S is a biased estimator of c. For large samples, the bias is very 
small. However, there are good reasons for using S as an estimator of o in samples from nor- 
mal distributions, as we will see in the next three chapters when are discuss confidence 
intervals and hypothesis testing. 
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Sometimes there are several unbiased estimators of the sample population parameter. For 
example, suppose we take a random sample of size n = 10 from a normal population and 
obtain the data x, = 12.8, x; = 9.4, x4 = 8.7, x, = 11.6, x5 = 13.1, x, = 9.8, x, = 14.1, 
Xg = 8.5, x9 = 12.1, x = 10.3. Now the sample mean is 


12.5 + 9.4 + 8.7 + LEO + 13.1 + 9.8 + 14.1 + 8.5 + 12.1 + 10.3 
10 
= 11.04 


the sample median is 


~ 10.3 + 11.6 
X 3 = 10.95 


and a 10% trimmed mean (obtained by discarding the smallest and largest 10% of the sample 
before averaging) is 


_ 8.7 + 9.4 + 9.8 + 10.3 + 11.6 + 12.1 + 12.8 + 13.1 
Xtr(10) — ns 
= 10.98 
We can show that all of these are unbiased estimates of u. Since there is not a unique unbiased 


estimator, we cannot rely on the property of unbiasedness alone to select our estimator. We 
need a method to select among unbiased estimators. We suggest a method in Section 7-2.3. 


7-2.2 Proof That S is a Biased Estimator of o (CD Only) 


7-2.3 Variance of a Point Estimator 


Definition 


Figure 7-1 The 
sampling distributions 
of two unbiased estima- 
tors 6, and Ô. 


Suppose that Ô, and Ô, are unbiased estimators of 0. This indicates that the distribution of 
each estimator 1s centered at the true value of 0. However, the variance of these distributions 
may be different. Figure 7-1 illustrates the situation. Since ©, has a smaller variance than O,, 
the estimator ©, is more likely to produce an estimate close to the true value 0. A logical prin- 
ciple of estimation, when selecting among several estimators, is to choose the estimator that 
has minimum variance. 


If we consider all unbiased estimators of 0, the one with the smallest variance 1s 


called the minimum variance unbiased estimator (MVUE). 








^ 
Distribution of ©, 


^ 
Distribution of ©, 


Theorem 7-1 


7-2 GENERAL CONCEPTS OF POINT ESTIMATION 225 


In a sense, the MVUE is most likely among all unbiased estimators to produce an estimate 0 
that is close to the true value of 0. It has been possible to develop methodology to identify the 
MVUE in many practical situations. While this methodology is beyond the scope of this book, 
we give one very important result concerning the normal distribution. 


If Xj, X, ..., X, isa random sample of size n from a normal distribution with mean 


u and variance o7, the sample mean X is the MVUE for p. 





In situations in which we do not know whether an MVUE exists, we could still use a minimum 
variance principle to choose among competing estimators. Suppose, for example, we wish to es- 
timate the mean of a population (not necessarily a normal population). We have a random sample 
of n observations X1, X5, ... , X, and we wish to compare two possible estimators for w: the sam- 
ple mean X and a single observation from the sample, say, X;. Note that both X and X; are unbi- 
ased estimators of p; for the sample mean, we have V(X) = o^/n from Equation 5-40b and the 
variance of any observation is V(X;) = o^. Since V(X) < V(X;) for sample sizes n = 2, we 
would conclude that the sample mean is a better estimator of u than a single observation X;. 


7-2.4 Standard Error: Reporting a Point Estimate 


Definition 


When the numerical value or point estimate of a parameter is reported, it is usually desirable 
to give some idea of the precision of estimation. The measure of precision usually employed 
is the standard error of the estimator that has been used. 


The standard error of an estimator Ô is its standard deviation, given by 


^ 


og = V V(O). If the standard error involves unknown parameters that can be esti- 


mated, substitution of those values into og produces an estimated standard error, 
denoted by 65. 





^ 


Sometimes the estimated standard error is denoted by sa or se(9). 
Suppose we are sampling from a normal distribution with mean p and variance Oo, Now 
the distribution of X is normal with mean w and variance o^/ n, so the standard error of X is 


If we did not know o but substituted the sample standard deviation 5 into the above equation, 
the estimated standard error of X would be 


When the estimator follows a normal distribution, as in the above situation, we can be rea- 
sonably confident that the true value of the parameter lies within two standard errors of the 
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EXAMPLE 7?-2 


estimate. Since many point estimators are normally distributed (or approximately so) for large 
n, this is a very useful result. Even in cases in which the point estimator is not normally 
distributed, we can state that so long as the estimator 1s unbiased, the estimate of the parameter 
will deviate from the true value by as much as four standard errors at most 6 percent of the time. 
Thus a very conservative statement is that the true value of the parameter differs from the point 
estimate by at most four standard errors. See Chebyshev’s inequality in the CD only material. 


An article in the Journal of Heat Transfer (Trans. ASME, Sec. C, 96, 1974, p. 59) described 
a new method of measuring the thermal conductivity of Armco iron. Using a temperature of 
100°F and a power input of 550 watts, the following 10 measurements of thermal conductiv- 
ity (in Btu/hr-ft-°F) were obtained: 


41.60, 41.48, 42.34, 41.95, 41.86, 
42.18, 41.72, 42.26, 41.81, 42.04 


A point estimate of the mean thermal conductivity at 100°F and 550 watts is the sample mean or 
x = 41.924 Btu/hr-ft-°F 


The standard error of the sample mean is oy = o/ Vn, and since ø is unknown, we may replace 
it by the sample standard deviation s — 0.284 to obtain the estimated standard error of X as 


" S 0.284 0.0898 
Oy ———————oce— : 
X Vn V10 


Notice that the standard error is about 0.2 percent of the sample mean, implying that we have ob- 
tained a relatively precise point estimate of thermal conductivity. If we can assume that thermal 
conductivity is normally distributed, 2 times the standard error is 26 y = 2(0.0898) = 0.1796, 
and we are highly confident that the true mean thermal conductivity is with the interval 
41.924 + 0.1756, or between 41.744 and 42.104. 


7-2.5 Bootstrap Estimate of the Standard Error (CD Only) 


7-2.6 Mean Square Error of an Estimator 


Definition 


Sometimes it 1s necessary to use a biased estimator. In such cases, the mean square error of the 
estimator can be important. The mean square error of an estimator O is the expected squared 
difference between © and 0. 


The mean square error of an estimator © of the parameter 0 is defined as 


MSE(@) = E(® — 0y 





The mean square error can be rewritten as follows: 


MSE(O) = E[Ó — E()P + [0 — E(O)/? 


Figure 7-2 <A biased 
estimator 0, that has 
smaller variance than 
the unbiased estimator 


Ô.. 
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^ 
Distribution of ©, 


^ 
Distribution of O» 


0 E(Ó) 


That is, the mean square error of Ó is equal to the variance of the estimator plus the squared bias. 
If © is an unbiased estimator of 0, the mean square error of © is equal to the variance of Ô. 

The mean square error is an important criterion for comparing two estimators. Let Ô} 
and Ô, be two estimators of the parameter 0, and let MSE (0,) and MSE (@,) be the mean 
square errors of ©, and @,. Then the relative efficiency of ©, to ©, is defined as 


MSE(0,) 
IE S (7-4) 
MSE(@,) 
If this relative efficiency is less than 1, we would conclude that ©, is a more efficient estima- 
tor of 0 than @,, in the sense that it has a smaller mean square error. 

Sometimes we find that biased estimators are preferable to unbiased estimators because they 
have smaller mean square error. That is, we may be able to reduce the variance of the estimator 
considerably by introducing a relatively small amount of bias. As long as the reduction in variance 
is greater than the squared bias, an improved estimator from a mean square error viewpoint will 
result. For example, Fig. 7-2 shows the probability distribution of a biased estimator Ô; that has 
a smaller variance than the unbiased estimator Ô. An estimate based on ©, would more likely 
be close to the true value of 0 than would an estimate based on @,. Linear regression analysis 
(Chapters 11 and 12) is an area in which biased estimators are occasionally used. 

An estimator €) that has a mean square error that is less than or equal to the mean square 
error of any other estimator, for all values of the parameter 0, is called an optimal estimator 
of 0. Optimal estimators rarely exist. 


EXERCISES FOR SECTION 7-2 


7-1. Suppose we have a random sample of size 2n from a 
population denoted by X, and E(X) = p and V(X) = o°. Let 


E 1 2n 
X = X, 


a 
2n an 2 


1 n 
n 2 


i=1 
be two estimators of u. Which is the better estimator of u? 
Explain your choice. 

7-2. Let Xi, X5,...,X7 denote a random sample from a 


population having mean u and variance o?. Consider the 
following estimators of w: 





~ Xt Xt +X. 

Ô, = 1 2 7 
7 

~ 2X, -X + X, 

Ô, = 1 6 4 
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(a) Is either estimator unbiased? 

(b) Which estimator is best? In what sense is it best? 

7-3. Suppose that Ô, and Ô, are unbiased estimators of the 
parameter 0. We know that V(@,) = 10 and V(@,) = 4. 
Which estimator is best and in what sense is it best? 

7-4. Calculate the relative efficiency of the two estimators 
in Exercise 7-2. 

7-5. Calculate the relative efficiency of the two estimators 
in Exercise 7-3. 

7-6. Suppose that Ô, and Ô, are estimators of the parame- 
ter 0. We know that E(@,) = 0, E(O,) = 6/2, V(Ó,) = 10, 
V(®,) = 4. Which estimator is best? In what sense is it best? 
7-7. Suppose that Ó,, Ó,, and Ô, are estimators of 0. We 
know that E(@,) = E(0;,) = 6, E(Ô,) # 6, V(O,) = 12, 
V(Ó,) = 10, and E(@; — 0)? = 6. Compare these three esti- 
mators. Which do you prefer? Why? 
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7-8. Let three random samples of sizes n, = 20, n; = 10, 
and n, = 8 be taken from a population with mean pu and 
variance o?. Let SÊ, 55. and S. be the sample variances. 
Show that S? = (2087 + 10S + 853)/38 is an unbiased 
estimator of o°. 


7-9. (a) Show that 9; , (X; — X)?/n is a biased estimator 
ofo. 

(b) Find the amount of bias in the estimator. 

(c) What happens to the bias as the sample size n increases? 
7-10. Let Xj, X5, ..., X, be a random sample of size n from 
a population with mean w and variance o°. 
(a) Show that X? is a biased estimator for u”. 

(b) Find the amount of bias in this estimator. 

(c) What happens to the bias as the sample size n increases? 


7-11. Data on pull-off force (pounds) for connectors used in 
an automobile engine application are as follows: 79.3, 75.1, 
78.2, 74.1, 73.9, 75.0, 77.6, 77.3, 73.8, 74.6, 75.5, 74.0, 74.7, 
75.9, 72.9, 73.8, 74.2, 78.1, 75.4, 76.3, 75.3, 76.2, 74.9, 78.0, 
75.1, 76.8. 

(a) Calculate a point estimate of the mean pull-off force of all 
connectors in the population. State which estimator you 
used and why. 

(b) Calculate a point estimate of the pull-off force value that 
separates the weakest 5096 of the connectors in the popu- 
lation from the strongest 509^. 

(c) Calculate point estimates of the population variance and 
the population standard deviation. 

(d) Calculate the standard error of the point estimate found in 
part (a). Provide an interpretation of the standard error. 

(e) Calculate a point estimate of the proportion of all connec- 
tors in the population whose pull-off force is less than 
73 pounds. 


7-12. Data on oxide thickness of semiconductors are as 

follows: 425, 431, 416, 419, 421, 436, 418, 410, 431, 433, 

423, 426, 410, 435, 436, 428, 411, 426, 409, 437, 422, 428, 

413, 416. 

(a) Calculate a point estimate of the mean oxide thickness for 
all wafers in the population. 

(b) Calculate a point estimate of the standard deviation of 
oxide thickness for all wafers in the population. 

(c) Calculate the standard error of the point estimate from 
part (a). 

(d) Calculate a point estimate of the median oxide thickness 
for all wafers in the population. 

(e) Calculate a point estimate of the proportion of wafers in 
the population that have oxide thickness greater than 430 
angstrom. 


7-13. X, X5,..., X, 1s a random sample from a normal 
distribution with mean u and variance o?. Let Xin and Xmax 
be the smallest and largest observations in the sample. 

(a) Is (Xmin + X max)/2 an unbiased estimate for p? 

(b) What 1s the standard error of this estimate? 

(c) Would this estimate be preferable to the sample mean X? 
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7-14. Suppose that X is the number of observed “successes” 

in a sample of n observations where p is the probability of 

success on each observation. 

(a) Show that p= X/n is an unbiased estimator of p. 

(b) Show that the standard error of P is Vp(1 — p)/n. 
How would you estimate the standard error? 


7-15. X, and S? are the sample mean and sample variance 
from a population with mean p. and variance gô. Similarly, X; 
and 5$ are the sample mean and sample variance from a sec- 
ond independent population with mean u; and variance o3. 
The sample sizes are nı and n», respectively. 

(a) Show that X, — X, is an unbiased estimator of u; — p. 
(b) Find the standard error of X; — X>. How could you 

estimate the standard error? 


1-16. Continuation of Exercise 7-15. Suppose that both pop- 
ulations have the same variance; that is, of = 04 = o>. Show 


that 


oo (m DS + (m= 83 
d Ny + Ny — 2 


is an unbiased estimator of o°. 


7-17. Two different plasma etchers in a semiconductor fac- 
tory have the same mean etch rate ju. However, machine 1 is 
newer than machine 2 and consequently has smaller variability 
in etch rate. We know that the variance of etch rate for machine 

1 is o7 and for machine 2 it is o5 = ao1. Suppose that we have 

n; independent observations on etch rate from machine 1 and n, 

independent observations on etch rate from machine 2. 

(a) Show that (u = a X, + (1 — a) X, is an unbiased estima- 
tor of u for any value of a between 0 and 1. 

(b) Find the standard error of the point estimate of u in part (a). 

(c) What value of a would minimize the standard error of the 
point estimate of p? 

(d) Suppose that a = 4 and n, = 2n,. What value of a would 
you select to minimize the standard error of the point esti- 
mate of u. How “bad” would it be to arbitrarily choose 
a = 0.5 in this case? 


7-18. Of n, randomly selected engineering students at ASU, 

X, owned an HP calculator, and of n) randomly selected 

engineering students at Virginia Tech X; owned an HP calculator. 

Let p, and p, be the probability that randomly selected ASU and 

Va. Tech engineering students, respectively, own HP calculators. 

(a) Show that an unbiased estimate for p, — p> is (X,/n,) — 
(X»/n;). 

(b) What is the standard error of the point estimate in 
part (a)? 

(c) How would you compute an estimate of the standard error 
found in part (b)? 

(d) Suppose that n, = 200, X, = 150, n, = 250, and X, = 185. 
Use the results of part (a) to compute an estimate of p, — pp. 

(e) Use the results in parts (b) through (d) to compute an 
estimate of the standard error of the estimate. 
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4-3. METHODS OF POINT ESTIMATION 


The definitions of unbiasness and other properties of estimators do not provide any guidance 
about how good estimators can be obtained. In this section, we discuss two methods for ob- 
taining point estimators: the method of moments and the method of maximum likelihood. 
Maximum likelihood estimates are generally preferable to moment estimators because they 
have better efficiency properties. However, moment estimators are sometimes easier to com- 
pute. Both methods can produce unbiased point estimators. 


7-3.1 Method of Moments 


Definition 


Definition 


EXAMPLE 7-3 


The general idea behind the method of moments is to equate population moments, which are 
defined in terms of expected values, to the corresponding sample moments. The population 
moments will be functions of the unknown parameters. Then these equations are solved to 
yield estimators of the unknown parameters. 


Let X;, X5, ..., X, be a random sample from the probability distribution f(x), where 
f(x) can be a discrete probability mass function or a continuous probability density 


function. The kth population moment (or distribution moment) is E(X*), k = 
1, 2, .... The corresponding kth sample moment is (1/1) X;-; X$, k = 1,2, .... 





To illustrate, the first population moment is E(X) = p, and the first sample moment is 
(1/n) S;-, X; = X. Thus by equating the population and sample moments, we find that 
(i. = X. That is, the sample mean is the moment estimator of the population mean. In the 
general case, the population moments will be functions of the unknown parameters of the dis- 
tribution, say, 04, 05, ..., Op- 


Let Xi, X5, ..., X, be a random sample from either a probability mass function 
or probability density function with m unknown parameters 04, 05,..., Om. The 


moment estimators ©,, 65, ..., Ó,, are found by equating the first m population 
moments to the first m sample moments and solving the resulting equations for the 
unknown parameters. 





Suppose that X;, X5, ... , X, isa random sample from an exponential distribution with param- 
eter A. Now there is only one parameter to estimate, so we must equate E(X) to X. For the 


exponential, E(X) — 1/A. Therefore E(X) — X results in 1/A — X, so À — 1/X is the 
moment estimator of A. 


As an example, suppose that the time to failure of an electronic module used in an automobile 
engine controller is tested at an elevated temperature to accelerate the failure mechanism. 
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EXAMPLE 7-4 


EXAMPLE 7-5 


The time to failure is exponentially distributed. Eight units are randomly selected and 
tested, resulting in the following failure time (in hours): x, = 11.96, x, = 5.03, x; = 67.40, x, 
= 16.07, x; = 31.50, x; = 7.73, x; = 11.10, and x, = 22.38. Because x = 21.65, the moment 
estimate of A isA = 1/x = 1/21.65 = 0.0462. 


aes that X;, X5, ..., X, is a random sample from a normal distribution with parameters p 


and c? For the Rona distribution E(X) = wand E(X’) = y? + o?. Equating E(X) to X and 
E(X?) tol X;-1X? gives 


p= X, -3 DX 


S 


Solving these equations gives the moment estimators 


n 2 n 
$x-(Gàx) ža- 
RD ^2 i= i=1 i= 


, n n 


Notice that the moment estimator of o? is not an unbiased estimator. 


Suppose that Xj, X>, ..., X, 1S a random sample from a gamma distribution with parameters r 
and \. For the gamma distribution E(X) = r/A and E(X’) = r(r + 1)/A7. The moment esti- 
mators are found by solving 


r/\ =X, r(r t+ 1/A* = D X; 


The resulting estimators are 


. xe F X 
r= ? A = 
(1/n) X — X? (1/n) >) X? — 
i=1 rel 


To illustrate, consider tie time to failure data introduced following Example 7-3. For this data, 
x = 21.65 and pa ix? = 6639.40, so the moment estimates are 


u (2L65)* — i- 21.65 MT 
'  (1/8)6638940 — 21.65? | 7^ (1/8)6639.40 — (21.05? ` 


When r = 1, the gamma reduces to the exponential distribution. Because 7 slightly exceeds 
unity, it is quite possible that either the gamma or the exponential distribution would provide 
a reasonable model for the data. 


7-3.2 Method of Maximum Likelihood 


One of the best methods of obtaining a point estimator of a parameter is the method of maxi- 
mum likelihood. This technique was developed in the 1920s by a famous British statistician, 
Sir R. A. Fisher. As the name implies, the estimator will be the value of the parameter that 
maximizes the likelihood function. 


Definition 


EXAMPLE 7-6 
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Suppose that X is a random variable with probability distribution f(x; 9), where 0 is 
a single unknown parameter. Let x,, X2, ... , x, be the observed values in a random 
sample of size n. Then the likelihood function of the sample is 


2) es nene oe neces yes.) D) (7-5) 


Note that the likelihood function is now a function of only the unknown parameter 0. 
The maximum likelihood estimator of 0 is the value of 0 that maximizes the like- 
lihood function L(6). 





In the case of a discrete random variable, the interpretation of the likelihood function is 
clear. The likelihood function of the sample L(6) is just the probability 


P(X; = X1, X> = 5 re. = x 


That is, L(0) is just the probability of obtaining the sample values x,, x2, ..., x,. Therefore, in 
the discrete case, the maximum likelihood estimator is an estimator that maximizes the prob- 
ability of occurrence of the sample values. 


Let X be a Bernoulli random variable. The probability mass function 1s 


p-p)* x=0,1 
0, otherwise 


f(x; p) = l 


where p is the parameter to be estimated. The likelihood function of a random sample of size 
n is 


L(p) sog — p) ^ p*(l — p) *--p*(l - p)» 
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We observe that if p maximizes L( p), p also maximizes In L( p). Therefore, 


In L(p) — (> x) In p + (^ - 5 sj In(1 — p) 


i=1 i=1 


Now 





Equating this to zero and solving for p yields f = (1/n) X, x;. Therefore, the maximum 
likelihood estimator of p is 


234 CHAPTER 7 POINT ESTIMATION OF PARAMETERS 


EXAMPLE 7-7 


EXAMPLE 7-8 


Suppose that this estimator was applied to the following situation: n items are selected 
at random from a production line, and each item is judged as either defective (in which case 
we set x, = 1) or nondefective (in which case we set x, = 0). Then Y. x; is the number of 
defective units in the sample, and p is the sample proportion defective. The parameter p is 
the population proportion defective; and it seems intuitively quite reasonable to use p as 
an estimate of p. 

Although the interpretation of the likelihood function given above is confined to the dis- 
crete random variable case, the method of maximum likelihood can easily be extended to a 
continuous distribution. We now give two examples of maximum likelihood estimation for 
continuous distributions. 


Let X be normally distributed with unknown w and known variance o^. The likelihood 
function of a random sample of size n, say X,, X>, ..., Xps 1s 





L(p) = II — eg ei M Qe?) — T€ g 02052 um 
Now 
In L(w) = —(n/2) In(2«o?) — (20?) ! Y = 
and 


d\n L(p) PEE. E 
dy. = (0°) > (x; — m) 


Equating this last result to zero and solving for u yields 





TES m X 
Thus the sample mean is the maximum likelihood estimator of u. Notice that this is identical 
to the moment estimator. 
Let X be exponentially distributed with parameter N. The likelihood function of a random 
sample of size n, say Xj, X», ... , Xp 1s 
n 
L(A) = [[ Ae = A'e è 


The log likelihood is 


In L(A) 2 nlnÀ - A S x; 
=i 


l= 


-32.59 


-32.61 


-32.63 


-32.65 


Log likelihood 


-32.67 


-32.69 
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Now 


dln L(A) 
dÀ 


n 
E > Xi 
i=l 


>]|a 


and upon equating this last result to zero we obtain 


A=n/ >, X= 1/X 
i=] 


Thus the maximum likelihood estimator of À is the reciprocal of the sample mean. Notice that 
this is the same as the moment estimator. 


It 1s easy to illustrate graphically just how the method of maximum likelihood works. 
Figure 7-3(a) plots the log of the likelihood function for the exponential parameter from 
Example 7-8, using the n = 8 observations on failure time given following Example 7-3. We 
found that the estimate of \ was A = 0.0462. From Example 7-8, we know that this is a 
maximum likelihood estimate. Figure 7-3(a) shows clearly that the log likelihood function is 
maximized at a value of À that is approximately equal to 0.0462. Notice that the log likelihood 
function is relatively flat in the region of the maximum. This implies that the parameter is not 
estimated very precisely. If the parameter were estimated precisely, the log likelihood function 
would be very peaked at the maximum value. The sample size here is relatively small, and this 
has led to the imprecision in estimation. This is illustrated in Fig. 7-3(b) where we have plot- 
ted the difference in log likelihoods for the maximum value, assuming that the sample sizes 
were n — 8, 20, and 40 but that the sample average time to failure remained constant at 
x = 21.65. Notice how much steeper the log likelihood is for n = 20 in comparsion to n = 8, 
and for n — 40 in comparison to both smaller sample sizes. 

The method of maximum likelihood can be used in situations where there are several un- 
known parameters, say, 0,, 05, ... , 0, to estimate. In such cases, the likelihood function is a func- 
tion of the k unknown parameters 0,, 065, ..., 0,, and the maximum likelihood estimators (65 
would be found by equating the k partial derivatives 0L(0;, 05, ..., 0,)/905, i = 1,2, ..., k to 
zero and solving the resulting system of equations. 
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(a) (b) 


Figure 7-3 Log likelihood for the exponential distribution, using the failure time data. (a) Log likelihood with n = 8 (original 
data). (b) Log likelihood ifn = 8, 20, and 40. 
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EXAMPLE 7-9 Let X be normally distributed with mean wp and variance o?, where both u and o? are 
unknown. The likelihood function for a random sample of size n is 
Z 1 


- ~(1/202) > (x;— 9 





L 2) — -Qu-uy/(Qm)— 7 — = 
(m °) I o V2m * (2mo y É i 
and 
» 2 | oS 2 
In L(p, 0^) = — = In(2207^) — 2 (x; — pu) 
e uel 
Now 


ð In L(p, o?) 1 
ee E — = = 0 
a(o?) 20? T 364 ; 2. ey 


The solutions to the above equation yield the maximum likelihood estimators 


n 


xc E um d - 
TEL & -- V Q-Xy 
i-l 


Once again, the maximum likelihood estimators are equal to the moment estimators. 


Properties of the Maximum Likelihood Estimator 

The method of maximum likelihood is often the estimation method that mathematical statisti- 
cians prefer, because it is usually easy to use and produces estimators with good statistical 
properties. We summarize these properties as follows. 


Properties of 


the Maximum Under very general and not restrictive conditions, when the sample size n is large and 


Likelihood if Ô is the maximum likelihood estimator of the parameter 0, 
Estimator 


(1) @ is an approximately unbiased estimator for 0 [E(@) = 0], 


(2) the variance of Ô is nearly as small as the variance that could be obtained 
with any other estimator, and 


(3) Ó has an approximate normal distribution. 





Properties 1 and 2 essentially state that the maximum likelihood estimator is approxi- 
mately an MVUE. This is a very desirable result and, coupled with the fact that it 1s fairly easy 
to obtain in many situations and has an asymptotic normal distribution (the “asymptotic” 
means “when 7 is large"), explains why the maximum likelihood estimation technique is 
widely used. To use maximum likelihood estimation, remember that the distribution of the 
population must be either known or assumed. 


The Invariance 
Property 


EXAMPLE 7-10 


EXAMPLE 7-11 
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To illustrate the “large-sample” or asymptotic nature of the above properties, consider the 
maximum likelihood estimator for o°, the variance of the normal distribution, in Example 7-9. 
It is easy to show that 





The bias 1s 








Because the bias is negative, G* tends to underestimate the true variance c?. Note that the 
bias approaches zero as n increases. Therefore, 6^ is an asymptotically unbiased estimator 
for o?. 

We now give another very important and useful property of maximum likelihood 


estimators. 


Let Ó,, 65, ..., Ô, be the maximum likelihood estimators of the parameters 6,, 
05, ..., 0,. Then the maximum likelihood estimator of any function /(0,, 05, ..., 0,) 


of these parameters is the same function /(0,, @),...,@,) of the estimators 


Ó,, Ô, ..., Ô. 





In the normal distribution case, the maximum likelihood estimators of and o? were Ñ = X 
and ô? = YX;.,(X; — X Y/n. To obtain the maximum likelihood estimator of the function 


h(u, o?) = Vo? = g, substitute the estimators (i and 6^ into the function h, which yields 


; 12 _ ]/2 
sa vE -|i S any] 


i=1 


Thus, the maximum likelihood estimator of the standard deviation o is not the sample 
standard deviation S. 


Complications in Using Maximum Likelihood Estimation 

While the method of maximum likelihood is an excellent technique, sometimes complications 
arise in its use. For example, it is not always easy to maximize the likelihood function because 
the equation(s) obtained from dL (0)/d0 = 0 may be difficult to solve. Furthermore, it may 
not always be possible to use calculus methods directly to determine the maximum of L(6). 
These points are illustrated in the following two examples. 


Let X be uniformly distributed on the interval 0 to a. Since the density function is f(x) = 1/a 
for 0 S x S a and zero otherwise, the likelihood function of a random sample of size n is 

E l 

L(a) = Il a = om 


i=1 a 
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Figure 7-4 The like- 
lihood function for the 
uniform distribution in 
Example 7-10. 


EXAMPLE 7-12 


L(a) 


0 Max (xj) a 


fOsx,Sa,05x% Sa,...,0 =x, <a. Note that the slope of this function is not zero 
anywhere. That is, as long as max(x;) X a, the likelihood is 1/a”, which is positive, but when 
a « max(x;), the likelihood goes to zero, as illustrated in Fig. 7-4. Therefore, calculus meth- 
ods cannot be used directly because the maximum value of the likelihood function occurs at 
a point of discontinuity. However, since d/da(a ") — —n/a"^ * ! is less than zero for all val- 
ues of a > 0,a " is a decreasing function of a. This implies that the maximum of the likeli- 
hood function L(a) occurs at the lower boundary point. The figure clearly shows that we 
could maximize L(a) by setting à equal to the smallest value that it could logically take on, 
which is max(x;). Clearly, a cannot be smaller than the largest sample observation, so setting 
à equal to the largest sample value is reasonable. 


Let Xi, X5, ... , X, be a random sample from the gamma distribution. The log of the likelihood 
function is 


In L(r, A) = In 
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The derivatives of the log likelihood are 





ð In L(r, A) i D" (r) 
t 5s UE In(A) + 2 In(x;) — n P(r) 
ð In L(r, A) u nr i 

A A H 


i=1 


When the derivatives are equated to zero, we obtain the equations that must be solved to find 
the maximum likelihood estimators of r and A: 





^ r 
A= 
X 
" n I’ (7) 
A ln(A) + l — 
A) + E n(x) = n 


There is no closed form solution to these equations. 
Figure 7-5 shows a graph of the log likelihood for the gamma distribution using the n = 8 
observations on failure time introduced previously. Figure 7-5(a) shows the log likelihood 





0077 o s 
0.075 1.58 


(a) 
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Figure 7-5 Log likelihood for the gamma distribution using the failure time data. (a) Log likelihood surface. (b) Contour plot. 


surface as a function ofr and À, and Figure 7-5(b) is a contour plot. These plots reveal that the 
log likelihood is maximized at approximately ? — 1.75 and À — 0.08. Many statistics com- 
puter programs use numerical techniques to solve for the maximum likelihood estimates when 


no simple solution exists. 


7-3.3 Bayesian Estimation of Parameters (CD Only) 


EXERCISES FOR SECTION 7-3 


7-19. Consider the Poisson distribution 


—Aqx 
fi) = =, 


x! 





xX—0.1,2,« 


Find the maximum likelihood estimator of A, based on a 
random sample of size n. 


1-20. Consider the shifted exponential distribution 


Jao =e 4-9 x=0 

When 0 = 0, this density reduces to the usual exponential 

distribution. When 0 — 0, there is only positive probability to 

the right of 0. 

(a) Find the maximum likelihood estimator of A and 0, based 
on a random sample of size n. 

(b) Describe a practical situation in which one would suspect 
that the shifted exponential distribution is a plausible 
model. 


7-21. Let X be a geometric random variable with parameter 
p. Find the maximum likelihood estimator of p, based on a 
random sample of size n. 

1-22. LetX be a random variable with the following proba- 
bility distribution: 


(0c 1)9, 0sx=1 


w=] 0, 


Find the maximum likelihood estimator of 8, based on a random 
sample of size n. 


otherwise 


7-23. Consider the Weibull distribution 
Gy eG 0 
ES < 
fQ)248N8) © >”? t 
0 , otherwise 


(a) Find the likelihood function based on a random sample of 
size n. Find the log likelihood. 
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(b) Show that the log likelihood is maximized by solving the 





equations 
n n = 1 
xP In(x;) In(x;) 
i-1 i-1 
pee 
x 
i=l 
n 1/8 
—. I= 
ò = n 


(c) What complications are involved in solving the two equa- 
tions in part (b)? 

1-24. Consider the probability distribution in Exercise 7-22. 

Find the moment estimator of 0. 


7-25. Let Xj, X, ... , X, be uniformly distributed on the in- 
terval 0 to a. Show that the moment estimator of a is â = 2X. 
Is this an unbiased estimator? Discuss the reasonableness of 
this estimator. 


1-26. Let Xj, X,,..., X, be uniformly distributed on the 
interval 0 to a. Recall that the maximum likelihood estimator 
ofaisá = max(X;). 

(a) Argue intuitively why à cannot be an unbiased estimator 
for a. 

(b) Suppose that E(à) = na/(n + 1). Is it reasonable that à 
consistently underestimates a? Show that the bias in the 
estimator approaches zero as n gets large. 

(c) Propose an unbiased estimator for a. 

(d) Let Y = max(X;). Use the fact that Y = y if and only 
if each X; S y to derive the cumulative distribution func- 
tion of Y. Then show that the probability density function 
of Y is 





=i 
ny” 
, O=y=a 
fe i 
0 , otherwise 


Use this result to show that the maximum likelihood esti- 
mator for a is biased. 
1-241. For the continuous distribution of the interval 0 to a, 
we have two unbiased estimators for a: the moment estimator 
â; = 2X and à; = [(n  1)/n] max(X;), where max(X;) is 
the largest observation in a random sample of size n (see 
Exercise 7-26). It can be shown that V(à,) = a^/(3n) and that 
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V(à;) = a?/[n(n + 2)]. Show that if n > 1, à; is a better 
estimator than à. In what sense 1s 1t a better estimator of a? 


1-28. Consider the probability density function 


f(x) = — xe "9, 0zxx«o, 0«0«o 


Find the maximum likelihood estimator for 0. 


1-29. 
function 


The Rayleigh distribution has probability density 


e 28 0—0-«o 


fa) = x>0, 


(a) It can be shown that E(X 2) = 20. Use this information to 
construct an unbiased estimator for 0. 

(b) Find the maximum likelihood estimator of 0. Compare 
your answer to part (a). 

(c) Use the invariance property of the maximum likelihood 
estimator to find the maximum likelihood estimator of the 
median of the Raleigh distribution. 


7-30. Consider the probability density function 


(a) Find the value of the constant c. 

(b) What is the moment estimator for 0? 

(c) Show that 6 = 3X is an unbiased estimator for 0. 
(d) Find the maximum likelihood estimator for 0. 


7-31. Reconsider the oxide thickness data in Exercise 7-12 

and suppose that it is reasonable to assume that oxide thick- 

ness is normally distributed. 

(a) Use the results of Example 7-9 to compute the maximum 
likelihood estimates of p. and o. 

(b) Graph the likelihood function in the vicinity of Ñ and 67, 
the maximum likelihood estimates, and comment on its 
shape. 


1-32. Continuation of Exercise 7-31. Suppose that for the 
situation of Exercise 7-12, the sample size was larger (n — 40) 
but the maximum likelihood estimates were numerically 
equal to the values obtained in Exercise 7-31. Graph the 
likelihood function for n — 40, compare it to the one from 
Exercise 7-31 (b), and comment on the effect of the larger 
sample size. 


Statistical inference is concerned with making decisions about a population based on the 
information contained in a random sample from that population. For instance, we may be 
interested in the mean fill volume of a can of soft drink. The mean fill volume in the 
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population is required to be 300 milliliters. An engineer takes a random sample of 25 cans and 
computes the sample average fill volume to be x = 298 milliliters. The engineer will probably 
decide that the population mean is w = 300 milliliters, even though the sample mean was 
298 milliliters because he or she knows that the sample mean is a reasonable estimate of u and 
that a sample mean of 298 milliliters is very likely to occur, even if the true population mean is 
| = 300 milliliters. In fact, if the true mean is 300 milliliters, tests of 25 cans made repeatedly, 
perhaps every five minutes, would produce values of x that vary both above and below u = 
300 milliliters. 

The sample mean is a statistic; that is, it is a random variable that depends on the results 
obtained in each particular sample. Since a statistic is a random variable, it has a probability 
distribution. 


Definition 


The probability distribution of a statistic 1s called a sampling distribution. 





For example, the probability distribution of X is called the sampling distribution of the 
mean. 

The sampling distribution of a statistic depends on the distribution of the population, the 
size of the sample, and the method of sample selection. The next section presents perhaps the 
most important sampling distribution. Other sampling distributions and their applications will 
be illustrated extensively in the following two chapters. 
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Consider determining the sampling distribution of the sample mean X. Suppose that a random 
sample of size n is taken from a normal population with mean p. and variance o°. Now each 
observation in this sample, say, X;, X, ..., X,, 1s a normally and independently distributed 
random variable with mean u and variance o7. Then by the reproductive property of the 
normal distribution, Equation 5-41 in Chapter 5, we conclude that the sample mean 


X= F 


has a normal distribution with mean 


and variance 


5 duod wg te to o 
CC. = =e om — 
X n? n 


If we are sampling from a population that has an unknown probability distribution, the 
sampling distribution of the sample mean will still be approximately normal with mean pu and 
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Theorem 7-2: 
The Central 
Limit Theorem 


Figure 7-6 
Distributions of average 
scores from throwing 
dice. [Adapted with 
permission from Box, 
Hunter, and Hunter 
(1978).] 





variance o^/n, if the sample size n is large. This is one of the most useful theorems in statis- 
tics, called the central limit theorem. The statement is as follows: 


If Xi, X5, ..., X, is a random sample of size n taken from a population (either finite 
or infinite) with mean w and finite variance o^, and if X is the sample mean, the lim- 
iting form of the distribution of 


 X-u 
7 a/Vn 


A (7-6) 


as n — ©, is the standard normal distribution. 


The normal approximation for X depends on the sample size n. Figure 7-6(a) shows the 
distribution obtained for throws of a single, six-sided true die. The probabilities are equal 
(1/6) for all the values obtained, 1, 2, 3, 4, 5, or 6. Figure 7-6(b) shows the distribution of the 
average score obtained when tossing two dice, and Fig. 7-6(c), 7-6(d), and 7-6(e) show the 
distributions of average scores obtained when tossing three, five, and ten dice, respectively. 
Notice that, while the population (one die) is relatively far from normal, the distribution of 
averages 1S approximated reasonably well by the normal distribution for sample sizes as small 
as five. (The dice throw distributions are discrete, however, while the normal is continuous). 
Although the central limit theorem will work well for small samples (n — 4, 5) in most cases, 
particularly where the population is continuous, unimodal, and symmetric, larger samples will 
be required in other situations, depending on the shape of the population. In many cases of 
practical interest, if n = 30, the normal approximation will be satisfactory regardless of the 
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(b) Two dice 
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(d) Five dice 
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(e) Ten dice 








EXAMPLE 7-13 


EXAMPLE 7-14 
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shape of the population. If n < 30, the central limit theorem will work if the distribution of the 
population is not severely nonnormal. 


An electronics company manufactures resistors that have a mean resistance of 100 ohms and 
a standard deviation of 10 ohms. The distribution of resistance is normal. Find the probability 
that a random sample of n — 25 resistors will have an average resistance less than 95 ohms. 

Note that the sampling distribution of X is normal, with mean wy = 100 ohms and a 
standard deviation of 


|. 0 10 
= VET Vas 
Therefore, the desired probability corresponds to the shaded area in Fig. 7-7. Standardizing 
the point X = 95 in Fig. 7-7, we find that 


|.95— 100 — 


m 
2 


Z 


and therefore, 


P(X 95) e P935 
— 0.0062 


The following example makes use of the central limit theorem. 


Suppose that a random variable X has a continuous uniform distribution 


fG) = s 4<x<6 


0, otherwise 


Find the distribution of the sample mean of a random sample of size n = 40. 

The mean and variance of X are p = 5 and o? = (6 — 4)?/12 = 1/3. The central limit 
theorem indicates that the distribution of X is approximately normal with mean uy = 5 and 
variance o= o?/n = 1/[3(40)] = 1/120. The distributions of X and X are shown in Fig. 7-8. 


Now consider the case in which we have two independent populations. Let the first pop- 


ulation have mean p; and variance o1 and the second population have mean p. and variance 
05. Suppose that both populations are normally distributed. Then, using the fact that linear 


2 
o r- 1/120 





95 100 x 4 5 6 x 


Figure 7-7 Probability for Example 7-13. Figure 7-8 The distributions of X and 
X for Example 7-14. 
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Definition 


EXAMPLE 7-15 
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combinations of independent normal random variables follow a normal distribution (see 
Equation 5-41), we can say that the sampling distribution of X, — X» is normal with mean 


Myx-x Mx ~ Bx, M^ M2 (7-7) 
and variance 
2 2, 2 _ Of 9 
Cx-x 9x* 5*5 m ny (7-8) 


If the two populations are not normally distributed and if both sample sizes n, and n, are 
greater than 30, we may use the central limit theorem and assume that X, and X, follow 
approximately independent normal distributions. Therefore, the sampling distribution of 
X, — X; is approximately normal with mean and variance given by Equations 7-7 and 7-8, 
respectively. If either n, or n, is less than 30, the sampling distribution of X, — X; will still be 
approximately normal with mean and variance given by Equations 7-7 and 7-8, provided that 
the population from which the small sample is taken is not dramatically different from the nor- 
mal. We may summarize this with the following definition. 


If we have two independent populations with means p; and pi; and variances c5 and 
c? and if X, and X; are the sample means of two independent random samples of 
sizes n, and n, from these populations, then the sampling distribution of 

X = X) = (On = 


M2) 


Z= (7-9) 


V on -+ 05/n; 


is approximately standard normal, if the conditions of the central limit theorem 
apply. If the two populations are normal, the sampling distribution of Z 1s exactly 
standard normal. 


The effective life of a component used in a jet-turbine aircraft engine is a random variable 
with mean 5000 hours and standard deviation 40 hours. The distribution of effective life 1s 
fairly close to a normal distribution. The engine manufacturer introduces an improvement 
into the manufacturing process for this component that increases the mean life to 5050 hours 
and decreases the standard deviation to 30 hours. Suppose that a random sample of n; = 16 
components is selected from the “old” process and a random sample of rn, = 25 components 
is selected from the “improved” process. What is the probability that the difference in the two 
sample means X; — X; is at least 25 hours? Assume that the old and improved processes can 
be regarded as independent populations. 

To solve this problem, we first note that the distribution of X, is normal with mean 
w; = 5000 hours and standard deviation 0,/Vn, = 40/V/16 = 10 hours, and the distribution 
of X, is normal with mean p, = 5050 hours and standard deviation o2/ Vn, = 30/V25 = 
6 hours. Now the distribution of X; — X, is normal with mean p, — w; = 5050 — 5000 = 
50 hours and variance o$/n; + oj/n; = (6)* + (10) = 136 hours’. This sampling distribu- 
tion is shown in Fig. 7-9. The probability that X; — X, = 25 is the shaded portion of the 
normal distribution in this figure. 


Figure 7-9 The 

sampling distribution 

of X, — X, in 

Example 7-15. 0 25 50 75 
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Corresponding to the value x, — x, = 25 in Fig. 7-9, we find that 


and we find that 


EXERCISES FOR SECTION 7-5 


7-33. PVC pipe is manufactured with a mean diameter of 
1.01 inch and a standard deviation of 0.003 inch. Find the 
probability that a random sample of n = 9 sections of pipe 
will have a sample mean diameter greater than 1.009 inch and 
less than 1.012 inch. 


7-34. Suppose that samples of size n = 25 are selected at 
random from a normal population with mean 100 and standard 
deviation 10. What is the probability that the sample mean falls 
in the interval from wy — 1.8o y to wy + 1.00 $3? 


7-35. A synthetic fiber used in manufacturing carpet has 
tensile strength that is normally distributed with mean 75.5 psi 
and standard deviation 3.5 psi. Find the probability that a ran- 
dom sample of n — 6 fiber specimens will have sample mean 
tensile strength that exceeds 75.75 psi. 


1-36. Consider the synthetic fiber in the previous exercise. 
How is the standard deviation of the sample mean changed 
when the sample size is increased from n = 6 ton = 49? 
7-37. The compressive strength of concrete is normally dis- 
tributed with ju = 2500 psi and ø = 50 psi. Find the probability 
that a random sample of n — 5 specimens will have a sample 
mean diameter that falls in the interval from 2499 psi to 2510 psi. 
7-38. Consider the concrete specimens in the previous 
example. What is the standard error of the sample mean? 
7-39. A normal population has mean 100 and variance 25. 
How large must the random sample be if we want the standard 
error of the sample average to be 1.5? 

7-40. Suppose that the random variable X has the continu- 
ous uniform distribution 


L Osxz!l 


0, otherwise 


se) | 


z= —— = -2.14 


=X 225) =F (Z -24) 


= 0.9838 


Suppose that a random sample of n = 12 observations is 
selected from this distribution. What is the probability distribu- 
tion of X — 6? Find the mean and variance of this quantity. 


7-41. Suppose that X has a discrete uniform distribution 


1 = 
Reels mu ps 


0, otherwise 


A random sample of n = 36 is selected from this population. 
Find the probability that the sample mean is greater than 2.1 
but less than 2.5, assuming that the sample mean would be 
measured to the nearest tenth. 


7-42. The amount of time that a customer spends waiting at an 
airport check-in counter is a random variable with mean 8.2 min- 
utes and standard deviation 1.5 minutes. Suppose that a random 
sample of n — 49 customers is observed. Find the probability 
that the average time waiting in line for these customers is 

(a) Less than 10 minutes 

(b) Between 5 and 10 minutes 

(c) Less than 6 minutes 


7-43. A random sample of size n, = 16 is selected from a 
normal population with a mean of 75 and a standard deviation 
of 8. A second random sample of size n; — 9 is taken from an- 
other normal population with mean 70 and standard deviation 
12. Let X, and X; be the two sample means. Find 

(a) The probability that X, — X; exceeds 4 

(b) The probability that 3.5 =< X, — X; = 5.5 

7-44. A consumer electronics company is comparing the 
brightness of two different types of picture tubes for use in its 
television sets. Tube type A has mean brightness of 100 and 
standard deviation of 16, while tube type B has unknown 
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mean brightness, but the standard deviation 1s assumed to be 
identical to that for type 4. A random sample of n — 25 tubes 
of each type is selected, and Xg — X, is computed. If wz 
equals or exceeds w4, the manufacturer would like to adopt 
type B for use. The observed difference is xg — x4, = 3.5. 
What decision would you make, and why? 


7-45. The elasticity of a polymer is affected by the concen- 
tration of a reactant. When low concentration 1s used, the true 
mean elasticity is 55, and when high concentration is used the 
mean elasticity is 60. The standard deviation of elasticity is 4, 
regardless of concentration. If two random samples of size 16 
are taken, find the probability that Xy; — Xtow = 2. 


Supplemental Exercises 


7-46. Suppose that a random variable is normally distrib- 
uted with mean p and variance o°, and we draw a random 
sample of five observations from this distribution. What is the 
joint probability density function of the sample? 


7-47. Transistors have a life that is exponentially distributed 
with parameter A. A random sample of n transistors is taken. 
What is the joint probability density function of the sample? 


7-48. Suppose that X is uniformly distributed on the interval 
from 0 to 1. Consider a random sample of size 4 from X. What 
is the joint probability density function of the sample? 


1-49. A procurement specialist has purchased 25 resistors 
from vendor | and 30 resistors from vendor 2. Let X|;, 
X45, ... , X155 represent the vendor 1 observed resistances, 
which are assumed to be normally and independently distrib- 
uted with mean 100 ohms and standard deviation 1.5 ohms. 
Similarly, let X5 1, Xz, ..., X230 represent the vendor 2 ob- 
served resistances, which are assumed to be normally and in- 
dependently distributed with mean 105 ohms and standard 
deviation of 2.0 ohms. What is the sampling distribution of 
X,- X? 

1-50. Consider the resistor problem in Exercise 7-49. What 
is the standard error of X, — X5? 


7-51. A random sample of 36 observations has been drawn 
from a normal distribution with mean 50 and standard devia- 
tion 12. Find the probability that the sample mean is in the 
interval 47 x X < 53. 


7-52. Is the assumption of normality important in Exercise 
7-51? Why? 
7-53. A random sample of n = 9 structural elements is 


tested for compressive strength. We know that the true mean 


CHAPTER 7 POINT ESTIMATION OF PARAMETERS 


compressive strength ju = 5500 psi and the standard deviation 
is o = 100 psi. Find the probability that the sample mean 
compressive strength exceeds 4985 psi. 


7-54. A normal population has a known mean 50 and 
known variance o? = 2. A random sample of n = 16 is se- 
lected from this population, and the sample mean is x — 52. 
How unusual is this result? 


7-55. A random sample of size n = 16 is taken from a nor- 
mal population with u = 40 and o? = 5. Find the probability 
that the sample mean is less than or equal to 37. 

7-56. A manufacturer of semiconductor devices takes a 
random sample of 100 chips and tests them, classifying each 
chip as defective or nondefective. Let X; = 0 if the chip is 
nondefective and X; — 1 if the chip is defective. The sample 
fraction defective is 


X tX T abd + X99 
100 


p= 


What is the sampling distribution of the random variable P? 


7-57. LetX bea random variable with mean p and variance 
c^. Given two independent random samples of sizes n; and m, 
with sample means X; and X5, show that 


» 


= aX; + (1 —- a)X%„p 0<a<1 


is an unbiased estimator for u. If X; and X, are independent, 
find the value of a that minimizes the standard error of X. 


7-58. A random variable x has probability density function 


1 
f(x gae. 


= 0«0«0o 
20 


Vax «05 


Find the maximum likelihood estimator for 0. 

7-59. Let f(x) = &* !, 0«0 «o, and O<x<1. 
Show that © = —n/(In T];_, X) is the maximum likelihood 
estimator for 0. 

7-60. Let f(x) = (1/0)x 9, 0 — x « 1,and0 <0 «o. 
Show that © = —(1/n) X;-; In(X;) is the maximum likelihood 
estimator for 0 and that Ô is an unbiased estimator for 0. 


1-5 SAMPLING DISTRIBUTIONS OF MEANS 


MIND-EXPANDING EXERCISES 


7-61. A lot consists of N transistors, and of these M 
(M x N)are defective. We randomly select two transis- 
tors without replacement from this lot and determine 
whether they are defective or nondefective. The ran- 
dom variable 


1, ifthe ith transistor 
is nondefective 

if the ith transistor 
is defective 


i= 1,2 


Determine the joint probability function for X, and X3. 
What are the marginal probability functions for X, and 
X? Are X, and X, independent random variables? 


7-62. When the sample standard deviation is based on 
a random sample of size n from a normal population, it 
can be shown that S is a biased estimator for o. Spe- 
cifically, 


E(S) = o V2/(n — 1) F(n/2/T (n — 19/2] 


(a) Use this result to obtain an unbiased estimator for o 
of the form c, S, when the constant c, depends on the 
sample size n. 

(b) Find the value of c, for n = 10 and n = 25. 
Generally, how well does S perform as an estimator 
of o for large n with respect to bias? 


1-63. A collection of n randomly selected parts is 
measured twice by an operator using a gauge. Let X; and 
Y; denote the measured values for the ith part. Assume 
that these two random variables are independent and 
normally distributed and that both have true mean p; and 
variance o°. 


(a) Show that the maximum likelihood estimator of o? 
is 67 = (1/41) Z1 (Qf — Yyy". 


(b) Show that G? is a biased estimator for o?. What 
happens to the bias as n becomes large? 

(c) Find an unbiased estimator for o. 

7-64. Consistent Estimator. Another way to measure 
the closeness of an estimator Ó to the parameter 0 is in 
terms of consistency. If Ó, is an estimator of 0 based on 
a random sample of n observations, Ó, is consistent for 
0 if 


lim P(Ó, — € < €) = 1 


Thus, consistency is a large-sample property, describing 
the limiting behavior of Ó, as n tends to infinity. It is 
usually difficult to prove consistency using the above 
definition, although it can be done from other ap- 
proaches. To illustrate, show that X is a consistent esti- 
mator of p. (when o? < c) by using Chebyshev’s 
inequality. See Section 5-10 (CD Only). 


7-65. Order Statistics. Let Xj, X,,..., X, be a 
random sample of size n from X, a random variable hav- 
ing distribution function F(x). Rank the elements in or- 
der of increasing numerical magnitude, resulting in Xq, 
Xo, +++» Xo, Where Xay is the smallest sample element 
(Xa = miniX;, X», ..., X,j) and Xin is the largest sam- 
ple element (Xm = max{X), X», ..., X,}). Xo is called 
the ith order statistic. Often the distribution of some of 
the order statistics is of interest, particularly the mini- 
mum and maximum sample values. Xa) and Xa, respec- 
tively. Prove that the cumulative distribution functions 
of these two order statistics, denoted respectively by 
Fx. (t) and Fy (f) are 


Prove that if X is continuous with probability density 
function f(x), the probability distributions of Xa) and 
Xm are 


faxo) sm DEB | 
fe, (t) = nLF(O]"- FO 


7-66. Continuation of Exercise 7-65. Let Xj, X5, ..., 
X, be a random sample of a Bernoulli random variable 
with parameter p. Show that 


PX = 1) = 1— (1 — py 
P(X) = 0) =1—p" 


Use the results of Exercise 7-65. 


7-67. Continuation of Exercise 7-65. Let Xj, X5,..., 
X, be a random sample of a normal random variable 
with mean u and variance c^. Using the results of 
Exercise 7-65, derive the probability density functions 
of X and Xon. 
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MIND-EXPANDING EXERCISES 


7-68. Continuation of Exercise 7-65. Let Xj, X5, ..., 
X, be a random sample of an exponential random vari- 
able of parameter N. Derive the cumulative distribution 
functions and probability density functions for Xq) and 
X. Use the result of Exercise 7-65. 


4-69. Let Xj, X5, ..., X, be a random sample of a 
continuous random variable with cumulative distribu- 
tion function F(x). Find 


E|F(Xq | 
and 
EPAX 


7-70. Let X be a random variable with mean p. and 
variance c^, and let X, X, ... , X, be a random sample 
of size n from X. Show that the statistic V = k$] 
(X;,; — X; is an unbiased estimator for o? for an 
appropriate choice for the constant k. Find this value 
for k. 

7-71. When the population has a normal distribution, 
the estimator 


ô = median (|X, — X|, |X, — X|, 
S |X, — X|)/0.6745 


is Sometimes used to estimate the population standard 
deviation. This estimator is more robust to outliers than 
the usual sample standard deviation and usually does 


IMPORTANT TERMS AND CONCEPTS 


In the E-book, click on any Mean square error of an 
term or concept below to estimator 
go to that subject. Minimum variance 
Bias in parameter unbiased estimator 
estimation Moment estimator 
Central limit theorem Normal distribution as 
Estimator versus the sampling distribu- 
estimate tion of a sample mean 
Likelihood function Normal distribution as 
Maximum likelihood the sampling distri- 


estimator bution of the differ- 


not differ much from S when there are no unusual 

observations. 

(a) Calculate o and S for the data 10, 12, 9, 14, 18, 15, 
and 16. 

(b) Replace the first observation in the sample (10) with 
50 and recalculate both S and o. 


7-72. Censored Data. A common problem in indus- 
try is life testing of components and systems. In this 
problem, we will assume that lifetime has an exponen- 
tial distribution with parameter \, so p = 1/A = X is 
an unbiased estimate of u. When n components are tested 
until failure and the data X,, X, ..., X, represent actual 
lifetimes, we have a complete sample, and .X is indeed an 
unbiased estimator of u. However, in many situations, the 
components are only left under test until r < n failures 
have occurred. Let Y, be the time of the first failure, Y, be 
the time of the second failure, ... , and Y, be the time of the 
last failure. This type of test results in censored data. 
There are n — r units still running when the test is termi- 
nated. The total accumulated test time at termination 1s 


T, = > Lro- 
i=l 


(a) Show that [Ju = T,/r is an unbiased estimator for p. 
[Hint: You will need to use the memoryless property 
of the exponential distribution and the results of 
Exercise 7-68 for the distribution of the minimum of 
a sample from an exponential distribution with 
parameter A.] 

(b) It can be shown that V(T,/r) — 1/(A?r). How does 


this compare to V(X’) in the uncensored experiment? 


ence in two sample Statistic 
means Statistical inference 
Parameter estimation Unbiased estimator 
Point estimator 
: ee CD MATERIAL 
Population or distribu- 
enn Bayes estimator 
Bootstrap 


Sample moments 

Sampling distribution 

Standard error and 
estimated standard 
error of an estimator 


Posterior distribution 
Prior distribution 


7-2.2 Proof That S is a Biased Estimator of o (CD Only) 


We proved that the sample variance is an unbiased estimator of the population variance, that 
is, E(S?) = o, and that this result does not depend on the form of the distribution. However, 
the sample standard deviation 1s not an unbiased estimator of the population standard devia- 
tion. This is easy to demonstrate for the case where the random variable X follows a normal 
distribution. 

Let Xi, X5, ..., X, be a random sample of size n from a normal population with mean p 
and variance o?. Now it can be shown that the distribution of the random variable 


(n — 1)S? 


g^ 





is chi-square with n — 1 degrees of freedom, denoted x2. , (the chi-squared distribution 
was introduced in our discussion of the gamma distribution in Chapter 4, and the above re- 
sult will be presented formally in Chapter 8). Therefore the distribution of S? is o?/(n — 1) 
times a X2. , random variable. So when sampling from a normal distribution, the expected 
value of S? is 





2 2 2 
O O O 
E(S*) = s. = e) = m Ea) Ža Ds o^ 


because the mean of a chi-squared random variable with n — 1 degrees of freedom is n — 1. 
Now it follows that the distribution of 


Vin — 1)S 


Oo 


is a chi distribution with n — 1 degrees of freedom, denoted x „—1. The expected value of S can 
be written as 


Oo 


(e) 
E(S) = E| — == yx,-1 ] = — E(X,- 
(S) (x, ] ij ced (Xn—1) 
The mean of the chi distribution with n — 1 degrees of freedom is 


I (n/2) 
(n = 1/2] 





E(Xn-1) = V2 


oO 


where the gamma function I(r) = | y" le dy. Then 
0 


7 2 DU (n/2) 
PONTE = 0/2] ^ 





Although S is a biased estimator of o, the bias gets small fairly quickly as the sample size 
n increases. For example, note that c, = 0.94 for a sample of n = 5,c, = 0.9727 for a sample 
of n = 10, and c, = 0.9896 or very nearly unity for a sample of n = 25. 
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7-2.5 Bootstrap Estimate of the Standard Error (CD Only) 


EXAMPLE $7-1 


There are situations in which the standard error of the point estimator is unknown. Usually, 
these are cases where the form of Ó is complicated, and the standard expectation and variance 
operators are difficult to apply. A computer-intensive technique called the bootstrap that was 
developed in recent years can be used for this problem. 

Suppose that we are sampling from a population that can be modeled by the probability 
distribution f(x; 0). The random sample results in data values x4, x2, ... , x, and we obtain 6 as 
the point estimate of 0. We would now use a computer to obtain bootstrap samples from the 
distribution (x; 6), and for each of these samples we calculate the bootstrap estimate 0" of 0. 
This results in 


Bootstrap Sample Observations Bootstrap Estimate 
x x x Ak 
1 Xis X2, ee. 3 Xh 01 
x x x A 
2 X1» X2, +++ 3 Xj 0^ 
x x x Ak 
B X1. X2, +--+ , Xj Op 


Usually B = 100 or 200 of these bootstrap samples are taken. Let 0* = (1/B) 5 0% be the 
sample mean of the bootstrap estimates. The bootstrap estimate of the standard error of O is 
just the sample standard deviation of the 0;, or 


(S7-1) 





In the bootstrap literature, B — 1 in Equation S7-1 is often replaced by B. However, for 
the large values usually employed for B, there is little difference in the estimate produced 
for sô. 


The time to failure of an electronic module used in an automobile engine controller is tested 
at an elevated temperature in order to accelerate the failure mechanism. The time to failure 
is exponentially distributed with unknown parameter A. Eight units are selected at random 
and tested, with the resulting failure times (in hours): x, = 11.96, x, = 5.03, x; = 67.40, 
X4 = 16.07, x5 = 31.50, x; = 7.73, x; = 11.10, and x, = 22.38. Now the mean of an expo- 
nential distribution is ju. = 1/A, so E(X) = 1/A, and the expected value of the sample average 
is E(X) = 1/A. Therefore, a reasonable way to estimate \ is with A = 1/X. For our sample, 
x = 21.65, so our estimate of A isA = 1/21.65 = 0.0462. To find the bootstrap standard error 
we would now obtain B = 200 (say) samples of n = 8 observations each from an exponential 
distribution with parameter A = 0.0462. The following table shows some of these results: 


Bootstrap Sample Observations Bootstrap Estimate 
l 8.01, 28.85, 14.14, 59.12, 3.11, 32.19, 5.26, 14.17 Ai = 0.0485 


2 33.27, 2.10, 40.17, 32.43, 6.94, 30.66, 18.99, 5.61 A> = 0.0470 


200 40.26, 39.26, 19.59, 43.53, 9.55, 7.07, 6.03, 8.94 Á»oo = 0.0459 
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The sample average of the À; (the bootstrap estimates) is 0.0513, and the standard deviation 
of these bootstrap estimates is 0.020. Therefore, the bootstrap standard error of À is 0.020. In 
this case, estimating the parameter A in an exponential distribution, the variance of the esti- 
mator we used, A, is known. When n is large, V(A) = A?/n. Therefore the estimated standard 
error of Â is VÀ?/n = V/(0.0462)?/8 = 0.016. Notice that this result agrees reasonably 
closely with the bootstrap standard error. 


Sometimes we want to use the bootstrap in situations in which the form of the probabil- 
ity distribution is unknown. In these cases, we take the n observations in the sample as the 
population and select B random samples each of size n, with replacement, from this popula- 
tion. Then Equation 57-1 can be applied as described above. The book by Efron and 
Tibshirani (1993) 1s an excellent introduction to the bootstrap. 


7-3.3 Bayesian Estimation of Parameters (CD Only) 


This book uses methods of statistical inference based on the information in the sample data. 
In effect, these methods interpret probabilities as relative frequencies. Sometimes we call 
probabilities that are interpreted in this manner objective probabilities. There is another ap- 
proach to statistical inference, called the Bayesian approach, that combines sample informa- 
tion with other information that may be available prior to collecting the sample. In this section 
we briefly illustrate how this approach may be used in parameter estimation. 

Suppose that the random variable X has a probability distribution that is a function of one 
parameter 0. We will write this probability distribution as f(x| 0). This notation implies that 
the exact form of the distribution of X is conditional on the value assigned to 0. The classical ap- 
proach to estimation would consist of taking a random sample of size n from this distribution 
and then substituting the sample values x; into the estimator for 0. This estimator could have 
been developed using the maximum likelihood approach, for example. 

Suppose that we have some additional information about 0 and that we can summarize 
that information in the form of a probability distribution for 0, say, f(0). This probability dis- 
tribution is often called the prior distribution for 0, and suppose that the mean of the prior is 
wo and the variance is o2. This is a very novel concept insofar as the rest of this book is con- 
cerned because we are now viewing the parameter 0 as a random variable. The probabilities 
associated with the prior distribution are often called subjective probabilities, in that they 
usually reflect the analyst's degree of belief regarding the true value of 0. The Bayesian 
approach to estimation uses the prior distribution for 0, f(0), and the joint probability distri- 
bution of the sample, say f(x, X2, ..., x,| 0), to find a posterior distribution for 0, say, 
f 0 | x, x5, ... , Xn). This posterior distribution contains information both from the sample and 
the prior distribution for 0. In a sense, it expresses our degree of belief regarding the true value 
of 0 after observing the sample data. It is easy conceptually to find the posterior distribution. 
The joint probability distribution of the sample X;, X, ..., X, and the parameter 0 (remember 
that 0 is a random variable) is 


fn. X» «++» Xn 9) = fons x»... Xn| 8) /(8) 


and the marginal distribution of X;, X>, ..., X, 18 


> O EA er) 0 discrete 
Tio; Soo) = d 
| f(x,, X5,...,x,,0) d0, O continuous 


=00 
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EXAMPLE $7-2 


Therefore, the desired distribution 1s 


Teint enss as 8) 
fO | xi, x», ... Xn) f ED 
We define the Bayes estimator of 0 as the value 6 that corresponds to the mean of the poste- 
rior distribution /(0 | x1, x», ... , Xp). 

Sometimes, the mean of the posterior distribution of 0 can be determined easily. As a 
function of 0, /(0| x;,..., x,) is a probability density function and x4, ... , X„ are just con- 
stants. Because 0 enters into /(0 | x1, ..., x,) only through f(x;,...,x,, 0) if f(x, ..., Xm 0), 
as a function of 0 is recognized as a well-known probability function, the posterior mean of 0 
can be deduced from the well-known distribution without integration or even calculation of 


T'es 


Let Xj, X5,..., X, be a random sample from the normal distribution with mean u and variance 
o°, where u is unknown and o? is known. Assume that the prior distribution for u is normal 
with mean pọ and variance 05; that is 


I - eg (B7 uon + po)/(200) 


IW) 7 an, * T Vino} 


The joint probability distribution of the sample is 


g (120?) (a-a) 


i=1 


fen xs SIM) = ty 


= ol e (1/20 {Èx1 - 29x + np’) 
(2mo2y'? 


Thus, the joint probability distribution of the sample and p. is 


l 
f(X, x», ro Xp M) (2:0?) ^ N/25a, 


-a (s) (5 -) | 
— oô o/n á o  o?/n J h 


e 


e (0/20/00 n/a?) — (29/09 + 2 xo^). Zxi/o^ + 44/05] 


NOC Ree oF a: TM o$) 


Upon completing the square in the exponent 


ava 1 il J| ; Qus NES J 
7 2 2 Bo E. 
oj o/n 06 T 07 /n 06 07 /n h(x, TS a jio; 04) 








F(X], X5, ..., X, M) = e 


where A;(x,, ... , Xp, 0^, uo, OG) is a function of the observed values, o?, wo, and o5. 
Now, because f(x, ..., x,) does not depend on p, 


-am (d+ Jo (iei 


Me US 2 2 ) 
JU s acea, = 928 gu: Se Bs X45 aves Xn a, Mo, 59) 
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This is recognized as a normal probability density function with posterior mean 


(o^/n)ug + oox 


og + o^/n 


and posterior variance 





(. id J- og(o^/n) 
Go o^ /n — 02 + o?/n 


Consequently, the Bayes estimate of w is a weighted average of [uy and x. For purposes of 
comparison, note that the maximum likelihood estimate of u is [i = x. 

To illustrate, suppose that we have a sample of size n = 10 from a normal distribution 
with unknown mean y. and variance o? = 4. Assume that the prior distribution for p is nor- 
mal with mean p = 0 and variance of = 1. If the sample mean is 0.75, the Bayes estimate 
of u is 


(4/10)0 + 1(0.75) 0.75 _ 
1+(4/10) 14 oe 


Note that the maximum likelihood estimate of u is x = 0.75. 


There is a relationship between the Bayes estimator for a parameter and the maximum 
likelihood estimator of the same parameter. For large sample sizes, the two are nearly 
equivalent. In general, the difference between the two estimators is small compared to 
1/Vn. In practical problems, a moderate sample size will produce approximately the same 
estimate by either the Bayes or maximum likelihood method, if the sample results are con- 
sistent with the assumed prior information. If the sample results are inconsistent with the 
prior assumptions, the Bayes estimate may differ considerably from the maximum likeli- 
hood estimate. In these circumstances, if the sample results are accepted as being correct, 
the prior information must be incorrect. The maximum likelihood estimate would then be 
the better estimate to use. 

If the sample results are very different from the prior information, the Bayes estimator 
will always tend to produce an estimate that is between the maximum likelihood estimate and 
the prior assumptions. If there is more inconsistency between the prior information and the 
sample, there will be more difference between the two estimates. 


EXERCISES FOR SECTION 7-3.3 


S7-1. Suppose that X is a normal random variable 
with unknown mean p and known variance o°. The prior 
distribution for u is a normal distribution with mean p and 
variance o2. Show that the Bayes estimator for u becomes 
the maximum likelihood estimator when the sample size n is 
large. 

S7-2. Suppose that X is a normal random variable with un- 
known mean p and known variance o°. The prior distribution 
for u is a uniform distribution defined over the interval [a, b]. 


(a) Find the posterior distribution for p. 

(b) Find the Bayes estimator for u. 

S7-3. Suppose that X is a Poisson random variable with pa- 
rameter \. Let the prior distribution for à be a gamma distri- 
bution with parameters m + 1 and (m + 1)/Ag. 

(a) Find the posterior distribution for A. 

(b) Find the Bayes estimator for A. 

S7-4. Suppose that X is a normal random variable with un- 
known mean and known variance o? = 9. The prior distribution 


1-6 


for u is normal with uy = 4 and o$ = 1. A random sample of 

n = 25 observations is taken, and the sample mean is x = 4.85. 

(a) Find the Bayes estimate of w. 

(b) Compare the Bayes estimate with the maximum likeli- 
hood estimate. 


S7-5. The weight of boxes of candy is a normal random 
variable with mean p. and variance 1/10 pound. The prior dis- 
tribution for u is normal with mean 5.03 pound. and variance 
1/25 pound. A random sample of 10 boxes gives a sample 
mean of x — 5.05 pound. 

(a) Find the Bayes estimate of w. 


(b) Compare the Bayes estimate with the maximum likeli- 
hood estimate. 


S7-6. The time between failures of a machine has an expo- 
nential distribution with parameter A. Suppose that the prior 
distribution for X is exponential with mean 100 hours. Two 
machines are observed, and the average time between failures 
is x — 1125 hours. 

(a) Find the Bayes estimate for A. 

(b) What proportion of the machines do you think will fail be- 

fore 1000 hours? 
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8-3.3 The t Confidence Interval on pu 


CONFIDENCE INTERVAL ON THE 
VARIANCE AND STANDARD 
DEVIATION OF A NORMAL 
POPULATION 


A LARGE-SAMPLE CONFIDENCE 
INTERVAL FOR A POPULATION 
PROPORTION 


A PREDICTION INTERVAL FOR A 
FUTURE OBSERVATION 


TOLERANCE INTERVALS FOR A 
NORMAL DISTRIBUTION 


After careful study of this chapter, you should be able to do the following: 


1. 


Ui A WO N 


Construct confidence intervals on the mean of a normal distribution, using either the normal 


distribution or the t distribution method 


. Construct confidence intervals on the variance and standard deviation of a normal distribution 
. Construct confidence intervals on a population proportion 
. Construct prediction intervals for a future observation 


. Construct a tolerance interval for a normal population 
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CHAPTER 8 STATISTICAL INTERVALS FOR A SINGLE SAMPLE 


6. Explain the three types of interval estimates: confidence intervals, prediction intervals, and 
tolerance intervals 


7. Use the general method for constructing a confidence interval 


CD MATERIAL 


8. Use the bootstrap technique to construct a confidence interval 


Answers for many odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e- Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e-Text immediately following the section they accompany. 


8.1 INTRODUCTION 


In the previous chapter we illustrated how a parameter can be estimated from sample data. 
However, it is important to understand how good is the estimate obtained. For example, sup- 
pose that we estimate the mean viscosity of a chemical product to be ù = x = 1000. Now 
because of sampling variability, it is almost never the case that ju = x. The point estimate says 
nothing about how close Á is to w. Is the process mean likely to be between 900 and 1100? Or 
is it likely to be between 990 and 1010? The answer to these questions affects our decisions 
regarding this process. Bounds that represent an interval of plausible values for a parameter 
are an example of an interval estimate. Surprisingly, it is easy to determine such intervals in 
many cases, and the same data that provided the point estimate are typically used. 

An interval estimate for a population parameter is called a confidence interval. We can- 
not be certain that the interval contains the true, unknown population parameter— we only use 
a sample from the full population to compute the point estimate and the interval. However, 
the confidence interval 1s constructed so that we have high confidence that it does contain the 
unknown population parameter. Confidence intervals are widely used in engineering 
and the sciences. 

A tolerance interval is another important type of interval estimate. For example, the 
chemical product viscosity data might be assumed to be normally distributed. We might like 
to calculate limits that bound 95% of the viscosity values. For a normal distribution, we know 
that 95% of the distribution is in the interval 


u — 1.960, w+ 1.960 (8-1) 


However, this is not a useful tolerance interval because the parameters and o are unknown. 
Point estimates such as x and s can be used in Equation 8-1 for u and o. However, we need to 
account for the potential error in each point estimate to form a tolerance interval for the 
distribution. The result is an interval of the form 


= RS x P ES (8-2) 


where k is an appropriate constant (that is larger than 1.96 to account for the estimation 
error). As for a confidence interval, it is not certain that Equation 8-2 bounds 95% of the dis- 
tribution, but the interval is constructed so that we have high confidence that it does. 
Tolerance intervals are widely used and, as we will subsequently see, they are easy to cal- 
culate for normal distributions. 
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Confidence and tolerance intervals bound unknown elements of a distribution. In this 
chapter you will learn to appreciate the value of these intervals. A prediction interval pro- 
vides bounds on one (or more) future observations from the population. For example, a 
prediction interval could be used to bound a single, new measurement of viscosity— another 
useful interval. With a large sample size, the prediction interval for normally distributed data 
tends to the tolerance interval in Equation 8-1, but for more modest sample sizes the predic- 
tion and tolerance intervals are different. 

Keep the purpose of the three types of interval estimates clear: 


e A confidence interval bounds population or distribution parameters (such as the mean 
viscosity). 


e A tolerance interval bounds a selected proportion of a distribution. 


e A prediction interval bounds future observations from the population or distribution. 


8-2 CONFIDENCE INTERVAL ON THE MEAN OF A NORMAL 
DISTRIBUTION, VARIANCE KNOWN 


The basic ideas of a confidence interval (CI) are most easily understood by initially consider- 
ing a simple situation. Suppose that we have a normal population with unknown mean p and 
known variance o?. This is a somewhat unrealistic scenario because typically we know the 
distribution mean before we know the variance. However, in subsequent sections we will 
present confidence intervals for more general situations. 


8-2.1 Development of the Confidence Interval and its Basic Properties 


Suppose that X, X, ..., X, is a random sample from a normal distribution with unknown 
mean p. and known variance o°. From the results of Chapter 5 we know that the sample 
mean X is normally distributed with mean p. and variance o?/n. We may standardize X 
by subtracting the mean and dividing by the standard deviation, which results in the 
variable 


X- 
d a/Vn 





(8-3) 


Now Z has a standard normal distribution. 

A confidence interval estimate for u is an interval of the form / = u S u, where the end- 
points / and u are computed from the sample data. Because different samples will produce 
different values of / and u, these end-points are values of random variables L and U, respec- 
tively. Suppose that we can determine values of L and U such that the following probability 
statement is true: 


P{Lspsus=1-a (8-4) 
where 0 = a < 1. There is a probability of 1 — a of selecting a sample for which the CI will 
contain the true value of u. Once we have selected the sample, so that X, = xj, X; = x,,..., 


X, = x,, and computed / and u, the resulting confidence interval for u is 


luu (8-5) 
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Definition 


EXAMPLE 8-1 


The end-points or bounds / and u are called the lower- and upper-confidence limits, respec- 
tively, and 1 — a is called the confidence coefficient. 

In our problem situation, because Z = (X — p)/(o/ Vn) has a standard normal distribu- 
tion, we may write 


Zi at THe bay 
Tn vn Af S 


Now manipulate the quantities inside the brackets by (1) multiplying through by o/ Vn, (2) 
subtracting X from each term, and (3) multiplying through by — 1. This results in 


= oO — O 
PAX atr sw sk un. Tia (8-6) 


From consideration of Equation 8-4, the lower and upper limits of the inequalities in Equation 
8-6 are the lower- and upper-confidence limits L and U, respectively. This leads to the fol- 
lowing definition. 


If x is the sample mean of a random sample of size n from a normal population with 
known variance o?, a 100(1 — «)% CI on p is given by 


"cs Zy20/ Vn == ese ale z,n0/ Nn (8-7) 


where Zan is the upper 1000/2 percentage point of the standard normal distribution. 





ASTM Standard E23 defines standard test methods for notched bar impact testing of metallic 
materials. The Charpy V-notch (CVN) technique measures impact energy and is often used to 
determine whether or not a material experiences a ductile-to-brittle transition with decreasing 
temperature. Ten measurements of impact energy (J) on specimens of A238 steel cut at 60°C 
are as follows: 64.1, 64.7, 64.5, 64.6, 64.5, 64.3, 64.6, 64.8, 64.2, and 64.3. Assume that 
impact energy is normally distributed with o. = 1J. We want to find a 95% CI for u, the mean 
impact energy. The required quantities are z,/ = zoo, = 1.96, n = 10, o = 1, and 
x = 64.46. The resulting 95% CI is found from Equation 8-7 as follows: 


Oo = Oo 
DE ig i E 
l l 
64.46 — 1.96 —— x u S 64.46 + 1.96 —— 
vio” VIO 


63.84 = u = 65.08 


That is, based on the sample data, a range of highly plausible vaules for mean impact energy 
for A238 steel at 60°C is 63.84J S u = 65.08J. 


Interpreting a Confidence Interval 
How does one interpret a confidence interval? In the impact energy estimation problem in 
Example 8-1 the 95% CI is 63.84 5 u <= 65.08, so it is tempting to conclude that w is within 
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Figure 8-1 Repeated 
construction of a con- 
fidence interval for p. 


this interval with probability 0.95. However, with a little reflection, it’s easy to see that this can- 
not be correct; the true value of u is unknown and the statement 63.84 < u = 65.08 is either 
correct (true with probability 1) or incorrect (false with probability 1). The correct interpretation 
lies in the realization that a CI is a random interval because in the probability statement defin- 
ing the end-points of the interval (Equation 8-4), L and U are random variables. Consequently, 
the correct interpretation of a 100(1 — a)% CI depends on the relative frequency view of prob- 
ability. Specifically, if an infinite number of random samples are collected and a 100(1 — a)% 
confidence interval for ju is computed from each sample, 100(1 — «)% of these intervals will 
contain the true value of pu. 

The situation is illustrated in Fig. 8-1, which shows several 100(1 — a)% confidence 
intervals for the mean u of a normal distribution. The dots at the center of the intervals indi- 
cate the point estimate of u (that is, x). Notice that one of the intervals fails to contain the true 
value of w. If this were a 95% confidence interval, in the long run only 5% of the intervals 
would fail to contain w. 

Now in practice, we obtain only one random sample and calculate one confidence interval. 
Since this interval either will or will not contain the true value of u, it is not reasonable to attach 
a probability level to this specific event. The appropriate statement is the observed interval [/, u] 
brackets the true value of u with confidence 100(1 — a). This statement has a frequency inter- 
pretation; that is, we don’t know if the statement is true for this specific sample, but the method 
used to obtain the interval [/, u] yields correct statements 100(1 — a)% of the time. 


Confidence Level and Precision of Estimation 

Notice in Example 8-1 that our choice of the 95% level of confidence was essentially 
arbitrary. What would have happened if we had chosen a higher level of confidence, say, 99%? 
In fact, doesn't it seem reasonable that we would want the higher level of confidence? At a. — 
0.01, we find Za = Zo.01/2 = Zooos = 2.58, while for a = 0.05, Zoo25 = 1.96. Thus, the 
length of the 95% confidence interval is 


2(1.960/Vn) = 3.920/ V/n 
whereas the length of the 99% CT is 
2(2.580/Vn) = 5.160/Vn 
Thus, the 99% CI is longer than the 95% CI. This is why we have a higher level of confidence 


in the 99% confidence interval. Generally, for a fixed sample size n and standard deviation o, 
the higher the confidence level, the longer the resulting CI. 


12 3 4 5 6 7 8 9 1011 12 13 14 15 16 
Interval number 
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Figure 8-2 Error in 
estimating u with x. 


E = error = |x- ul 
[€— —» 


l2x-zgjo0lAn x u u =X + Zg oln 


The length of a confidence interval is a measure of the precision of estimation. From the 
preceeding discussion, we see that precision is inversely related to the confidence level. It is de- 
sirable to obtain a confidence interval that is short enough for decision-making purposes and 
that also has adequate confidence. One way to achieve this is by choosing the sample size n to 
be large enough to give a CI of specified length or precision with prescribed confidence. 


8-2.2 Choice of Sample Size 


Definition 


EXAMPLE 8-2 





The precision of the confidence interval in Equation 8-7 is 2za/20/ Vn. This means that in 
using x to estimate p, the error E = |x — u| is less than or equal to z,5o0/ Vn with 
confidence 100(1 — a). This is shown graphically in Fig. 8-2. In situations where the sam- 
ple size can be controlled, we can choose n so that we are 100(1 — a) percent confident that 
the error in estimating pu is less than a specified bound on the error E. The appropriate sam- 
ple size is found by choosing n such that z,.0/ Vn = E. Solving this equation gives the fol- 
lowing formula for n. 


If x is used as an estimate of u, we can be 100(1 — a)% confident that the error 
|x — p| will not exceed a specified amount E when the sample size is 


(8-8) 


If the right-hand side of Equation 8-8 is not an integer, it must be rounded up. This will ensure 
that the level of confidence does not fall below 100(1 — a)%. Notice that 2E 1s the length of 
the resulting confidence interval. 


To illustrate the use of this procedure, consider the CVN test described in Example 8-1, and 
suppose that we wanted to determine how many specimens must be tested to ensure that the 
95% CI on u for A238 steel cut at 60°C has a length of at most 1.0J. Since the bound on error 
in estimation E is one-half of the length of the CI, to determine n we use Equation 8-8 with 
E = 0.5, ø = 1, and Zą = 0.025. The required sample size is 16 


Zano? 1.96)1 |? 
n= (22°) -|' 1 = 15.37 
E 0.5 


and because n must be an integer, the required sample size is n = 16. 





Notice the general relationship between sample size, desired length of the confidence 
interval 2E, confidence level 100(1 — a), and standard deviation o: 


e As the desired length of the interval 2E decreases, the required sample size n increases 
for a fixed value of o and specified confidence. 
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e Aso increases, the required sample size n increases for a fixed desired length 2E and 
specified confidence. 


* As the level of confidence increases, the required sample size n increases for fixed 
desired length 2E and standard deviation o. 


8-2.3 One-Sided Confidence Bounds 
The confidence interval in Equation 8-7 gives both a lower confidence bound and an upper 


confidence bound for w. Thus it provides a two-sided CI. It is also possible to obtain one-sided 
confidence bounds for u by setting either / = —% oru = © and replacing Za/2 by Za- 


Definition 
A 100(1 — a)% upper-confidence bound for u is 


LBzu-x-czo/WVn 


and a 100(1 — a)% lower-confidence bound for u is 


P= ww dem 





8-2.4 General Method to Derive a Confidence Interval 


It is easy to give a general method for finding a confidence interval for an unknown parame- 
ter 0. Let X, X, ..., X, be a random sample of n observations. Suppose we can find a statistic 
g(Xi, X5, ..., X,; 9) with the following properties: 


1l. g(Xj,X,..., X,; 8) depends on both the sample and 8. 


2. The probability distribution of g(X;, X5, ... , X,; 8) does not depend on 0 or any other 
unknown parameter. 


In the case considered in this section, the parameter 0 = u. The random variable g(Xj, X5, ... , 
X, w) = (X — u)/(o/ Vn) and satisfies both conditions above; it depends on the sample and 
on p, and it has a standard normal distribution since o is known. Now one must find constants 
C; and Cy so that 


P|C; x g(X,, X5, ..., X, 8) S Cyl =l-a (8-11) 


Because of property 2, C; and Cy do not depend on 0. In our example, C; = —z,/; and 
Cy = Za. Finally, you must manipulate the inequalities in the probability statement so that 


P|L(X;, X, T€ , Xn) = 0 = U(X, X», — en) =l-a (8-12) 


This gives L(X,, X, ..., X,) and U(X, X>, ..., X,) as the lower and upper confidence limits 
defining the 100(1 — a)% confidence interval for 0. The quantity g(X;, X>, ... , X,; 0) 1s 
often called a “pivotal quantity" because we pivot on this quantity in Equation 8-11 to pro- 
duce Equation 8-12. In our example, we manipulated the pivotal quantity (X — p)/(o/ Vn) 
to obtain L(X;, Xz, ...,.X,) = X — z,50/ Vn and U(X, X», ..., Xp) = X  z5a/ Vn. 
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8-2.5 A Large-Sample Confidence Interval for u 


Definition 


EXAMPLE 8-3 


We have assumed that the population distribution is normal with unknown mean and known 
standard deviation o. We now present a large-sample CI and y that does not require these as- 
sumptions. Let X,, X5, ... , X, be a random sample from a population with unknown mean 
|. and variance o^. Now if the sample size n is large, the central limit theorem implies that X 
has approximately a normal distribution with mean w and variance o?/n. Therefore 
Z = (X — u)/(o/ Vn) has approximately a standard normal distribution. This ratio could be 
used as a pivotal quantity and manipulated as in Section 8-2.1 to produce an approximate CI 
for u. However, the standard deviation o is unknown. It turns out that when n is large, replac- 
ing o by the sample standard deviation S has little effect on the distribution of Z. This leads to 
the following useful result. 


When 7 is large, the quantity 


X 
S/Vn 


has an approximate standard normal distribution. Consequently, 


m. S = S 
a RY ie ie aor (S) 


is a large sample confidence interval for u, with confidence level of approximately 
100(1 — a)%. 





Equation 8-13 holds regardless of the shape of the population distribution. Generally n should 
be at least 40 to use this result reliably. The central limit theorem generally holds for n = 30, 
but the larger sample size 1s recommended here because replacing o by S in Z results in addi- 
tional variability. 


An article in the 1993 volume of the Transactions of the American Fisheries Society reports 
the results of a study to investigate the mercury contamination in largemouth bass. A sample 
of fish was selected from 53 Florida lakes and mercury concentration in the muscle tissue was 
measured (ppm). The mercury concentration values are 


1.230 0.490 0.490 1.080 0.590 0.280 0.180 0.100 0.940 
1.330 0.190 1.160 0.980 0.340 0.340 0.190 0.210 0.400 
0.040 0.830 0.050 0.630 0.340 0.750 0.040 0.860 0.430 
0.044 0.810 0.150 0.560 0.840 0.870 0.490 0.520 0.250 
1.200 0.710 0.190 0.410 0.500 0.560 1.100 0.650 0.270 
0.270 0.500 0.770 0.730 0.340 0.170 0.160 0.270 
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The summary statistics from Minitab are displayed below: 


Descriptive Statistics: Concentration 


Variable N Mean Median TrMean StDev SE Mean 
Concentration 55 0.5250 0.4900 0.5094 0.3486 0.0479 
Variable Minimum Maximum QI Q3 
Concentration 0.0400 1.3300 0.2300 0.7900 





Figure 8-3(a) and (b) presents the histogram and normal probability plot of the mercury 
concentration data. Both plots indicate that the distribution of mercury concentration is not nor- 
mal and is positively skewed. We want to find an approximate 95% CI on u. Because n > 40, 
the assumption of normality is not necessary to use Equation 8-13. The required quantities are 
n = 53, x = 0.5250, s = 0.3486, and Zoo25 = 1.96. The approximate 95% CI on p is 


03486 L L g saso 4 1 96 02496 
V/53 V/53 


0.4311 = p = 0.6189 


0.5250 — 1.96 








This interval is fairly wide because there is a lot of variability in the mercury concentration 
measurements. 


A General Large Sample Confidence Interval 

The large-sample confidence interval for ju in Equation 8-13 is a special case of a more 
general result. Suppose that 0 is a parameter of a probability distribution and let Ó be an 
estimator of 0. If Ó (1) has an approximate normal distribution, (2) is approximately unbiased 


Percentage 
O1 
e 


Frequency 


OrRFN WF ODN WO o 





0.0 0.5 1.0 


0.0 0.5 1.0 1.5 
Concentration Concentration 
(a) (b) 


Figure 8-3 Mercury concentration in largemouth bass (a) Histogram. (b) Normal probability plot. 
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for 0, and (3) has standard deviation o@ that can be estimated from the sample data, then the 


^ 


quantity (© — 0)/o has an approximate standard normal distribution. Then a large-sample 
approximate CI for 0 is given by 


Ü ES 20/296 = 0 < 0 Jr Zg/200 





Maximum likelihood estimators usually satisfy the three conditions listed above, so Equation 
8-14 is often used when Ó is the maximum likelihood estimator of 0. Finally, note that 
Equation 8-14 can be used even when o is a function of other unknown parameters (or of 0). 
Essentially, all one does is to use the sample data to compute estimates of the unknown 
parameters and substitute those estimates into the expression for o@. 


8-2.6 Bootstrap Confidence Intervals (CD Only) 


EXERCISES FOR SECTION 8-2 


8-1. For a normal population with known variance o^, 


answer the following questions: 

(a) What is the confidence level for the interval x — 2.140/Vn 
<p=x+2.140/Vn? 

(b) What is the confidence level for the interval x — 2.490/ Vin. 

uzx-c2490/Vn? 

(c) What is the confidence level for the interval x — 1.850/Vn 
< u Sx + 1.850/Vn? 

8-2. Fora normal population with known variance o?: 

(a) What value of za in Equation 8-7 gives 98% confidence? 

(b) What value of zan in Equation 8-7 gives 80% confidence? 

(c) What value of z,/; in Equation 8-7 gives 75% confidence? 


8-3. Consider the one-sided confidence interval expres- 
sions, Equations 8-9 and 8-10. 

(a) What value of z, would result in a 90% CT? 

(b) What value of z, would result in a 95% CI? 

(c) What value of z, would result in a 99% CT? 


8-4. A confidence interval estimate is desired for the gain in 
a circuit on a semiconductor device. Assume that gain is nor- 
mally distributed with standard deviation ø = 20. 

(a) Find a 95% CI for ju. when n = 10 and x = 1000. 

(b) Find a 95% CI for u when n = 25 and x = 1000. 

(c) Find a 99% CI for u when n = 10 and x = 1000. 

(d) Find a 99% CI for p. when n = 25 and x = 1000. 


8-5. Consider the gain estimation problem in Exercise 8-4. 
How large must n be if the length of the 95% CI is to be 40? 


8-6. Following are two confidence interval estimates of the 
mean pu of the cycles to failure of an automotive door latch 
mechanism (the test was conducted at an elevated stress level 
to accelerate the failure). 


3124.9 = p x 3215.7 31105 < u = 3230.1 


(a) What is the value of the sample mean cycles to failure? 

(b) The confidence level for one of these CIs is 95% and the 
confidence level for the other is 99%. Both CIs are calcu- 
lated from the same sample data. Which is the 95% CI? 
Explain why. 

8-7. n = 100 random samples of water from a fresh water 

lake were taken and the calcium concentration (milligrams 

per liter) measured. A 95% CI on the mean calcium concen- 

tration is 0.49 < u S 0.82. 

(a) Would a 99% CI calculated from the same sample data 
been longer or shorter? 

(b) Consider the following statement: There is a 95% chance 
that u is between 0.49 and 0.82. Is this statement correct? 
Explain your answer. 

(c) Consider the following statement: If n = 100 random 
samples of water from the lake were taken and the 95% CI 
on u computed, and this process was repeated 1000 times, 
950 of the CIs will contain the true value of w. Is this state- 
ment correct? Explain your answer. 


8-8. The breaking strength of yarn used in manufacturing 
drapery material is required to be at least 100 psi. Past experi- 
ence has indicated that breaking strength is normally distrib- 
uted and that o. = 2 psi. A random sample of nine specimens 
is tested, and the average breaking strength is found to be 98 
psi. Find a 9596 two-sided confidence interval on the true 
mean breaking strength. 


8-9. The yield of a chemical process is being studied. From 
previous experience yield is known to be normally distributed 
and o — 3. The past five days of plant operation have resulted 
in the following percent yields: 91.6, 88.75, 90.8, 89.95, and 
91.3. Find a 95% two-sided confidence interval on the true 
mean yield. 
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8-10. The diameter of holes for cable harness is known to 
have a normal distribution with o = 0.01 inch. A random 
sample of size 10 yields an average diameter of 1.5045 inch. 
Find a 99% two-sided confidence interval on the mean hole 
diameter. 


8-11. A manufacturer produces piston rings for an auto- 

mobile engine. It is known that ring diameter is normally dis- 

tributed with o. = 0.001 millimeters. A random sample of 15 

rings has a mean diameter of x = 74.036 millimeters. 

(a) Construct a 99% two-sided confidence interval on the 
mean piston ring diameter. 

(b) Construct a 95% lower-confidence bound on the mean 
piston ring diameter. 


8-12. The life in hours of a 75-watt light bulb is known to be 

normally distributed with o = 25 hours. A random sample of 

20 bulbs has a mean life of x = 1014 hours. 

(a) Construct a 95% two-sided confidence interval on the 
mean life. 

(b) Construct a 95% lower-confidence bound on the mean 


(a) Construct a 95% two-sided confidence interval on mean 
compressive strength. 

(b) Construct a 99% two-sided confidence interval on mean 
compressive strength. Compare the width of this confi- 
dence interval with the width of the one found in part (a). 


8-14. Suppose that in Exercise 8-12 we wanted to be 95% 
confident that the error in estimating the mean life is less than 
five hours. What sample size should be used? 


8-15. Suppose that in Exercise 8-12 we wanted the total 
width of the two-sided confidence interval on mean life to be 
six hours at 95% confidence. What sample size should be 
used? 


8-16. Suppose that in Exercise 8-13 it is desired to estimate 
the compressive strength with an error that is less than 15 psi 
at 99% confidence. What sample size is required? 

8-17. By how much must the sample size n be increased if 
the length of the CI on p in Equation 8-7 is to be halved? 


8-18. If the sample size n is doubled, by how much is the 


length of the CI on u in Equation 8-7 reduced? What happens 
to the length of the interval if the sample size is increased by a 
factor of four? 


life. 
8-13. A civil engineer is analyzing the compressive strength 
of concrete. Compressive strength is normally distributed with 
o? = 1000(psi). A random sample of 12 specimens has a 
mean compressive strength of x = 3250 psi. 


8-3 CONFIDENCE INTERVAL ON THE MEAN OF A NORMAL 
DISTRIBUTION, VARIANCE UNKNOWN 


When we are constructing confidence intervals on the mean u of a normal population when 
o? is known, we can use the procedure in Section 8-2.1. This CI is also approximately valid 
(because of the central limit theorem) regardless of whether or not the underlying population 
is normal, so long as n 1s reasonably large (n = 40, say). As noted in Section 8-2.5, we can 
even handle the case of unknown variance for the large-sample-size situation. However, when 
the sample is small and o? is unknown, we must make an assumption about the form of the un- 
derlying distribution to obtain a valid CI procedure. A reasonable assumption in many cases is 
that the underlying distribution is normal. 

Many populations encountered in practice are well approximated by the normal distribu- 
tion, so this assumption will lead to confidence interval procedures of wide applicability. In 
fact, moderate departure from normality will have little effect on validity. When the assump- 
tion is unreasonable, an alternate is to use the nonparametric procedures in Chapter 15 that are 
valid for any underlying distribution. 

Suppose that the population of interest has a normal distribution with unknown mean pu 
and unknown variance o?. Assume that a random sample of size n, say X,, X, ..., X,, is avail- 
able, and let X and S? be the sample mean and variance, respectively. 

We wish to construct a two-sided CI on w. If the variance o? is known, we know that 
Z = (X — u)/(o/ Vn) has a standard normal distribution. When o? is unknown, a logical pro- 
cedure is to replace o with the sample standard deviation S. The random variable Z now be- 
comes T = (X — w)/(S/V/n). A logical question is what effect does replacing o by S have on the 
distribution of the random variable T? If n is large, the answer to this question is “very little," 
and we can proceed to use the confidence interval based on the normal distribution from 
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Section 8-2.5. However, n is usually small in most engineering problems, and in this situation 
a different distribution must be employed to construct the CI. 


8-3.1 The t Distribution 


Definition 
Let Xi, X5, ..., X, be a random sample from a normal distribution with unknown 
mean p and unknown variance o°. The random variable 


X- 


if (8-15) 


© S/Nn 


has a ¢ distribution with n — 1 degrees of freedom. 





The ¢ probability density function is 


NI SIS 1 
JO V/akT(k/2) [(x?/k) a 1 1)/2 
where k is the number of degrees of freedom. The mean and variance of the ¢ distribution are 
zero and kk — 2) (for k > 2), respectively. 

Several ¢ distributions are shown in Fig. 8-4. The general appearance of the t distribution is 
similar to the standard normal distribution in that both distributions are symmetric and 
unimodal, and the maximum ordinate value is reached when the mean u = 0. However, the t 
distribution has heavier tails than the normal; that is, it has more probability in the tails than the 
normal distribution. As the number of degrees of freedom k — ^o, the limiting form of the t dis- 
tribution is the standard normal distribution. Generally, the number of degrees of freedom for f 
are the number of degrees of freedom associated with the estimated standard deviation. 

Appendix Table IV provides percentage points of the ¢ distribution. We will let £, , be the 
value of the random variable T with k degrees of freedom above which we find an area 
(or probability) a. Thus, 1, , is an upper-tail 100a percentage point of the ¢ distribution with k 
degrees of freedom. This percentage point 1s shown in Fig. 8-5. In the Appendix Table IV the 
a values are the column headings, and the degrees of freedom are listed in the left column. To 


—00 « x «. 00 (8-16) 


k = 10 






k = œ [N (O, 1)] 


0 x ti_a,k =—ta,k O ta, k t 


Figure 8-4 Probability density functions of several t Figure 8-5 Percentage points of the t 
distributions. distribution. 
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illustrate the use of the table, note that the ¢-value with 10 degrees of freedom having an area 
of 0.05 to the right is f9 95, 19 = 1.812. That is, 


P(Tio > to.05,10) = P(Tio > 1.812) m 0.05 


Since the ¢ distribution is symmetric about zero, we have ti-a = —1,; that is, the t-value hav- 
ing an area of 1 — a to the right (and therefore an area of a to the left) 1s equal to the nega- 
tive of the t-value that has area o in the right tail of the distribution. Therefore, f 95 19 = 
—0.95,10 = —1.812. Finally, because f£, is the standard normal distribution, the familiar z, val- 
ues appear in the last row of Appendix Table IV. 


8-3.2 Development of the t Distribution (CD Only) 


8-3.3 The t Confidence Interval on p 


Definition 


It is easy to find a 100(1 — a) percent confidence interval on the mean of a normal distribu- 
tion with unknown variance by proceeding essentially as we did in Section 8-2.1. We know 
that the distribution of T = (X — u)/(S/ Vn) is t with n — 1 degrees of freedom. Letting 
ta n—-1 be the upper 100a/2 percentage point of the ¢ distribution with n — 1 degrees of 
freedom, we may write: 


Pid 9d € hpi) = Lo 


Or 


" 
P(t = Svan = T cd cer 


Rearranging this last equation yields 
P(X — ta/2n-15/ Vn = p < X+ ta2n-15/ VN) = l =Z Q (8-17) 


This leads to the following definition of the 100(1 — a) percent two-sided confidence inter- 
val on u. 


If x and s are the mean and standard deviation of a random sample from a normal 
distribution with unknown variance o°, a 100(1 — a) percent confidence interval 
on p. is given by 


Dey) Vie ES es n a yeni 0 (8-18) 


where 1,7? ,.., is the upper 100a/2 percentage point of the ¢ distribution with n — 1 
degrees of freedom. 





One-sided confidence bounds on the mean of a normal distribution are also of interest 
and are easy to find. Simply use only the appropriate lower or upper confidence limit from 
Equation 8-18 and replace ta/2,n-1 by fa,n-1- 
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EXAMPLE 8-4 


Normal probability plot 





20.5 90 
_ 80 
18.0 C 70 
: 
S 15.5 a 40 
z 30 
20 
> 13.0 
S 10 
— 
10.5 > 
1 
8.0 5 10 i15 -20 25 


Load at failure 
Figure 8-7 Normal probability 
plot of the load at failure data from 
Example 8-4. 


Figure 8-6 Box and whisker plot for the 
load at failure data in Example 8-4. 


An article in the journal Materials Engineering (1989, Vol. II, No. 4, pp. 275—281) describes 
the results of tensile adhesion tests on 22 U-700 alloy specimens. The load at specimen failure 
is as follows (in megapascals): 


19.8 10.1 14.9 ps 15.4 15.4 
15.4 18.5 H9 12:7 14.9 11.4 
11.4 14.1 17.6 16.7 15.8 
19.5 8.8 13.6 11.9 11.4 


The sample mean is x = 13.71, and the sample standard deviation is s = 3.55. Figures 8-6 
and 8-7 show a box plot and a normal probability plot of the tensile adhesion test data, re- 
spectively. These displays provide good support for the assumption that the population is nor- 
mally distributed. We want to find a 95% CI on pw. Since n = 22, we have n — 1 = 21 degrees 
of freedom for t, so to.02521 = 2.080. The resulting CI is 


X — tapn-1S/ Vn S p EF + fuas as/ Vn 
13.71 — 2.080(3.55)/ V22 < p < 13.71 + 2.080(3.55)/N/22 
i371 = 157 2%, = 13-71-45 1557 
1214 = 1525 


The CT is fairly wide because there is a lot of variability in the tensile adhesion test measurements. 


It is not as easy to select a sample size n to obtain a specified length (or precision of estima- 
tion) for this CI as it was in the known-o case because the length of the interval involves s (which 
is unknown before the data are collected), n, and 1,» ,..;. Note that the t-percentile depends on the 
sample size n. Consequently, an appropriate n can only be obtained through trial and error. The re- 
sults of this will, of course, also depend on the reliability of our prior "guess" for o. 


EXERCISES FOR SECTION 8-3 


8-19. Find the values of the following percentiles: 1o 995 15, 8-21. Determine the /-percentile that is required to construct 


to.05,10> £0.10,20> 10.005,25» ANd 19,901.30. 
8-20. Determine the f-percentile that is required to construct 
each of the following two-sided confidence intervals: 


each of the following one-sided confidence intervals: 
(a) Confidence level = 95%, degrees of freedom = 14 
(b) Confidence level = 99%, degrees of freedom = 19 


(a) Confidence level = 95%, degrees of freedom = 12 
(b) Confidence level = 95%, degrees of freedom = 24 
(c) Confidence level = 99%, degrees of freedom = 13 
(d) Confidence level = 99.9%, degrees of freedom = 15 


(c) Confidence level = 99.9%, degrees of freedom = 24 

8-22. A research engineer for a tire manufacturer is investi- 
gating tire life for a new rubber compound and has built 16 tires 
and tested them to end-of-life in a road test. The sample mean 
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and standard deviation are 60,139.7 and 3645.94 kilometers. 
Find a 9596 confidence interval on mean tire life. 


8-23. An Izod impact test was performed on 20 specimens of 
PVC pipe. The sample mean is x = 1.25 and the sample stan- 
dard deviation is s = 0.25. Find a 99% lower confidence 
bound on Izod impact strength. 


8-24. The brightness of a television picture tube can be eval- 
uated by measuring the amount of current required to achieve a 
particular brightness level. A sample of 10 tubes results in 
x = 317.2 and s = 15.7. Find (in microamps) a 99% confi- 
dence interval on mean current required. State any necessary 
assumptions about the underlying distribution of the data. 


8-25. A particular brand of diet margarine was analyzed to 

determine the level of polyunsaturated fatty acid (in percent- 

ages). A sample of six packages resulted in the following data: 

16.8, 17.2, 17.4, 16.9, 16.5, 17.1. 

(a) Is there evidence to support the assumption that the level 
of polyunsaturated fatty acid is normally distributed? 

(b) Find a 99% confidence interval on the mean p. Provide a 
practical interpretation of this interval. 

8-26. The compressive strength of concrete is being tested 

by a civil engineer. He tests 12 specimens and obtains the 

following data. 


2216 2237 2249 2204 
2223 2301 2281 2263 
2318 2255 2213 2295 


(a) Is there evidence to support the assumption that compres- 
sive strength is normally distributed? Does this data set 
support your point of view? Include a graphical display in 
your answer. 

(b) Construct a 95% two-sided confidence interval on the 
mean strength. 

(c) Construct a 95% lower-confidence bound on the mean 
strength. 


8-27. A machine produces metal rods used in an automobile 
suspension system. A random sample of 15 rods is selected, 
and the diameter is measured. The resulting data (in millime- 
ters) are as follows: 


8.24 8.25 8.20 8.23 8.24 
8.21 8.26 8.26 8.20 8.25 
8.23 8:23 8.19 8.28 8.24 


(a) Check the assumption of normality for rod diameter. 
(b) Find a 95% two-sided confidence interval on mean rod 
diameter. 


8-28. Rework Exercise 8-27 to compute a 95% lower con- 
fidence bound on rod diameter. Compare this bound with the 
lower limit of the two-sided confidence limit from Exercise 
8-27. Discuss why they are different. 


8-29. The wall thickness of 25 glass 2-liter bottles was meas- 
ured by a quality-control engineer. The sample mean was 
x — 4.05 millimeters, and the sample standard deviation was 
s = 0.08 millimeter. Find a 95% lower confidence bound for 
mean wall thickness. Interpret the interval you have obtained. 


8-30. An article in Nuclear Engineering International 
(February 1988, p. 33) describes several characteristics of fuel 
rods used in a reactor owned by an electric utility in Norway. 
Measurements on the percentage of enrichment of 12 rods 
were reported as follows: 


2.94 3.00 2.90 2.75 3.00 2:95 
2.90 2.73 2.05 2.82 2.51 3.05 


(a) Use a normal probability plot to check the normality as- 
sumption. 

(b) Find a 99% two-sided confidence interval on the mean 
percentage of enrichment. Are you comfortable with the 
statement that the mean percentage of enrichment is 2.95 
percent? Why? 


8-31. A postmix beverage machine is adjusted to release a 
certain amount of syrup into a chamber where it is mixed with 
carbonated water. A random sample of 25 beverages was 
found to have a mean syrup content of x — 1.10 fluid ounces 
and a standard deviation of s — 0.015 fluid ounces. Find a 
95% CI on the mean volume of syrup dispensed. 


8-32. An article in the Journal of Composite Materials 
(December 1989, Vol 23, p. 1200) describes the effect of delam- 
ination on the natural frequency of beams made from composite 
laminates. Five such delaminated beams were subjected to loads, 
and the resulting frequencies were as follows (in hertz): 


230.66, 233.05, 232.58, 229.48, 232.58 


Find a 9096 two-sided confidence interval on mean natural 
frequency. Is there evidence to support the assumption of nor- 
mality in the population? 


8-4 CONFIDENCE INTERVAL ON THE VARIANCE AND 
STANDARD DEVIATION OF A NORMAL POPULATION 


Sometimes confidence intervals on the population variance or standard deviation are needed. 
When the population is modeled by a normal distribution, the tests and intervals described in 
this section are applicable. The following result provides the basis of constructing these con- 


fidence intervals. 
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Definition 


Figure 8-8 Proba- 
bility density functions 
of several x? distribu- 
tions. 


Let X;, X5, ..., X, bea random sample from a normal distribution with mean u and 
variance o?, and let S? be the sample variance. Then the random variable 


(n — 1)S? 


x (8-19) 


has a chi-square ( X?) distribution with n — 1 degrees of freedom. 





The probability density function of a x” random variable is 


l — —X 
f(x) = PTD) o 3-0 (8-20) 


where k is the number of degrees of freedom. The mean and variance of the x^ distribution are 
k and 2k, respectively. Several chi-square distributions are shown in Fig. 8-8. Note that the 
chi-square random variable is nonnegative and that the probability distribution is skewed to 
the right. However, as k increases, the distribution becomes more symmetric. As k — co, the 
limiting form of the chi-square distribution is the normal distribution. 

The percentage points of the X? distribution are given in Table III of the Appendix. 
Define Xak as the percentage point or value of the chi-square random variable with k degrees 
of freedom such that the probability that X? exceeds this value is a. That is, 


oO 


PU? > x24) = | fü) du - 
Nak 
This probability is shown as the shaded area in Fig. 8-9(a). To illustrate the use of Table III, 
note that the areas a are the column headings and the degrees of freedom Ñ are given in the left 


column. Therefore, the value with 10 degrees of freedom having an area (probability) of 0.05 
to the right is X6.05.10 = 18.31.This value is often called an upper 5% point of chi-square with 


f(x) 
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Definition 


f (x) fx) 


0 XR id O X6.95, 10 X6.08, 10 
= 3.94 = 18.31 


(a) (b) 


Figure 8-9 Percentage point of the x’ distribution. (a) The percentage point X (b) The upper 
percentage point X6.05,10 = 18.31 and the lower percentage point X695.10 — 3.94. 


10 degrees of freedom. We may write this as a probability statement as follows: 


Conversely, a lower 5% point of chi-square with 10 degrees of freedom would be X695.10 — 3.94 
(from Appendix Table III). Both of these percentage points are shown in Figure 8-9(b). 
The construction of the 100(1 — a)96 CI for o? is straightforward. Because 


(n — 1)S? 


g^ 


X? - 


is chi-square with n — 1 degrees of freedom, we may write 


P(X anni sra X5/2.n-1) eub e 
so that 
(n — 1)S? 


2 _ 
c 5 asi) ze 


2 
P( xt NE = 


This last equation can be rearranged as 


2e EDS. "S (n — o 


2 — miU 
Xo/2,n—1 X1—2a/2,1-1 


This leads to the following definition of the confidence interval for o°. 


If s? is the sample variance from a random sample of n observations from a normal dis- 
tribution with unknown variance o^, then a 100(1 — œ)% confidence interval on o? is 


(n — Ds 


9 
Xo/2,n—1 


(n — 1)? 


9 
X1—-o/2,n-1 


Dec 


(8-21) 


where x2 Pai and X10 y2.n-1 are the upper and lower 100a/2 percentage points of 
the chi-square distribution with n — 1 degrees of freedom, respectively. A confidence 
interval for o has lower and upper limits that are the square roots of the correspon- 
ding limits in Equation 8-21. 
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It is also possible to find a 100(1 — a)% lower confidence bound or upper confidence bound 


on oO 


The 100(1 — «)% lower and upper confidence bounds on o? are 


(n — 1)s? 


2 
Xo,n—1 


respectively. 


EXAMPLE 8-5 


=o 


_ (n — Ds 





An automatic filling machine is used to fill bottles with liquid detergent. A random sample of 
20 bottles results in a sample variance of fill volume of s? = 0.0153 (fluid ounces)’. If the 
variance of fill volume is too large, an unacceptable proportion of bottles will be under- or 
overfilled. We will assume that the fill volume is approximately normally distributed. A 95% 


upper-confidence interval 1s found from Equation 8-22 as follows: 


B 2 
rm (n 1)s 
X0.95,19 
or 
, (19)0.0153 j 
~ 10.117 = 0.0287 (fluid ounce) 


This last expression may be converted into a confidence interval on the standard deviation o 
by taking the square root of both sides, resulting in 


o = 0.17 


Therefore, at the 95% level of confidence, the data indicate that the process standard deviation 


could be as large as 0.17 fluid ounce. 


EXERCISES FOR SECTION 8-4 


8-33. 

X.05,10» X 6.025,15: X0.01,12 X0.9520» X0.99,18» X 995,16 and X 6.00525 

8-34. Determine the x? percentile that is required to 

construct each of the following CIs: 

(a) Confidence level = 95%, degrees of freedom = 24, 
one-sided (upper) 

(b) Confidence level = 99%, degrees of freedom = 9, one- 
sided (lower) 

(c) Confidence level = 90%, degrees of freedom = 19, two- 
sided. 


8-35. A rivet is to be inserted into a hole. A random sample 
ofn = 15 parts is selected, and the hole diameter is measured. 


Determine the values of the following percentiles: 


The sample standard deviation of the hole diameter measure- 
ments is s = 0.008 millimeters. Construct a 99% lower confi- 
dence bound for o. 

8-36. The sugar content of the syrup in canned peaches is 
normally distributed. A random sample ofn = 10 cans yields 
a sample standard deviation of s — 4.8 milligrams. Find a 
95% two-sided confidence interval for o. 

8-37. Consider the tire life data in Exercise 8-22. Find a 
95% lower confidence bound for o°. 

8-38. Consider the Izod impact test data in Exercise 8-23. 
Find a 99% two-sided confidence interval for o°. 
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8-39. The percentage of titanium in an alloy used in aero- 8-40. Consider the hole diameter data in Exercise 8-35. 
space castings is measured in 51 randomly selected parts. The Construct a 99% two-sided confidence interval for o. 

sample standard deviation is s = 0.37. Construct a 95% two- — 8.41. Consider the sugar content data in Exercise 8-37. Find 
sided confidence interval for o. a 90% lower confidence bound for o. 


8-5 A LARGE-SAMPLE CONFIDENCE INTERVAL FOR A 
POPULATION PROPORTION 


Definition 


It is often necessary to construct confidence intervals on a population proportion. For exam- 
ple, suppose that a random sample of size n has been taken from a large (possibly infinite) 
population and that X(- n) observations in this sample belong to a class of interest. Then 
P = X/nis a point estimator of the proportion of the population p that belongs to this class. 
Note that n and p are the parameters of a binomial distribution. Furthermore, from Chapter 4 
we know that the sampling distribution of P is approximately normal with mean p and vari- 
ance p(1 — p)/n, if p is not too close to either 0 or 1 and if 7 is relatively large. Typically, to 
apply this approximation we require that np and n(1 — p) be greater than or equal to 5. We 
will make use of the normal approximation in this section. 


If n 1s large, the distribution of 


B ZU np 
Vnp(l — p) 


is approximately standard normal. 





To construct the confidence interval on p, note that 
PES Eu) ee] ese 


SO 


This may be rearranged as 


: |p(1 — p) : p(l — p) 
p| p- Za/2 n =p < P + Za/2 n = 1 — Q (8-23) 


The quantity Vp(1 — p)/n in Equation 8-23 is called the standard error of the point esti- 
mator P. Unfortunately, the upper and lower limits of the confidence interval obtained from 
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Definition 


EXAMPLE 8-6 


Equation 8-23 contain the unknown parameter p. However, as suggested at the end of Section 
8-2.5, a satisfactory solution is to replace p by P in the standard error, which results in 


" P(1 — P) E P(1 — P) 
PiPPa r PaPa pep eg (8-24) 


This leads to the approximate 100(1 — «)% confidence interval on p. 


If p is the proportion of observations in a random sample of size n that belongs to a 
class of interest, an approximate 100(1 — a)% confidence interval on the proportion 
p of the population that belongs to this class 1s 

; P(l - p) , P(1- Pp) 

1D = Zn 7 PE 7 (8-25) 


where z,/? is the upper a/2 percentage point of the standard normal distribution. 





This procedure depends on the adequacy of the normal approximation to the binomial. To 
be reasonably conservative, this requires that np and n(1 — p) be greater than or equal to 5. In 
situations where this approximation 1s inappropriate, particularly in cases where n is small, 
other methods must be used. Tables of the binomial distribution could be used to obtain a con- 
fidence interval for p. However, we could also use numerical methods based on the binomial 
probability mass function that are implemented in computer programs. 


In a random sample of 85 automobile engine crankshaft bearings, 10 have a surface finish that 
is rougher than the specifications allow. Therefore, a point estimate of the proportion of bear- 
ings in the population that exceeds the roughness specification is p — x/n — 10/85 — 0.12. 
A 95% two-sided confidence interval for p is computed from Equation 8-25 as 


R DU — P) : ÐU — p) 
P — Zoos n =P=P © Zoos " 


Or 
012 — 195, / Oo”) | 012 + L96 [ABD 
2 85 
which simplifies to 
0.05 =p x 0.19 


Choice of Sample Size 
Since P is the point estimator of p, we can define the error in estimating p by P as 
E=\|p- PJ. Note that we are approximately 100(1 — a)% confident that this error is less 
than Zan Vp(1 — p)/n. For instance, in Example 8-6, we are 95% confident that the sample 
proportion p — 0.12 differs from the true proportion p by an amount not exceeding 0.07. 

In situations where the sample size can be selected, we may choose n to be 100 (1 — a)% 
confident that the error is less than some specified value E. If we set E = zap Vp(1 — p)/n 
and solve for n, the appropriate sample size is 


EXAMPLE 8-7 
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An estimate of p is required to use Equation 8-26. If an estimate p from a previous sam- 
ple is available, it can be substituted for p in Equation 8-26, or perhaps a subjective estimate 
can be made. If these alternatives are unsatisfactory, a preliminary sample can be taken, p 
computed, and then Equation 8-26 used to determine how many additional observations are 
required to estimate p with the desired accuracy. Another approach to choosing n uses the fact 
that the sample size from Equation 8-26 will always be a maximum for p = 0.5 [that is, 
p(l — p) 5 0.25 with equality for p = 0.5], and this can be used to obtain an upper bound on 
n. In other words, we are at least 100(1 — a)% confident that the error in estimating p by p 
is less than Æ if the sample size is 





Consider the situation in Example 8-6. How large a sample is required if we want to be 95% 
confident that the error in using p to estimate p is less than 0.05? Using p = 0.12 as an initial 
estimate of p, we find from Equation 8-26 that the required sample size is 


2 2 
Z0.025 1.96 
= | ——] ə(1 — pb) = | ——] 0.12(0.88) = 163 
n ( 5 ) * Ê) (eos) (0.88) 


If we wanted to be at least 95% confident that our estimate p of the true proportion p was 
within 0.05 regardless of the value of p, we would use Equation 8-27 to find the sample size 


2 2 
Z0.025 1.96 
= = | ——_ 25) = 
. ( E em) 63 Dres 


Notice that if we have information concerning the value of p, either from a preliminary sam- 
ple or from past experience, we could use a smaller sample while maintaining both the desired 
precision of estimation and the level of confidence. 





One-Sided Confidence Bounds 
We may find approximate one-sided confidence bounds on p by a simple modification of 
Equation 8-25. 


The approximate 100(1 — a)% lower and upper confidence bounds are 


Bü - P pU - P 


=) IE (8-28) 


respectively. 
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EXERCISES FOR SECTION 8-5 


8-42. Of 1000 randomly selected cases of lung cancer, 823 
resulted in death within 10 years. Construct a 95% two-sided 
confidence interval on the death rate from lung cancer. 


8-43. How large a sample would be required in Exercise 
8-42 to be at least 95% confident that the error in estimating 
the 10-year death rate from lung cancer is less than 0.03? 


8-44. A random sample of 50 suspension helmets used by 
motorcycle riders and automobile race-car drivers was sub- 
jected to an impact test, and on 18 of these helmets some dam- 
age was observed. 

(a) Find a 9596 two-sided confidence interval on the true pro- 
portion of helmets of this type that would show damage 
from this test. 

(b) Using the point estimate of p obtained from the prelimi- 
nary sample of 50 helmets, how many helmets must be 
tested to be 95% confident that the error in estimating the 
true value of p is less than 0.02? 

(c) How large must the sample be if we wish to be at least 
95% confident that the error in estimating p is less than 
0.02, regardless of the true value of p? 


8-45. The Arizona Department of Transportation wishes to 
survey state residents to determine what proportion of the 
population would like to increase statewide highway speed 
limits to 75 mph from 65 mph. How many residents do they 
need to survey if they want to be at least 99% confident that 
the sample proportion is within 0.05 of the true proportion? 


8-46. A manufacturer of electronic calculators is interested 
in estimating the fraction of defective units produced. A ran- 
dom sample of 800 calculators contains 10 defectives. 
Compute a 99% upper-confidence bound on the fraction 
defective. 


8-47. A study is to be conducted of the percentage of home- 
owners who own at least two television sets. How large a 
sample is required if we wish to be 99% confident that the 
error in estimating this quantity is less than 0.017? 


8-48. The fraction of defective integrated circuits produced 
in a photolithography process is being studied. A random sam- 
ple of 300 circuits is tested, revealing 13 defectives. Find a 
95% two-sided CI on the fraction of defective circuits pro- 
duced by this particular tool. 


8-6 A PREDICTION INTERVAL FOR A FUTURE OBSERVATION 


In some problem situations, we may be interested in predicting a future observation of a 
variable. This is a different problem than estimating the mean of that variable, so a confidence 
interval is not appropriate. In this section we show how to obtain a 100(1 — a)% prediction 
interval on a future value of a normal random variable. 


Suppose that X, X5, .. 


., X, 1s a random sample from a normal population. We wish to 


predict the value X,,,, a single future observation. A point prediction of X,,, is X, 
the sample mean. The prediction error is X,,, — X. The expected value of the prediction 


error is 


E(Xi41 —-X)=p—-p=0 


and the variance of the prediction error is 


because the future observation, X, ,, is independent of the mean of the current sample X. The 
prediction error X,,, — X is normally distributed. Therefore 


rn — X 


l 
O Lo 


Definition 


EXAMPLE 8-8 
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has a standard normal distribution. Replacing o with S results in 
m AniS X 
1 l 
S D 


which has a : distribution with n — 1 degrees of freedom. Manipulating T as we have done previ- 
ously in the development of a CI leads to a prediction interval on the future observation X, , ,. 


A 100(1 — a)% prediction interval on a single future observation from a normal 
distribution is given by 


E: _ l 
Mee loj2,n—15 x la/2n-15 1 + n (8-29) 


The prediction interval for X,,, will always be longer than the confidence interval for u 
because there is more variability associated with the prediction error than with the error of es- 
timation. This is easy to see because the prediction error is the difference between two random 
variables (X,,, — X), and the estimation error in the Cl is the difference between one random 
variable and a constant (X — u). As n gets larger (n — o), the length of the CI decreases to 
zero, essentially becoming the single value u, but the length of the prediction interval 
approaches 2z,/.0. So as n increases, the uncertainty in estimating u goes to zero, although 
there will always be uncertainty about the future value X,,, even when there is no need to 
estimate any of the distribution parameters. 


Reconsider the tensile adhesion tests on specimens of U-700 alloy described in Example 8-4. 
The load at failure for n = 22 specimens was observed, and we found that x = 13.71 and 
s = 3.55. The 95% confidence interval on u was 12.14 S pw S 15.28. We plan to test 
a twenty-third specimen. A 95% prediction interval on the load at failure for this specimen is 


E = l 
X — fa/2,n-15 1+ = X,41 <x + loj2,n—15 ] + n 


| 1l 
13.71 — (2.080)3.55 y/1 + 55 = Xs 


6.16 < X4, < 21.26 


IA 


l 
13.71 + (2.080)3.55 A1 + 55 


A 


Notice that the prediction interval is considerably longer than the CI. 


EXERCISES FOR SECTION 8-6 


8-49. Consider the tire-testing data described in Exercise 8-22. 8-50. Consider the Izod impact test described in Exercise 8-23. 
Compute a 9596 prediction interval on the life of the next tire of Compute a 99% prediction interval on the impact strength of 
this type tested under conditions that are similar to those em- the next specimen of PVC pipe tested. Compare the length of 
ployed in the original test. Compare the length of the prediction the prediction interval with the length of the 99% CI on the 
interval with the length of the 9596 CI on the population mean. population mean. 
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8-51. Consider the television tube brightness test described 
in Exercise 8-24. Compute a 99% prediction interval on the 
brightness of the next tube tested. Compare the length of the 
prediction interval with the length of the 99% CI on the popu- 
lation mean. 


8-52. Consider the margarine test described in Exercise 8-25. 
Compute a 99% prediction interval on the polyunsaturated 
fatty acid in the next package of margarine that is tested. 
Compare the length of the prediction interval with the length 
of the 99% CI on the population mean. 


8-53. Consider the test on the compressive strength of con- 
crete described in Exercise 8-26. Compute a 90% prediction 
interval on the next specimen of concrete tested. 


8-54. Consider the suspension rod diameter measurements 
described in Exercise 8-27. Compute a 95% prediction inter- 
val on the diameter of the next rod tested. Compare the length 
of the prediction interval with the length of the 95% CI on the 
population mean. 

8-55. Consider the bottle wall thickness measurements 
described in Exercise 8-29. Compute a 90% prediction interval 
on the wall thickness of the next bottle tested. 


8-56. How would you obtain a one-sided prediction bound 
on a future observation? Apply this procedure to obtain a 95% 
one-sided prediction bound on the wall thickness of the next 
bottle for the situation described in Exercise 8-29. 


8-57. Consider the fuel rod enrichment data described 
in Exercise 8-30. Compute a 99% prediction interval on the 
enrichment of the next rod tested. Compare the length of the 
prediction interval with the length of the 95% CI on the 
population mean. 


8-58. Consider the syrup dispensing measurements de- 
scribed in Exercise 8-31. Compute a 95% prediction interval 
on the syrup volume in the next beverage dispensed. Compare 
the length of the prediction interval with the length of the 95% 
CI on the population mean. 


8-59. Consider the natural frequency of beams described 
in Exercise 8-32. Compute a 90% prediction interval on the 
diameter of the natural frequency of the next beam of this 
type that will be tested. Compare the length of the prediction 
interval with the length of the 95% CI on the population 
mean. 


8-7 TOLERANCE INTERVALS FOR A NORMAL DISTRIBUTION 


Consider a population of semiconductor processors. Suppose that the speed of these processors 
has a normal distribution with mean u = 600 megahertz and standard deviation o. = 30 mega- 
hertz. Then the interval from 600 — 1.96(30) = 541.2 to 600 + 1.96(30) = 658.8 megahertz 
captures the speed of 95% of the processors in this population because the interval from 
—1.96 to 1.96 captures 95% of the area under the standard normal curve. The interval from 
|. — Zy/2 tO W + Zan0 is called a tolerance interval. 

If u and o are unknown, we can use the data from a random sample of size n to compute 
x and s, and then form the interval (x — 1.96s, x + 1.96s). However, because of sampling 
variability in x and s, it 1s likely that this interval will contain less than 95% of the values in 
the population. The solution to this problem is to replace 1.96 by some value that will make 
the proportion of the distribution contained in the interval 9596 with some level of confidence. 


Fortunately, it is easy to do this. 


Definition 


A tolerance interval for capturing at least y% of the values in a normal distribution 
with confidence level 100(1 — a)% is 


Sn 


x + ks 


where K is a tolerance interval factor found in Appendix Table XI. Values are given 
for y = 90%, 95%, and 95% and for 95% and 99% confidence. 





EXAMPLE 8-9 
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One-sided tolerance bounds can also be computed. The tolerance factors for these bounds are 
also given in Appendix Table XI. 


Let’s reconsider the tensile adhesion tests originally described in Example 8-4. The load 
at failure for n = 22 specimens was observed, and we found that x = 31.71 and s = 3.55. 
We want to find a tolerance interval for the load at failure that includes 90% of the 
values in the population with 95% confidence. From Appendix Table XI the tolerance 
factor k for n = 22, y = 0.90, and 95% confidence is k = 2.264. The desired tolerance 
interval is 

(x — ks, x + ks) or [31.71 — (2.264)3.55, 31.71 + (2.264)3.55 | 


which reduces to (23.67, 39.75). We can be 95% confident that at least 90% of the values of 
load at failure for this particular alloy lie between 23.67 and 39.75 megapascals. 


From Appendix Table XI, we note that as n — œ, the value of k goes to the z-value associated 
with the desired level of containment for the normal distribution. For example, if we want 
90% of the population to fall in the two-sided tolerance interval, k approaches Zoos = 1.645 as 
n — ©, Note that as n > ©, a 100(1 — a)% prediction interval on a future value approaches a 


tolerance interval that contains 100(1 — a)% of the distribution. 


EXERCISES FOR SECTION 8-7 


8-60. Compute a 95% tolerance interval on the life of the 
tires described in Exercise 8-22, that has confidence level 
9595. Compare the length of the tolerance interval with the 
length of the 95% CI on the population mean. Which interval 
is shorter? Discuss the difference in interpretation of these 
two intervals. 


8-61. Consider the Izod impact test described in Exercise 
8-23. Compute a 99% tolerance interval on the impact 
strength of PVC pipe that has confidence level 90%. 
Compare the length of the tolerance interval with the length 
of the 99% CI on the population mean. Which interval is 
shorter? Discuss the difference in interpretation of these two 
intervals. 


8-62. Compute a 99% tolerance interval on the brightness 
of the television tubes in Exercise 8-24 that has confidence 
level 9596. Compare the length of the prediction interval with 
the length of the 99% CI on the population mean. Which 
interval 1s shorter? Discuss the difference in interpretation of 
these two intervals. 


8-63. Consider the margarine test described in Exercise 8-25. 
Compute a 99% tolerance interval on the polyunsaturated 
fatty acid in this particular type of margarine that has confi- 
dence level 95%. Compare the length of the prediction in- 
terval with the length of the 99% CI on the population mean. 
Which interval is shorter? Discuss the difference in inter- 
pretation of these two intervals. 


8-64. Compute a 9096 tolerance interval on the compres- 
sive strength of the concrete described in Exercise 8-26 that 
has 9096 confidence. 


8-65. Compute a 959^ tolerance interval on the diameter of 
the rods described in Exercise 8-27 that has 9096 confidence. 
Compare the length of the prediction interval with the length 
of the 95% CI on the population mean. Which interval is 
shorter? Discuss the difference in interpretation of these two 
intervals. 


8-66. Consider the bottle wall thickness measurements 
described in Exercise 8-29. Compute a 90% tolerance interval 
on bottle wall thickness that has confidence level 90%. 


8-67. Consider the bottle wall thickness measurements 
described in Exercise 8-29. Compute a 90% lower tolerance 
bound on bottle wall thickness that has confidence level 
90%. Why would a lower tolerance bound likely be of 
interest here? 


8-68. Consider the fuel rod enrichment data described in 
Exercise 8-30. Compute a 99% tolerance interval on rod 
enrichment that has confidence level 95%. Compare the 
length of the prediction interval with the length of the 9596 
CI on the population mean. 


8-69. Compute a 95% tolerance interval on the syrup vol- 
ume described in Exercise 8-31 that has confidence level 909^. 
Compare the length of the prediction interval with the length 
of the 95% CI on the population mean. 


ZZ 


Supplemental Exercises 


8-70. Consider the confidence interval for p. with known 
standard deviation o: 


x Z,,0/Vn = um z,,0/ Vn 


where a, + a, = a. Let a = 0.05 and find the interval for 
Q, = a = a/2 = 0.025. Now find the interval for the case 
a, = 0.01 and a, = 0.04. Which interval is shorter? Is there 
any advantage to a “symmetric” confidence interval? 


8-71. A normal population has a known mean 50 and 

unknown variance. 

(a) A random sample of n = 16 is selected from this popula- 
tion, and the sample results are x = 52 and s = 8. How 
unusual are these results? That is, what 1s the probability 
of observing a sample average as large as 52 (or larger) if 
the known, underlying mean is actually 50? 

(b) A random sample of n = 30 is selected from this popula- 
tion, and the sample results are x = 52 and s = 8. How 
unusual are these results? 

(c) A random sample of n = 100 is selected from this popula- 
tion, and the sample results are x — 52 and s — 8. How 
unusual are these results? 

(d) Compare your answers to parts (a)-(c) and explain why 
they are the same or differ. 


8-72. A normal population has known mean u = 50 and 

variance o? — 5. What is the approximate probability that the 

sample variance is greater than or equal to 7.44? less than or 

equal to 2.56? 

(a) For a random sample of n — 16. 

(b) For a random sample of n = 30. 

(c) For a random sample of n — 71. 

(d) Compare your answers to parts (a)-(c) for the approxi- 
mate probability that the sample variance is greater than 
or equal to 7.44. Explain why this tail probability is 
increasing or decreasing with increased sample size. 

(e) Compare your answers to parts (a)-(c) for the approxi- 
mate probability that the sample variance 1s less than or 
equal to 2.56. Explain why this tail probability is increas- 
ing or decreasing with increased sample size. 


8-73. An article in the Journal of Sports Science (1987, Vol. 
5, pp. 261—271) presents the results of an investigation of the 
hemoglobin level of Canadian Olympic ice hockey players. 
The data reported are as follows (in g/dl): 


15.3 16.0 14.4 16.2 16.2 
14.9 15.7 15.3 14.6 15.7 
16.0 15.0 15.7 16.2 14.7 
14.8 14.6 15.6 14.5 152 
(a) Given the following probability plot of the data, what is a 


logical assumption about the underlying distribution of 
the data? 
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Percentage 
O1 
Oo 





13.5 14.0 145. 150. 15.5 16.0 16.5 17.0 
Hemoglobin Level 


(b) Explain why this check of the distribution underlying the 
sample data is important if we want to construct a confi- 
dence interval on the mean. 

(c) Based on this sample data, a 95% confidence interval for 
the mean is (15.04, 15.62). Is it reasonable to infer that the 
true mean could be 14.5? Explain your answer. 

(d) Explain why this check of the distribution underlying the 
sample data is important if we want to construct a confi- 
dence interval on the variance. 

(e) Based on this sample data, a 95% confidence interval 
for the variance is (0.22, 0.82). Is it reasonable to infer 
that the true variance could be 0.35? Explain your 
answer. 

(f) Is it reasonable to use these confidence intervals to draw 
an inference about the mean and variance of hemoglobin 
levels 

(1) of Canadian doctors? Explain your answer. 
(11) of Canadian children ages 6—12? Explain your answer. 


8-74. The article “Mix Design for Optimal Strength 
Development of Fly Ash Concrete" (Cement and Concrete 
Research, 1989, Vol. 19, No. 4, pp. 634—640) investigates 
the compressive strength of concrete when mixed with fly 
ash (a mixture of silica, alumina, iron, magnesium oxide, 
and other ingredients). The compressive strength for nine 
samples in dry conditions on the twenty-eighth day are as 
follows (in megapascals): 


40.2 30.4 28.9 30.5 22.4 
25.8 18.4 14.2 15.5 


(a) Given the following probability plot of the data, what 1s a 
logical assumption about the underlying distribution of 
the data? 


(b 


Percentage 


— 


(c) 


(d 


— 


(e) 


(f) 


(g 


(h 


) 


— 
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50 


Strength 


Find a 99% lower one-sided confidence interval on mean 
compressive strength. Provide a practical interpretation of 
this interval. 

Find a 98% two-sided confidence interval on mean com- 
pressive strength. Provide a practical interpretation of this 
interval and explain why the lower end-point of the inter- 
val is or is not the same as in part (b). 

Find a 99% upper one-sided confidence interval on the 
variance of compressive strength. Provide a practical in- 
terpretation of this interval. 

Find a 9896 two-sided confidence interval on the variance 
of compression strength. Provide a practical interpretation 
of this interval and explain why the upper end-point of the 
interval 1s or is not the same as in part (d). 

Suppose that it was discovered that the largest observation 
40.2 was misrecorded and should actually be 20.4. Now 
the sample mean x = 23 and the sample variance 
s? — 36.9. Use these new values and repeat parts (c) 
and (e). Compare the original computed intervals and the 
newly computed intervals with the corrected observation 
value. How does this mistake affect the values of the sam- 
ple mean, sample variance, and the width of the two-sided 
confidence intervals? 

Suppose, instead, that it was discovered that the largest 
observation 40.2 1s correct, but that the observation 25.8 1s 
incorrect and should actually be 24.8. Now the sample 
mean x = 25 and the sample variance s = 8.41. Use these 
new values and repeat parts (c) and (e). Compare the origi- 
nal computed intervals and the newly computed intervals 
with the corrected observation value. How does this mis- 
take affect the values of the sample mean, sample variance, 
and the width of the two-sided confidence intervals? 

Use the results from parts (f) and (g) to explain the effect 
of mistakenly recorded values on sample estimates. 
Comment on the effect when the mistaken values are near 
the sample mean and when they are not. 
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8-75.  Anoperating system for a personal computer has been 

studied extensively, and it is known that the standard deviation 

of the response time following a particular command is o = 8 

milliseconds. A new version of the operating system is 

installed, and we wish to estimate the mean response time for 

the new system to ensure that a 95% confidence interval for u 

has length at most 5 milliseconds. 

(a) If we can assume that response time is normally distributed 
and that o = 8 for the new system, what sample size would 
you recommend? 

(b) Suppose that we are told by the vendor that the standard 
deviation of the response time of the new system is 
smaller, say o = 6; give the sample size that you recom- 
mend and comment on the effect the smaller standard 
deviation has on this calculation. 


8-76. Consider the hemoglobin data in Exercise 8-73. Find 

the following: 

(a) An interval that contains 95% of the hemoglobin values 
with 90% confidence. 

(b) An interval that contains 99% of the hemoglobin values 
with 90% confidence. 


8-77. Consider the compressive strength of concrete data 
from Exercise 8-74. Find a 95% prediction interval on the 
next sample that will be tested. 


8-78. The maker of a shampoo knows that customers like 

this product to have a lot of foam. Ten sample bottles of the 

product are selected at random and the foam heights observed 

are as follows (in millimeters): 210, 215, 194, 195, 211, 201, 

198, 204, 208, and 196. 

(a) Is there evidence to support the assumption that foam 
height is normally distributed? 

(b) Find a 95% CI on the mean foam height. 

(c) Find a 95% prediction interval on the next bottle of sham- 
poo that will be tested. 

(d) Find an interval that contains 95% of the shampoo foam 
heights with 99% confidence. 

(e) Explain the difference in the intervals computed in parts 
(b), (c), and (d). 

8-79. During the 1999 and 2000 baseball seasons, there was 

much speculation that the unusually large number of home 

runs that were hit was due at least in part to a livelier ball. One 

way to test the “liveliness” of a baseball is to launch the ball at 

a vertical surface with a known velocity V, and measure the 

ratio of the outgoing velocity Vo of the ball to V}. The ratio 

R = Vo/V, is called the coefficient of restitution. Following 

are measurements of the coefficient of restitution for 40 

randomly selected baseballs. The balls were thrown from a 

pitching machine at an oak surface. 


0.6248 0.6237 0.6118 0.6159 0.6298 0.6192 
0.6520 0.6368 0.6220 0.6151 0.6121 0.6548 
0.6226 0.6280 0.6096 0.6300 0.6107 0.6392 
0.6230 0.6131 0.6223 0.6297 0.6435 0.5978 
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0.6351 0.6275 0.6261 0.6262 0.6262 0.6314 
0.6128 0.6403 0.6521 0.6049 0.6170 
0.6134 0.6310 0.6065 0.6214 0.6141 


(a) Is there evidence to support the assumption that the coef- 
ficient of restitution is normally distributed? 

(b) Find a 99% CI on the mean coefficient of restitution. 

(c) Find a 99% prediction interval on the coefficient of resti- 
tution for the next baseball that will be tested. 

(d) Find an interval that will contain 99% of the values of the 
coefficient of restitution with 95% confidence. 

(e) Explain the difference in the three intervals computed in 
parts (b), (c), and (d). 

8-80. Consider the baseball coefficient of restitution data in 

Exercise 8-79. Suppose that any baseball that has a coefficient 

of restitution that exceeds 0.635 is considered too lively. 

Based on the available data, what proportion of the baseballs 

in the sampled population are too lively? Find a 95% lower 

confidence bound on this proportion. 


8-81. An article in the ASCE Journal of Energy Engineering 

(“Overview of Reservoir Release Improvements at 20 TVA 

Dams,” Vol. 125, April 1999, pp. 1—17) presents data on 

dissolved oxygen concentrations in streams below 20 dams in 

the Tennessee Valley Authority system. The observations are (in 

milligrams per liter): 5.0, 3.4, 3.9, 1.3, 0.2, 0.9, 2.7, 3.7, 3.8, 4.1, 

1.0, 1.0, 0.8, 0.4, 3.8, 4.5, 5.3, 6.1, 6.9, and 6.5. 

(a) Is there evidence to support the assumption that the 
dissolved oxygen concentration is normally distributed? 

(b) Find a 95% CI on the mean dissolved oxygen concentra- 
tion. 

(c) Find a 95% prediction interval on the dissolved oxygen 
concentration for the next stream in the system that will be 
tested. 

(d) Find an interval that will contain 95% of the values of the 
dissolved oxygen concentration with 99% confidence. 

(e) Explain the difference in the three intervals computed in 
parts (b), (c), and (d). 


8-82. The tar content in 30 samples of cigar tobacco 
follows: 

1.542 1.585 1.532 1.466 1.499 1.611 
1.622 1.466 1.546 1.494 1.548 1.626 
1.440 1.608 1.520 1.478 1.542 1.511 
1.459 1.533 1.532 1.523 1.597 1.487 
1.598 1.498 1.600 1.504 1.545 1.558 


(a) Is there evidence to support the assumption that the tar 
content is normally distributed? 
(b) Find a 99% CI on the mean tar content. 


(c) Find a 99% prediction interval on the tar content for the 
next observation that will be taken on this particular type 
of tobacco. 

(d) Find an interval that will contain 99% of the values of the 
tar content with 9596 confidence. 

(e) Explain the difference in the three intervals computed in 
parts (b), (c), and (d). 

8-83. A manufacturer of electronic calculators takes a 

random sample of 1200 calculators and finds that there are 

eight defective units. 

(a) Construct a 95% confidence interval on the population 
proportion. 

(b) Is there evidence to support a claim that the fraction of 
defective units produced is 1% or less? 


8-84. An article in The Engineer (“Redesign for Suspect 
Wiring,” June 1990) reported the results of an investigation 
into wiring errors on commercial transport aircraft that may 
produce faulty information to the flight crew. Such a wiring 
error may have been responsible for the crash of a British 

Midland Airways aircraft in January 1989 by causing the pilot 

to shut down the wrong engine. Of 1600 randomly selected 

aircraft, eight were found to have wiring errors that could 
display incorrect information to the flight crew. 

(a) Find a 99% confidence interval on the proportion of air- 
craft that have such wiring errors. 

(b) Suppose we use the information in this example to 
provide a preliminary estimate of p. How large a sample 
would be required to produce an estimate of p that we are 
99% confident differs from the true value by at most 
0.008? 

(c) Suppose we did not have a preliminary estimate of p. How 
large a sample would be required if we wanted to be at 
least 99% confident that the sample proportion differs 
from the true proportion by at most 0.008 regardless of the 
true value of p? 

(d) Comment on the usefulness of preliminary information in 
computing the needed sample size. 


8-85. An article in Engineering Horizons (Spring 1990, 
p. 26) reported that 117 of 484 new engineering graduates 
were planning to continue studying for an advanced degree. 

Consider this as a random sample of the 1990 graduating 

class. 

(a) Find a 90% confidence interval on the proportion of such 
graduates planning to continue their education. 

(b) Find a 95% confidence interval on the proportion of such 
graduates planning to continue their education. 

(c) Compare your answers to parts (a) and (b) and explain 
why they are the same or different. 

(d) Could you use either of these confidence intervals to 
determine whether the proportion is actually 0.25? 
Explain your answer. Hint: Use the normal approximation 
to the binomial. 
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MIND-EXPANDING EXERCISES 


8-86. An electrical component has a time-to-failure 
(or lifetime) distribution that is exponential with param- 
eter \, so the mean lifetime is ju. = 1/A. Suppose that a 
sample of n of these components is put on test, and let 
X; be the observed lifetime of component 7. The test con- 
tinues only until the rth unit fails, where r < n. This re- 
sults in a censored life test. Let X, denote the time at 
which the first failure occurred, X, denote the time at 
which the second failure occurred, and so on. Then the 
total lifetime that has been accumulated at test termina- 
tion is 
r 
gu NC u 7) 
5 


We have previously shown in Exercise 7-72 that T,/r is 

an unbiased estimator for pu. 

(a) It can be shown that 2A 7, has a chi-square distribution 
with 2r degrees of freedom. Use this fact to develop a 
100(1 — o)96 confidence interval for mean lifetime 
w= 1/X. 

(b) Suppose 20 units were put on test, and the test 
terminated after 10 failures occurred. The failure 
times (in hours) are 15, 18, 19, 20, 21, 21, 22, 27, 
28, 29. Find a 95% confidence interval on mean 
lifetime. 


8-87. Consider a two-sided confidence interval for 
the mean p. when o is known; 


xc ZU QUIT oan sm PE z,,0/ Vn 


where a, + a) = a. If a, = a, = a/2, we have the usual 
100(1 — a)% confidence interval for u. In the above, 
when a, # o», the interval is not symmetric about p. 
The length of the interval is L = o(z,, + z,,)/ Vn. 
Prove that the length of the interval L is minimized when 
o, = œ = a/2. Hint: Remember that P(z,) = 1 — o, 
so ®'(1 — a) = z,, and the relationship between the 
derivative of a function y = f(x) and the inverse 
x —f' (y)is(d/dy)f (y) = V[(d/dx) f(x)). 

8-88. It is possible to construct a nonparametric tol- 
erance interval that is based on the extreme values in a 
random sample of size n from any continuous population. 
If p is the minimum proportion of the population con- 
tained between the smallest and largest sample observa- 
tions with confidence 1 — a, it can be shown that 


nc 


n" —(n-lp-a 


and n 1s approximately 








(a) In order to be 95% confident that at least 90% of the 
population will be included between the extreme 
values of the sample, what sample size will be re- 
quired? 

(b) A random sample of 10 transistors gave the follow- 
ing measurements on saturation current (in mil- 
liamps): 10.25, 10.41, 10.30, 10.26, 10.19, 10.37, 
10.29, 10.34, 10.23, 10.38. Find the limits that con- 
tain a proportion p of the saturation current meas- 
urements at 95% confidence. What is the proportion 
p contained by these limits? 


8-89. Suppose that Xj, X5, ... , X, 1s a random 
sample from a continuous probability distribution 
with median p.. 

(a) Show that 


Hint: The complement of the event [min(X;) < Ñ 

< max(X)] is [max(X) = fi] U [min(X) = fi], but 

max(X;) = Ñ if and only if X; < fi for all i. | 

(b) Write down a 100(1 — a)% confidence interval for 
the median (X, where 


8-90. Students in the industrial statistics lab at ASU 
calculate a lot of confidence intervals on u. Suppose all 
these CIs are independent of each other. Consider the 
next one thousand 95% confidence intervals that will be 
calculated. How many of these CIs do you expect to 
capture the true value of u? What is the probability that 
between 930 and 970 of these intervals contain the true 
value of u? 
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IMPORTANT TERMS AND CONCEPTS 


In the E-book, click on any 
term or concept below to 
go to that subject. 

Confidence coefficient 

Confidence interval 

Confidence interval for 
a population 
proportion 

Chi-squared distribution 


Confidence intervals for 
the mean of a 
normal distribution 
Confidence interval for 
the variance of a 
normal distribution 
Confidence level 
Error in estimation 
Large sample confidence 
intervals 


One-sided confidence 
bounds 

Precision of parameter 
estimation 

Prediction interval 

Tolerance interval 

Two-sided confidence 
interval 

t distribution 


CD MATERIAL 


Boostrap samples 

Percentile method for 
boostrap confidence 
intervals 


8-2.6 Bootstrap Confidence Intervals (CD Only) 


In Section 7-2.5 we showed how a technique called the bootstrap could be used to estimate 
the standard error o $, where @ is an estimate of a parameter 0. We can also use the bootstrap 
to find a confidence interval on the parameter 0. To illustrate, consider the case where 0 is the 
mean u of a normal distribution with o known. Now the estimator of 0 is X. Also notice that 
Zy/20/ Vn is the 100(1 — o/2) percentile of the distribution of X — u, and —Z9/20/ Vn is 


the 100(a/2) percentile of this distribution. Therefore, we can write the probability statement 
associated with the 100(1 — a)% confidence interval as 


P(100(a/2) percentile = X — p = 100(1 — o/2) percentile) = 1 — a 


Or 


P(X — 100(1 — a/2) percentile = j. - X — 100(o/2) percentile) = 1 — a 


This last probability statement implies that the lower and upper 100(1 — a)% confidence lim- 
its for u are 


L = X — 100(1 — a/2) percentile of X — u = X — z,pn0/ Vn 
U = X — 100(a/2) percentile of X — u = X + Za20/ Nn 


We may generalize this to an arbitrary parameter 0. The 100(1 — a@)% confidence limits 
for 0 are 


L=6- 100(1 — a/2) percentile of 6 — 0 
U = 6 — 100(a/2) percentile of 6 — 0 


Unfortunately, the percentiles of Ó — 0 may not be as easy to find as in the case of the normal 
distribution mean. However, they could be estimated from bootstrap samples. Suppose we 
find B ? bootstrap samples and calculate 6;, 65, ... , 65 and 0 and then calculate 6, — 9, 
6; - 0,..., 0; 5 — 0 . The required percentiles can re obtained directly from the differences. 
For M if B = 200 and a 95% confidence interval on 0 is desired, the fifth smallest and 
fifth largest of the differences 6, — @ are the estimates of the necessary percentiles. 

We will illustrate this procedure using the situation first described in Example 7-3, 
involving the parameter A of an exponential distribution. Following that example, a random 
sample of n = 8 engine controller modules were tested to failure, and the estimate of A 
obtained was A = 0.0462, where A=1 /X is a maximum likelihood estimator. We used 200 
bootstrap samples to obtain an estimate of the standard error for A. 

Figure S8-1(a) is a histogram of the 200 bootstrap estimates x 7 i= 1,2,...,200. Notice 
that the histogram is not symmetrical and is skewed to the right, indicating that the sam- 
pling distribution of À also has this same shape. We subtracted the sample average of these 
bootstrap estimates A * = 0.5013 from each A. The histogram of the differences Â; - N, 
= 1, 2,..., 200, is shown in Figure S8-1(b). Suppo we wish to. find a 90% COH donee ies 
val for A. Now the fifth percentile of the bootstrap samples Â; — à is —0.0228 and the ninety- 
fifth percentile is 0.03135. Therefore the lower and upper 90% bootstrap confidence limits are 


L = À — 95 percentile ofÁ; — A` = 0.0462 — 0.03135 = 0.0149 
U = À— 5 percentileofÁ; — A” = 0.0462 — (—0.0228) = 0.0690 
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Figure S8-1 Histograms of the bootstrap estimates of A and the differences Â; — Á : used in finding the bootstrap 
confidence interval. 


Therefore, our 90% bootstrap confidence interval for A is 0.0149 = X S 0.0690. There is an 
exact confidence interval for the parameter \ in an exponential distribution. For the engine 
controller failure data following Example 7-3, the exact 90% confidence interval* for A is 
0.0230 = X S 0.0759. Notice that the two confidence intervals are very similar. The length of 
the exact confidence interval is 0.0759 — 0.0230 = 0.0529, while the length of the bootstrap 
confidence interval is 0.0690 — 0.0149 = 0.0541, which is only slightly longer. The per- 
centile method for bootstrap confidence intervals works well when the estimator is unbiased 
and the standard error of 6 is approximately constant (as a function of 0). An improvement, 
known as the bias-corrected and accelerated method, adjusts the percentiles in more general 
cases. It could be applied in this example (because À is a biased estimator), but at the cost of 
additional complexity. 


8-3.2 Development of the t-Distribution (CD Only) 
We will give a formal development of the f¢-distribution using the techniques presented in 


Section 5-8. It will be helpful to review that material before reading this section. 
First consider the random variable 





This quantity can be written as 





c Vn (S8-1) 





*The confidence interval is X now X “Az Xi opa > x;) where XÀ2.2n and X1 535 are the lower and 
upper a/2 percentage points of the chi-square distribution (which was introduced briefly in Chapter 4 and discussed 
further in Section 8-4), and the x; are the 7 sample observations. 


Theorem S8-1: 
The 
t-Distribution 
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Now the numerator of Equation S8-1 is a standard normal random variable. The ratio $7/o? in 
the denominator is a chi-square random variable with n — 1 degrees of freedom, divided by 
the number of degrees of freedom*. Furthermore, the two random variables in Equation 
S8-1, X and S, are independent. We are now ready to state and prove the main result. 


Let Z be a standard normal random variable and V be a chi-square random variable with 
k degrees of freedom. If Z and V are independent, the distribution of the random variable 


Z 
T= 


V V/k 


is the ¢-distribution with k degrees of freedom. The probability density function is 


f = TE EG esr Ud 1 
= V/skT (k/2) (2/k) n ese , 


—00 « f «00 





Proof Since Z and V are independent, their probability distribution is 
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Define a new random variable U — V. Thus, the inverse solutions of 





Z 
t — 
Vv/k 
and 
u-—v 
are 
í k 
and 
v=u 
The Jacobian is 
u t m 
JE k 2uk| ^ Nk 
0 l 





*We use the fact that (n — 1)S 2/9? follows a chi-square distribution with n — 1 degrees of freedom in Section 8-4 
to find a confidence interval on the variance and standard deviation of a normal distribution. 
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Thus, 
JI - 
k 
and the joint probability distribution of T and U is 
fru(t, u) = — yk/2)— 15-7 ue + u]/2 
vasior( 2) 


Now, since V > 0, we must require that U > 0, and since —o» < Z < o, =œ% < T < œ. On 
rearranging this last equation, we have 


l 
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The probability distribution of T is found by 


fr(t) = | nct u) du 


- —— | FED Q2] Jy 
V 2mk2 PT (5) 0 
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This is the distribution given in Theorem S8-1. 

The probability distribution of the random variable T was first published by W. S. Gosset 
in a famous 1908 paper. Gosset was employed by the Guiness Brewers in Ireland. Since his 
employer discouraged publication of employee research, Gosset published these results under 
the pseudonym “Student.” As a result, this probability distribution is sometimes called the 
Student r-distribution. 
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LEARNING OBJECTIVES 


After careful study of this chapter, you should be able to do the following: 

. Structure engineering decision-making problems as hypothesis tests 

. Test hypotheses on the mean of a normal distribution using either a Z-test or a t-test procedure 
. Test hypotheses on the variance or standard deviation of a normal distribution 

. Test hypotheses on a population proportion 


. Use the P-value approach for making decisions in hypotheses tests 


Nm BB WO HO mz 


. Compute power, type II error probability, and make sample size selection decisions for tests on 
means, variances, and proportions 


7. Explain and use the relationship between confidence intervals and hypothesis tests 
8. Use the chi-square goodness of fit test to check distributional assumptions 


9. Use contingency table tests 


CD MATERIAL 
10. Appreciate the likelihood ratio approach to construction of test statistics 


11. Conduct small sample tests on a population proportion 


Answers for many odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e-Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e-Text immediately following the section they accompany. 


9-1 HYPOTHESIS TESTING 


9-1.1 Statistical Hypotheses 


Definition 


In the previous chapter we illustrated how to construct a confidence interval estimate of a pa- 
rameter from sample data. However, many problems in engineering require that we decide 
whether to accept or reject a statement about some parameter. The statement is called a 
hypothesis, and the decision-making procedure about the hypothesis is called hypothesis 
testing. This is one of the most useful aspects of statistical inference, since many types of 
decision-making problems, tests, or experiments in the engineering world can be formulated 
as hypothesis-testing problems. Furthermore, as we will see, there is a very close connection 
between hypothesis testing and confidence intervals. 

Statistical hypothesis testing and confidence interval estimation of parameters are the funda- 
mental methods used at the data analysis stage of a comparative experiment, in which the engi- 
neer is interested, for example, in comparing the mean of a population to a specified value. These 
simple comparative experiments are frequently encountered in practice and provide a good foun- 
dation for the more complex experimental design problems that we will discuss in Chapters 13 
and 14. In this chapter we discuss comparative experiments involving a single population, and our 
focus is on testing hypotheses concerning the parameters of the population. 

We now give a formal definition of a statistical hypothesis. 


A statistical hypothesis is a statement about the parameters of one or more populations. 
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Since we use probability distributions to represent populations, a statistical hypothesis 
may also be thought of as a statement about the probability distribution of a random variable. 
The hypothesis will usually involve one or more parameters of this distribution. 

For example, suppose that we are interested in the burning rate of a solid propellant used 
to power aircrew escape systems. Now burning rate is a random variable that can be described 
by a probability distribution. Suppose that our interest focuses on the mean burning rate (a 
parameter of this distribution). Specifically, we are interested in deciding whether or not the 
mean burning rate is 50 centimeters per second. We may express this formally as 


Ho: u = 50 centimeters per second 


Hi: u # 50 centimeters per second (9-1) 


The statement Ho: ju = 50 centimeters per second in Equation 9-1 is called the null 
hypothesis, and the statement H,: ju # 50 centimeters per second is called the alternative 
hypothesis. Since the alternative hypothesis specifies values of u that could be either greater 
or less than 50 centimeters per second, it is called a two-sided alternative hypothesis. In some 
situations, we may wish to formulate a one-sided alternative hypothesis, as in 


Ho: u = 50 centimeters per second Ho: uw = 50 centimeters per second 


or (9-2) 
Hi: u < 50 centimeters per second Hi: u > 50 centimeters per second 


It is important to remember that hypotheses are always statements about the population or 
distribution under study, not statements about the sample. The value of the population param- 
eter specified in the null hypothesis (50 centimeters per second in the above example) is usu- 
ally determined in one of three ways. First, it may result from past experience or knowledge 
of the process, or even from previous tests or experiments. The objective of hypothesis testing 
then is usually to determine whether the parameter value has changed. Second, this value may 
be determined from some theory or model regarding the process under study. Here the objec- 
tive of hypothesis testing is to verify the theory or model. A third situation arises when the 
value of the population parameter results from external considerations, such as design or en- 
gineering specifications, or from contractual obligations. In this situation, the usual objective 
of hypothesis testing is conformance testing. 

A procedure leading to a decision about a particular hypothesis is called a test of a 
hypothesis. Hypothesis-testing procedures rely on using the information in a random sample 
from the population of interest. If this information is consistent with the hypothesis, we will con- 
clude that the hypothesis is true; however, if this information is inconsistent with the hypothesis, 
we will conclude that the hypothesis is false. We emphasize that the truth or falsity of a particu- 
lar hypothesis can never be known with certainty, unless we can examine the entire population. 
This is usually impossible in most practical situations. Therefore, a hypothesis-testing procedure 
should be developed with the probability of reaching a wrong conclusion in mind. 

The structure of hypothesis-testing problems is identical in all the applications that we 
will consider. The null hypothesis is the hypothesis we wish to test. Rejection of the null 
hypothesis always leads to accepting the alternative hypothesis. In our treatment of hypothe- 
sis testing, the null hypothesis will always be stated so that it specifies an exact value of the 
parameter (as in the statement Hp: ju = 50 centimeters per second in Equation 9-1). The 
alternate hypothesis will allow the parameter to take on several values (as in the statement 
Hi: u # 50 centimeters per second in Equation 9-1). Testing the hypothesis involves taking 
a random sample, computing a test statistic from the sample data, and then using the test 
statistic to make a decision about the null hypothesis. 
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9.1.2 Tests of Statistical Hypotheses 


Definition 


Definition 


To illustrate the general concepts, consider the propellant burning rate problem introduced 
earlier. The null hypothesis is that the mean burning rate is 50 centimeters per second, and the 
alternate is that it is not equal to 50 centimeters per second. That is, we wish to test 


Ho: u = 50 centimeters per second 


Hi: u # 50 centimeters per second 


Suppose that a sample of n = 10 specimens is tested and that the sample mean burning 
rate x is observed. The sample mean is an estimate of the true population mean w. A value of 
the sample mean x that falls close to the hypothesized value of ju. = 50 centimeters per second 
is evidence that the true mean p is really 50 centimeters per second; that is, such evidence sup- 
ports the null hypothesis Hp. On the other hand, a sample mean that is considerably different 
from 50 centimeters per second is evidence in support of the alternative hypothesis 77,. Thus, 
the sample mean is the test statistic in this case. 

The sample mean can take on many different values. Suppose that if 48.5 = x =s 51.5, we 
will not reject the null hypothesis Hp: w = 50, and if either x < 48.5 or x > 51.5, we will 
reject the null hypothesis in favor of the alternative hypothesis H,: w # 50. This is illustrated 
in Fig. 9-1. The values of x that are less than 48.5 and greater than 51.5 constitute the critical 
region for the test, while all values that are in the interval 48.5 < x S 51.5 form a region for 
which we will fail to reject the null hypothesis. By convention, this is usually called the 
acceptance region. The boundaries between the critical regions and the acceptance region are 
called the critical values. In our example the critical values are 48.5 and 51.5. It is customary 
to state conclusions relative to the null hypothesis Ho. Therefore, we reject H, in favor of Hj 
if the test statistic falls in the critical region and fail to reject Hp otherwise. 

This decision procedure can lead to either of two wrong conclusions. For example, the 
true mean burning rate of the propellant could be equal to 50 centimeters per second. 
However, for the randomly selected propellant specimens that are tested, we could observe a 
value of the test statistic x that falls into the critical region. We would then reject the null 
hypothesis Ho in favor of the alternate H, when, in fact, H, is really true. This type of wrong 
conclusion is called a type I error. 


Rejecting the null hypothesis Ho when it is true is defined as a type I error. 





Now suppose that the true mean burning rate is different from 50 centimeters per second, yet 
the sample mean x falls in the acceptance region. In this case we would fail to reject Hy when 
it is false. This type of wrong conclusion is called a type II error. 


Failing to reject the null hypothesis when it is false is defined as a type II error. 





Thus, in testing any statistical hypothesis, four different situations determine whether the final 
decision is correct or in error. These situations are presented in Table 9-1. 
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Reject Ho Fail to Reject Hg Reject Ho Table 9-1 Decisions in Hypothesis Testing 
Pe ills Bele Pin Gs Decision Hy Is True Hy Is False 
48.5 50 51.5 x : f 
Fail to reject Ho no error type II error 


Figure 9-1 Decision criteria for testing Hp: y = 
50 centimeters per second versus H,: w # 50 centime- 
ters per second. 


Reject Ho type I error no error 


Because our decision is based on random variables, probabilities can be associated with 
the type I and type II errors in Table 9-1. The probability of making a type I error is denoted 
by the Greek letter a. That is, 


a = P(type I error) = P(reject Hy when Hj, is true) (9-3) 


Sometimes the type I error probability is called the significance level, or the a-error, or the 
size of the test. In the propellant burning rate example, a type I error will occur when either 
x > 51.5 or x < 48.5 when the true mean burning rate is ju = 50 centimeters per second. 
Suppose that the standard deviation of burning rate is o. = 2.5 centimeters per second and that 
the burning rate has a distribution for which the conditions of the central limit theorem apply, 
so the distribution of the sample mean is approximately normal with mean u = 50 and stan- 
dard deviation o/Vn = 2.5/V/10 = 0.79. The probability of making a type I error (or the 
significance level of our test) is equal to the sum of the areas that have been shaded in the tails 
of the normal distribution in Fig. 9-2. We may find this probability as 


a = P(X < 48.5 when u = 50) + P(X > 51.5 when p = 50) 
The z-values that correspond to the critical values 48.5 and 51.5 are 


| 48.5 — 50 


51.5 — 50 
71 0.79 


= —1.90 and Z = 0.79 








= 1.90 


Therefore 
a = P(Z < —1.90) + P(Z > 1.90) = 0.028717 + 0.028717 = 0.057434 
This implies that 5.76% of all random samples would lead to rejection of the hypothesis 


Ho: u = 50 centimeters per second when the true mean burning rate is really 50 centimeters 
per second. 


a/2 = 0.0287 a/2 = 0.0287 


48.5 u=50 51.5 X 


Figure 9-2 The critical region for Hp: jp. = 50 
versus H,: w # 50 andn = 10. 
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From inspection of Fig. 9-2, notice that we can reduce a by widening the acceptance 
region. For example, if we make the critical values 48 and 52, the value of a is 


= P z<) + p(z> St) = pg < -253) + PZ > 253) 
m 0.79 079 ]) l 


= 0.0057 + 0.0057 = 0.0114 


We could also reduce a by increasing the sample size. If n = 16, 0/Vn = 2.5/V16 = 
0.625, and using the original critical region from Fig. 9-1, we find 


| 48.5 — 50 


2051.5 — 50 
“1 ~ 0.625 


= —2.40 and Z) = 0.625 — 2.40 


Therefore 
a = P(Z < —2.40) + P(Z > 2.40) = 0.0082 + 0.0082 = 0.0164 


In evaluating a hypothesis-testing procedure, it is also important to examine the proba- 
bility of a type II error, which we will denote by B. That is, 


D = P(type IL error) = P(fail to reject Hy when Hj is false) (9-4) 


To calculate B (sometimes called the B-error), we must have a specific alternative hypothe- 
sis; that is, we must have a particular value of u. For example, suppose that it is important to 
reject the null hypothesis Hp: ju. = 50 whenever the mean burning rate w is greater than 52 
centimeters per second or less than 48 centimeters per second. We could calculate the proba- 
bility of a type II error f for the values ju = 52 and u = 48 and use this result to tell us some- 
thing about how the test procedure would perform. Specifically, how will the test procedure 
work if we wish to detect, that is, reject Hp, for a mean value of ju = 52 or p. = 48? Because 
of symmetry, it is necessary only to evaluate one of the two cases—say, find the probability of 
accepting the null hypothesis Ho: ju. = 50 centimeters per second when the true mean is p. = 
52 centimeters per second. 

Figure 9-3 will help us calculate the probability of type II error B. The normal distribution 
on the left in Fig. 9-3 is the distribution of the test statistic XY when the null hypothesis 
Ho: u = 50 is true (this is what is meant by the expression “under Hp: ju = 50"), and the nor- 
mal distribution on the right is the distribution of X when the alternative hypothesis is true and 
the value of the mean is 52 (or “under H,: u = 52"). Now a type II error will be committed if 
the sample mean X falls between 48.5 and 51.5 (the critical region boundaries) when u = 52. 
As seen in Fig. 9-3, this is just the probability that 48.5 - X < 51.5 when the true mean is 
| = 52, or the shaded area under the normal distribution on the right. Therefore, referring to 
Fig. 9-3, we find that 


B = P(48.5 < X < 51.5 when p = 52) 
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Figure 9-3 The probability of type II error 
when u = 52 andn = 10. 
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Figure 9-4 The probability of type II error 
when u = 50.5 and n = 10. 


The z-values corresponding to 48.5 and 51.5 when w = 52 are 


2515-52 © ee 
0.79 °° 
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Therefore 


B = P(—4.43 = Z = —0.63) = P(Z = —0.63) — P(Z = —443) 
= 0.2643 — 0.0000 = 0.2643 


Thus, if we are testing Hp: w = 50 against H,: u # 50 with n = 10, and the true value of the 
mean is = 52, the probability that we will fail to reject the false null hypothesis is 0.2643. By 
symmetry, if the true value of the mean is ju = 48, the value of B will also be 0.2643. 

The probability of making a type II error B increases rapidly as the true value of u 
approaches the hypothesized value. For example, see Fig. 9-4, where the true value of the 
mean is ju = 50.5 and the hypothesized value is Hp: ju = 50. The true value of w is very close 
to 50, and the value for Q is 


B = P(48.5 < X < 51.5 when u = 50.5) 
As shown in Fig. 9-4, the z-values corresponding to 48.5 and 51.5 when p = 50.5 are 


31:5. 505 
= 1.27 


| 48.5 — 50.5 
^ 07 0.79 


Zi = 0.79 and 


= —2.53 


Ly 


Therefore 


pR = = ZH=127) = Z-€ 127) — PZ = 253) 
= 0.8980 — 0.0057 = 0.8923 


Thus, the type II error probability is much higher for the case where the true mean is 50.5 
centimeters per second than for the case where the mean is 52 centimeters per second. Of course, 
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Figure 9-5 The 


probability of type II 
error when u = 52 
andn = 16. 
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in many practical situations we would not be as concerned with making a type II error if the mean 
were "close" to the hypothesized value. We would be much more interested in detecting large 
differences between the true mean and the value specified in the null hypothesis. 

The type II error probability also depends on the sample size n. Suppose that the null 
hypothesis is Hp: ju. = 50 centimeters per second and that the true value of the mean is 
|. = 52. If the sample size is increased from n = 10 ton = 16, the situation of Fig. 9-5 results. 
The normal distribution on the left is the distribution of Y when the mean |. = 50, and the 
normal distribution on the right is the distribution of X when u = 52. As shown in Fig. 9-5, 
the type II error probability is 


B = P(48.5 < X < 51.5 when u = 52) 


When n = 16, the standard deviation of X is o/ Vn = 2.5/V/16 = 0.625, and the z-values 
corresponding to 48.5 and 51.5 when u = 52 are 


2485-52 a no 35. og 
4- "083g  — C 9M o 0.625 | 


Therefore 


B = P(—5.60 = Z = —0.80) = P(Z = —0.80) — P(Z = —5.60) 
= 0.2119 — 0.0000 = 0.2119 


Recall that when n = 10 and u = 52, we found that B = 0.2643; therefore, increasing the 
sample size results in a decrease in the probability of type II error. 

The results from this section and a few other similar calculations are summarized in the 
following table: 











Acceptance Sample 
Region Size a Bat pw = 52 Bat u = 50.5 
48.5 « x « 51.5 10 0.0576 0.2643 0.8923 
48 <x< 52 10 0.0114 0.5000 0.9705 
48.5 « x « 51.5 16 0.0164 0.2119 0.9445 














48 <x<52 16 0.0014 0.5000 0.9918 
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The results in boxes were not calculated in the text but can easily be verified by the 
reader. This display and the discussion above reveal four important points: 


1. The size of the critical region, and consequently the probability of a type I error a, 
can always be reduced by appropriate selection of the critical values. 


2. Type land type II errors are related. A decrease in the probability of one type of error 
always results in an increase in the probability of the other, provided that the sample 
size n does not change. 


3. An increase in sample size will generally reduce both a and B, provided that the 
critical values are held constant. 


4. When the null hypothesis is false, Q increases as the true value of the parameter 
approaches the value hypothesized in the null hypothesis. The value of (9 decreases 
as the difference between the true mean and the hypothesized value increases. 


Generally, the analyst controls the type I error probability a when he or she selects the 
critical values. Thus, it is usually easy for the analyst to set the type I error probability at 
(or near) any desired value. Since the analyst can directly control the probability of 
wrongly rejecting Hp, we always think of rejection of the null hypothesis A, as a strong 
conclusion. 

On the other hand, the probability of type II error B is not a constant, but depends on 
the true value of the parameter. It also depends on the sample size that we have selected. 
Because the type II error probability B is a function of both the sample size and the extent to 
which the null hypothesis Hp is false, it is customary to think of the decision to accept Hp as a 
weak conclusion, unless we know that Q is acceptably small. Therefore, rather than saying we 
"accept 7,9", we prefer the terminology “fail to reject Hp”. Failing to reject Hy implies that we 
have not found sufficient evidence to reject Ho, that is, to make a strong statement. Failing to 
reject Ho does not necessarily mean that there is a high probability that Họ is true. It may 
simply mean that more data are required to reach a strong conclusion. This can have impor- 
tant implications for the formulation of hypotheses. 

An important concept that we will make use of is the power of a statistical test. 


The power of a statistical test is the probability of rejecting the null hypothesis Ho 


when the alternative hypothesis 1s true. 





The power is computed as 1 — B, and power can be interpreted as the probability of 
correctly rejecting a false null hypothesis. We often compare statistical tests by comparing 
their power properties. For example, consider the propellant burning rate problem when we 
are testing Hp: w = 50 centimeters per second against H,: w # 50 centimeters per second. 
Suppose that the true value of the mean is jp. = 52. When n = 10, we found that B = 0.2643, 
so the power of this testis 1 — B = 1 — 0.2643 = 0.7357 when p = 52. 

Power is a very descriptive and concise measure of the sensitivity of a statistical test, 
where by sensitivity we mean the ability of the test to detect differences. In this case, the 
sensitivity of the test for detecting the difference between a mean burning rate of 50 centime- 
ters per second and 52 centimeters per second is 0.7357. That is, if the true mean is really 
52 centimeters per second, this test will correctly reject Hp: ju = 50 and “detect” this differ- 
ence 73.57% of the time. If this value of power is judged to be too low, the analyst can increase 
either a or the sample size n. 
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9.1.35 One-Sided and Two-Sided Hypotheses 


EXAMPLE 9-1 


A test of any hypothesis such as 


Ho: = Mo 
Hi: u £ uo 


is called a two-sided test, because it is important to detect differences from the hypothesized value 
of the mean po that lie on either side of io. In such a test, the critical region is split into two parts, 
with (usually) equal probability placed in each tail of the distribution of the test statistic. 

Many hypothesis-testing problems naturally involve a one-sided alternative hypothesis, 
such as 


Ho: u = Mo x Ho: u = po 
Hi: p > po Hi: u < po 


If the alternative hypothesis is H,: w > wo, the critical region should lie in the upper tail 
of the distribution of the test statistic, whereas if the alternative hypothesis is 7: u < po, 
the critical region should lie in the lower tail of the distribution. Consequently, these tests 
are sometimes called one-tailed tests. The location of the critical region for one-sided tests 
is usually easy to determine. Simply visualize the behavior of the test statistic if the null 
hypothesis is true and place the critical region in the appropriate end or tail of the distri- 
bution. Generally, the inequality in the alternative hypothesis "points" in the direction of 
the critical region. 

In constructing hypotheses, we will always state the null hypothesis as an equality so that 
the probability of type I error a can be controlled at a specific value. The alternative hypoth- 
esis might be either one-sided or two-sided, depending on the conclusion to be drawn if Hj is 
rejected. If the objective is to make a claim involving statements such as greater than, less 
than, superior to, exceeds, at least, and so forth, a one-sided alternative is appropriate. If no 
direction is implied by the claim, or if the claim not equal to is to be made, a two-sided alter- 
native should be used. 


Consider the propellant burning rate problem. Suppose that if the burning rate is less than 
50 centimeters per second, we wish to show this with a strong conclusion. The hypotheses 
should be stated as 


Ho: u = 50 centimeters per second 


Hi: y. < 50 centimeters per second 


Here the critical region lies in the lower tail of the distribution of X. Since the rejection of Hp 
is always a strong conclusion, this statement of the hypotheses will produce the desired out- 
come if Ho is rejected. Notice that, although the null hypothesis is stated with an equal sign, it 
is understood to include any value of u not specified by the alternative hypothesis. Therefore, 
failing to reject Ho does not mean that = 50 centimeters per second exactly, but only that we 
do not have strong evidence in support of H4. 


In some real-world problems where one-sided test procedures are indicated, it is 
occasionally difficult to choose an appropriate formulation of the alternative hypothesis. For 
example, suppose that a soft-drink beverage bottler purchases 10-ounce bottles from a glass 
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company. The bottler wants to be sure that the bottles meet the specification on mean internal 
pressure or bursting strength, which for 10-ounce bottles is a minimum strength of 200 psi. 
The bottler has decided to formulate the decision procedure for a specific lot of bottles as a 
hypothesis testing problem. There are two possible formulations for this problem, either 


Ho: w = 200 psi 

Hi: w > 200 psi (9-5) 
Or 

Ho: w = 200 psi 

Hi: w < 200 psi (9-6) 


Consider the formulation in Equation 9-5. If the null hypothesis is rejected, the bottles will be 
judged satisfactory; if Hp is not rejected, the implication is that the bottles do not conform to 
specifications and should not be used. Because rejecting Hp is a strong conclusion, this for- 
mulation forces the bottle manufacturer to "demonstrate" that the mean bursting strength of 
the bottles exceeds the specification. Now consider the formulation in Equation 9-6. In this 
situation, the bottles will be judged satisfactory unless Hj, is rejected. That is, we conclude that 
the bottles are satisfactory unless there is strong evidence to the contrary. 

Which formulation is correct, the one of Equation 9-5 or Equation 9-6? The answer is it 
depends. For Equation 9-5, there is some probability that 77 will not be rejected (1.e., we 
would decide that the bottles are not satisfactory), even though the true mean is slightly 
greater than 200 psi. This formulation implies that we want the bottle manufacturer to demon- 
strate that the product meets or exceeds our specifications. Such a formulation could be 
appropriate if the manufacturer has experienced difficulty in meeting specifications in the past 
or if product safety considerations force us to hold tightly to the 200 psi specification. On the 
other hand, for the formulation of Equation 9-6 there is some probability that Hp will be ac- 
cepted and the bottles judged satisfactory, even though the true mean is slightly less than 
200 psi. We would conclude that the bottles are unsatisfactory only when there is strong evi- 
dence that the mean does not exceed 200 psi, that is, when Hp: ju = 200 psi is rejected. This 
formulation assumes that we are relatively happy with the bottle manufacturer's past per- 
formance and that small deviations from the specification of ju = 200 psi are not harmful. 

In formulating one-sided alternative hypotheses, we should remember that rejecting Ho is 
always a strong conclusion. Consequently, we should put the statement about which it is 1m- 
portant to make a strong conclusion in the alternative hypothesis. In real-world problems, this 
will often depend on our point of view and experience with the situation. 


9-1.4 General Procedure for Hypothesis Tests 


This chapter develops hypothesis-testing procedures for many practical problems. Use of the 
following sequence of steps in applying hypothesis-testing methodology is recommended. 
From the problem context, identify the parameter of interest. 

State the null hypothesis, Hp. 

Specify an appropriate alternative hypothesis, 77. 


pe e peg 


Choose a significance level a. 
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5. Determine an appropriate test statistic. 


6. State the rejection region for the statistic. 


7. Compute any necessary sample quantities, substitute these into the equation for the 
test statistic, and compute that value. 


8. Decide whether or not Hy should be rejected and report that in the problem context. 


Steps 1—4 should be completed prior to examination of the sample data. This sequence of 
steps will be illustrated in subsequent sections. 


EXERCISES FOR SECTION 9-1 


9-1. In each of the following situations, state whether it is a 

correctly stated hypothesis testing problem and why. 

(a) Hy: w = 25, H: p # 25 

(b) Hy: o > 10, Hi: o = 10 

(c) Hy: x = 50, Hi: x # 50 

(d) Ay p 01 Ap = 05 

(e) Aly: s = 30, Hy: s > 30 

9-2. A textile fiber manufacturer is investigating a new 

drapery yarn, which the company claims has a mean thread 

elongation of 12 kilograms with a standard deviation of 0.5 

kilograms. The company wishes to test the hypothesis 

Ho: w = 12 against Hj: w < 12, using a random sample of 

four specimens. 

(a) What is the type I error probability if the critical region is 
defined as x < 11.5 kilograms? 

(b) Find B for the case where the true mean elongation is 
11.25 kilograms. 


9-5. Repeat Exercise 9-2 using a sample size of n = 16 and 
the same critical region. 


9-4. In Exercise 9-2, find the boundary of the critical region 
if the type I error probability is specified to be a = 0.01. 


9-5, [n Exercise 9-2, find the boundary of the critical region 
if the type I error probability is specified to be 0.05. 


9-6. The heat evolved in calories per gram of a cement mix- 

ture is approximately normally distributed. The mean is 

thought to be 100 and the standard deviation is 2. We wish to 

test Hp: w = 100 versus Hj: j, # 100 with a sample of 

n = 9 specimens. 

(a) If the acceptance region is defined as 98.5 = x S 101.5, 
find the type I error probability a. 

(b) Find B for the case where the true mean heat evolved is 103. 

(c) Find B for the case where the true mean heat evolved is 
105. This value of B is smaller than the one found in part 
(b) above. Why? 


9-7. Repeat Exercise 9-6 using a sample size of n = 5 and 
the same acceptance region. 


9-8. A consumer products company is formulating a new 
shampoo and is interested in foam height (in millimeters). 
Foam height is approximately normally distributed and has a 
standard deviation of 20 millimeters. The company wishes to 


test Hp: ju. = 175 millimeters versus H,: w > 175 millime- 

ters, using the results of n = 10 samples. 

(a) Find the type I error probability a if the critical region is 
x > 185. 

(b) What is the probability of type II error if the true mean 
foam height is 195 millimeters? 


9-9. In Exercise 9-8, suppose that the sample data result in 

x = 190 millimeters. 

(a) What conclusion would you reach? 

(b) How “unusual” is the sample value x = 190 millimeters 
if the true mean is really 175 millimeters? That is, what is 
the probability that you would observe a sample average 
as large as 190 millimeters (or larger), if the true mean 
foam height was really 175 millimeters? 


9-10. Repeat Exercise 9-8 assuming that the sample size is 
n = 16 and the boundary of the critical region is the same. 


9-11. Consider Exercise 9-8, and suppose that the sample 

size 1s increased ton = 16. 

(a) Where would the boundary of the critical region be placed 
if the type I error probability were to remain equal to the 
value that it took on when n = 10? 

(b) Using n = 16 and the new critical region found in part (a), 
find the type II error probability B if the true mean foam 
height is 195 millimeters. 

(c) Compare the value of (9 obtained in part (b) with the value 
from Exercise 9-8 (b). What conclusions can you draw? 


9-12. A manufacturer is interested in the output voltage of a 

power supply used in a PC. Output voltage is assumed to be 

normally distributed, with standard deviation 0.25 Volts, and 

the manufacturer wishes to test Hp: w = 5 Volts against 

Hi: u # 5 Volts, using n = 8 units. 

(a) The acceptance region is 4.85 = x = 5.15. Find the value 
of a. 

(b) Find the power of the test for detecting a true mean output 
voltage of 5.1 Volts. 


9-13. Rework Exercise 9-12 when the sample size is 16 and 
the boundaries of the acceptance region do not change. 

9-14. Consider Exercise 9-12, and suppose that the manu- 
facturer wants the type I error probability for the test to be 
a = 0.05. Where should the acceptance region be located? 
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9-15. If we plot the probability of accepting Hp: u = po 
versus various values of u and connect the points with a 
smooth curve, we obtain the operating characteristic curve 
(or the OC curve) of the test procedure. These curves are used 
extensively in industrial applications of hypothesis testing to 
display the sensitivity and relative performance of the test. 
When the true mean is really equal to to, the probability of ac- 
cepting y is 1 — a. Construct an OC curve for Exercise 9-8, 
using values of the true mean p of 178, 181, 184, 187, 190, 
193, 196, and 199. 


9-16. Convert the OC curve in Exercise 9-15 into a plot of 
the power function of the test. 


9-17. A random sample of 500 registered voters in Phoenix 
is asked if they favor the use of oxygenated fuels year-round 
to reduce air pollution. If more than 400 voters respond posi- 
tively, we will conclude that at least 60% of the voters favor 
the use of these fuels. 
(a) Find the probability of type I error if exactly 6096 of the 
voters favor the use of these fuels. 
(b) What is the type II error probability B if 75% of the voters 
favor this action? 
Hint: use the normal approximation to the binomial. 
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9-18. The proportion of residents in Phoenix favoring the 

building of toll roads to complete the freeway system is be- 

lieved to be p — 0.3. If a random sample of 10 residents 

shows that 1 or fewer favor this proposal, we will conclude 

that p « 0.3. 

(a) Find the probability of type I error if the true proportion is 
p = 0.3. 

(b) Find the probability of committing a type II error with this 
procedure if p = 0.2. 

(c) What is the power of this procedure if the true proportion 
is p = 0.2? 

9-19. The proportion of adults living in Tempe, Arizona, 

who are college graduates is estimated to be p = 0.4. To test 

this hypothesis, a random sample of 15 Tempe adults is 

selected. If the number of college graduates is between 4 and 

8, the hypothesis will be accepted; otherwise, we will 

conclude that p # 0.4. 

(a) Find the type I error probability for this procedure, assum- 
ing that p = 0.4. 

(b) Find the probability of committing a type II error if the 
true proportion is really p = 0.2. 


9.2 TESTS ON THE MEAN OF A NORMAL DISTRIBUTION, 


VARIANCE KNOWN 


In this section, we consider hypothesis testing about the mean p of a single, normal population 
where the variance of the population o? is known. We will assume that a random sample X, 
X>,..., X, has been taken from the population. Based on our previous discussion, the sample 
mean X is an unbiased point estimator of u with variance o7/n. 


9-2.1 Hypothesis Tests on the Mean 


Suppose that we wish to test the hypotheses 


Ho: p = Mo 


(9-7) 
Hi: u £ po 


where uy is a specified constant. We have a random sample X;, X5, ..., X, from a normal pop- 
ulation. Since X has a normal distribution (i.e., the sampling distribution of X is normal) 
with mean pọ and standard deviation o'/ Vn if the null hypothesis is true, we could construct 
a critical region based on the computed value of the sample mean X, as in Section 9-1.2. 

It is usually more convenient to standardize the sample mean and use a test statistic based on 
the standard normal distribution. That is, the test procedure for Hp: ju = puo uses the test statistic 
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Figure 9-6 The distribution of Zọ when Hp: w = po is true, with critical region for (a) the two-sided alternative H,: y # po, 
(b) the one-sided alternative H,: y > wo, and (c) the one-sided alternative H,: y < uo. 


EXAMPLE 9-2 


If the null hypothesis Hp: y = pois true, E(X) = po, and it follows that the distribution of Zo 
is the standard normal distribution [denoted N(0, 1)]. Consequently, if Ho: j. = po is true, the 
probability is 1 — a that the test statistic Z; falls between —zą and z,»?, where Ząy2 is the 
1000/2 percentage point of the standard normal distribution. The regions associated with 
Zaj and —z, are illustrated in Fig. 9-6(a). Note that the probability is a that the test statistic Zp 
will fall in the region Zo > Za or Zo < —z,y; when Ho: p = Mo is true. Clearly, a sample 
producing a value of the test statistic that falls in the tails of the distribution of Z, would be 
unusual if Ho: |j. = pọ is true; therefore, it is an indication that Hj, is false. Thus, we should 
reject Ho if the observed value of the test statistic zy is either 


Zo = Za/2 Or Zo < ~ 2/2 (9-9) 
and we should fail to reject Ho if 
—Zgp = Z9 = Zap (9-10) 


The inequalities in Equation 9-10 defines the acceptance region for Ho, and the two inequali- 
ties in Equation 9-9 define the critical region or rejection region. The type I error probability 
for this test procedure is a. 

It is easier to understand the critical region and the test procedure, in general, when the 
test statistic is Z, rather than X. However, the same critical region can always be written in 
terms of the computed value of the sample mean x. A procedure identical to the above is as 
follows: 


Reject Hp: y = po if either x > aorx < b 
where 


a = po s Zy20/Vn and b= Wo — z,/50/ Vn 


Aircrew escape systems are powered by a solid propellant. The burning rate of this pro- 
pellant is an important product characteristic. Specifications require that the mean burning 
rate must be 50 centimeters per second. We know that the standard deviation of burning 
rate is o. = 2 centimeters per second. The experimenter decides to specify a type I error 
probability or significance level of a = 0.05 and selects a random sample of n = 25 and 
obtains a sample average burning rate of x — 51.3 centimeters per second. What conclu- 
sions should be drawn? 
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We may solve this problem by following the eight-step procedure outlined in Section 9-1.4. 
This results in 


1. The parameter of interest is jz, the mean burning rate. 





2. Hy u = 50 centimeters per second 
3. H,: w # 50 centimeters per second 
4. a= 0.05 
5. The test statistic 1s 
nc X — Mo 
a/Vn 


6. Reject Ho if zọ > 1.96 or if zọ < — 1.96. Note that this results from step 4, where we 
specified a = 0.05, and so the boundaries of the critical region are at Zp 94 = 1.96 
and ^ 20.025 — — 1.96. 


7. Computations: Since x = 51.3 ando = 2, 


8. Conclusion: Since zọ = 3.25 > 1.96, we reject Hy: w = 50 at the 0.05 level of 
significance. Stated more completely, we conclude that the mean burning rate dif- 
fers from 50 centimeters per second, based on a sample of 25 measurements. In 
fact, there is strong evidence that the mean burning rate exceeds 50 centimeters 
per second. 


We may also develop procedures for testing hypotheses on the mean w where the alter- 
native hypothesis is one-sided. Suppose that we specify the hypotheses as 


Ho: M — Mo (9-11) 
Hi: p > po 


In defining the critical region for this test, we observe that a negative value of the test statistic 
Zo would never lead us to conclude that Hp: w = p is false. Therefore, we would place the 
critical region in the upper tail of the standard normal distribution and reject Ho if the com- 
puted value of zọ is too large. That is, we would reject Hj, if 


26 Es (9-12) 
as shown in Figure 9-6(b). Similarly, to test 


Ho: = Mo 
Hi: u < Mo (9-13) 


we would calculate the test statistic Z) and reject Ho if the value of Zp is too small. That is, the 
critical region is in the lower tail of the standard normal distribution as shown in Figure 
9-6(c), and we reject Ho if 


Zo < Ly (9-14) 
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9.2.2 P-Values in Hypothesis Tests 


Definition 


One way to report the results of a hypothesis test 1s to state that the null hypothesis was or was 
not rejected at a specified a-value or level of significance. For example, in the propellant 
problem above, we can say that Hp: ju. = 50 was rejected at the 0.05 level of significance. This 
statement of conclusions is often inadequate because it gives the decision maker no idea about 
whether the computed value of the test statistic was just barely in the rejection region or 
whether it was very far into this region. Furthermore, stating the results this way imposes the 
predefined level of significance on other users of the information. This approach may be un- 
satisfactory because some decision makers might be uncomfortable with the risks implied by 
a = 0.05. 

To avoid these difficulties the P-value approach has been adopted widely in practice. 
The P-value is the probability that the test statistic will take on a value that is at least as 
extreme as the observed value of the statistic when the null hypothesis AH is true. Thus, a 
P-value conveys much information about the weight of evidence against Ho, and so a deci- 
sion maker can draw a conclusion at any specified level of significance. We now give a 
formal definition of a P-value. 


The P-value is the smallest level of significance that would lead to rejection of the 


null hypothesis Ho with the given data. 





It is customary to call the test statistic (and the data) significant when the null hypoth- 
esis Ho is rejected; therefore, we may think of the P-value as the smallest level a at which 
the data are significant. Once the P-value is known, the decision maker can determine how 
significant the data are without the data analyst formally imposing a preselected level of 
significance. 

For the foregoing normal distribution tests it 1s relatively easy to compute the P-value. If 
Zo is the computed value of the test statistic, the P-value is 


2|1 — ®(|zo|)| for a two-tailed test: Ho: ju = po Hy: u # uo 
P = 41— (z) for a upper-tailed test: Ho: w = po Hi: u > mo (9-15) 
P(zo) for a lower-tailed test: Hp: w = po Ay u < po 


Here, (z) is the standard normal cumulative distribution function defined in Chapter 4. 
Recall that ®(z) = P(Z < z), where Z is N(0, 1). To illustrate this, consider the propellant 
problem in Example 9-2. The computed value of the test statistic is z, = 3.25 and since the 
alternative hypothesis is two-tailed, the P-value is 


P-value = 2|1 — ®(3.25)] = 0.0012 


Thus, Ho: ju. = 50 would be rejected at any level of significance a = P-value = 0.0012. For 
example, Hp) would be rejected if a = 0.01, but it would not be rejected if a = 0.001. 

It is not always easy to compute the exact P-value for a test. However, most modern 
computer programs for statistical analysis report P-values, and they can be obtained on some 
hand-held calculators. We will also show how to approximate the P-value. Finally, if the 
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P-value approach is used, step 6 of the hypothesis-testing procedure can be modified. 
Specifically, it is not necessary to state explicitly the critical region. 


9-2.3 Connection between Hypothesis Tests and Confidence Intervals 


There is a close relationship between the test of a hypothesis about any parameter, say 0, and 
the confidence interval for 0. If [/, u] is a 100(1 — a)% confidence interval for the parameter 
0, the test of size a of the hypothesis 


Hg: 0 = Oo 
H,:0 == Oo 


will lead to rejection of Hj if and only if 05 is not in the 100(1 — a)% CI [/, u]. As an illus- 
tration, consider the escape system propellant problem discussed above. The null hypothesis 
Ho: u = 50 was rejected, using a = 0.05. The 95% two-sided CI on p can be calculated using 
Equation 8-7. This CI is 50.52 < u < 52.08. Because the value p = 50 is not included in this 
interval, the null hypothesis Hp: ju = 50 is rejected. 

Although hypothesis tests and CIs are equivalent procedures insofar as decision mak- 
ing or inference about u is concerned, each provides somewhat different insights. For 
instance, the confidence interval provides a range of likely values for u at a stated confi- 
dence level, whereas hypothesis testing is an easy framework for displaying the risk levels 
such as the P-value associated with a specific decision. We will continue to illustrate the 
connection between the two procedures throughout the text. 


9.2.4 Type II Error and Choice of Sample Size 


In testing hypotheses, the analyst directly selects the type I error probability. However, the 
probability of type II error B depends on the choice of sample size. In this section, we will 
show how to calculate the probability of type II error B. We will also show how to select the 
sample size to obtain a specified value of p. 


Finding the Probability of Type II Error B 
Consider the two-sided hypothesis 


Ho: u = Mo 
Hi: u T po 


Suppose that the null hypothesis is false and that the true value of the mean is u = puo + 9, 
say, where ò > 0. The test statistic Zp is 


| X — po X (Ho +8) , BVn 


40 = o/Vn |  o/Wn o 








Therefore, the distribution of Zọ when Hi, is true is 


Z) ~ JO 1) (9-16) 





O ? 
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Figure 9-7 The 
distribution of Zo 
under Hp and Hj. 


Under Hy: u - ug Under Hj: uz ug 





The distribution of the test statistic Z, under both the null hypothesis Hp and the alternate 
hypothesis H, is shown in Fig. 9-7. From examining this figure, we note that if H; is true, a 
type II error will be made only if —z,/ — Zo S Za where Zo ~ N(8 V/n/o, 1). That is, the 
probability of the type II error 6 is the probability that Zo falls between —z,/ and z,/; given 
that H, is true. This probability is shown as the shaded portion of Fig. 9-7. Expressed mathe- 
matically, this probability is 





where ®(z) denotes the probability to the left of z in the standard normal distribution. Note 
that Equation 9-17 was obtained by evaluating the probability that Z, falls in the interval 
| —Za/2» Za/2| when Hj is true. Furthermore, note that Equation 9-17 also holds if à < 0, due 
to the symmetry of the normal distribution. It is also possible to derive an equation similar to 
Equation 9-17 for a one-sided alternative hypothesis. 


Sample Size Formulas 
One may easily obtain formulas that determine the appropriate sample size to obtain a partic- 
ular value of B for a given 6 and a. For the two-sided alternative hypothesis, we know from 


Equation 9-17 that 
6 Vn 6 Vn 
p - v (s, a 2d c a(-y S 20 


or if 6 > 0, 


svi) - 


pos v (s, ~ o 


since P(—Zy. — 6 V n/a) = 0 when ô is positive. Let Zg be the 100 upper percentile of the 
standard normal distribution. Then, 8 = ®(—zg). From Equation 9-18 


6 Vn 


—Zg ~ Zo I — 


Or 


EXAMPLE 9-3 
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This approximation is good when ®(—z,/. — 8 V/n/o) is small compared to B. For either of 
the one-sided alternative hypotheses the sample size required to produce a specified type II 
error with probability B given 6 and a is 





Consider the rocket propellant problem of Example 9-2. Suppose that the analyst wishes to 
design the test so that if the true mean burning rate differs from 50 centimeters per second by 
as much as 1 centimeter per second, the test will detect this (1.e., reject Hp: ju = 50) with a high 
probability, say 0.90. Now, we note that o. = 2,6 = 51 — 50 = 1, a = 0.05, and B = 0.10. 
Since Zy/2 = Zo.025 = 1.96 and zg = zo,o9 = 1.28, the sample size required to detect this 
departure from Hp: w = 50 is found by Equation 9-19 as 


(Za + zg) o^ (1.96 + 1282? 
n => —————M——— m a c 


8? (1) 


The approximation is good here, since ®(—z,;. — 8 Vn/o) = ®(—1.96 — (1)V42/2) = 
®(—5.20) = 0, which is small relative to B. 


Using Operating Characteristic Curves 

When performing sample size or type II error calculations, it is sometimes more convenient to 
use the operating characteristic curves in Appendix Charts VIa and VIb. These curves plot 
B as calculated from Equation 9-17 against a parameter d for various sample sizes n. Curves 
are provided for both a = 0.05 and a = 0.01. The parameter d is defined as 


ip — pol [9l 
o EN 


d = (9-21) 
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EXAMPLE 9-4 


EXAMPLE 9-5 


so one set of operating characteristic curves can be used for all problems regardless of the 
values of pọ and o. From examining the operating characteristic curves or Equation 9-17 and 
Fig. 9-7, we note that 


1. The further the true value of the mean p is from po, the smaller the probability of 
type II error B for a given n and a. That is, we see that for a specified sample size and 
a, large differences in the mean are easier to detect than small ones. 


2. Fora given 6 and o, the probability of type II error B decreases as n increases. That 
is, to detect a specified difference ò in the mean, we may make the test more power- 
ful by increasing the sample size. 


Consider the propellant problem in Example 9-2. Suppose that the analyst is concerned about 
the probability of type II error if the true mean burning rate is ju = 51 centimeters per second. 
We may use the operating characteristic curves to find B. Note that ò = 51 — 50 = 1, n = 25, 
o = 2, and a = 0.05. Then using Equation 9-21 gives 





o o J 


- 5| 1 
g- ea wl Bl 


and from Appendix Chart VIa, with n = 25, we find that B = 0.30. That is, if the true mean 
burning rate is ju = 51 centimeters per second, there is approximately a 30% chance that this 
will not be detected by the test with n = 25. 


Once again, consider the propellant problem in Example 9-2. Suppose that the analyst would 
like to design the test so that 1f the true mean burning rate differs from 50 centimeters per sec- 
ond by as much as | centimeter per second, the test will detect this (1.e., reject Hp: ju. = 50) 
with a high probability, say, 0.90. This is exactly the same requirement as in Example 9-3, 
where we used Equation 9-19 to find the required sample size to be n = 42. The operating 
characteristic curves can also be used to find the sample size for this test. Since 
d= |u — po|/o = 1/2, a = 0.05, and B = 0.10, we find from Appendix Chart VIa that the 
required sample size is approximately n = 40. This closely agrees with the sample size calcu- 
lated from Equation 9-19. 


In general, the operating characteristic curves involve three parameters: B, d, and n. 
Given any two of these parameters, the value of the third can be determined. There are two 
typical applications of these curves: 


1. Foragiven n and d, find B (as illustrated in Example 9-3). This kind of problem is often 
encountered when the analyst is concerned about the sensitivity of an experiment 
already performed, or when sample size is restricted by economic or other factors. 


2. Fora given Q and d, find n. This was illustrated in Example 9-4. This kind of problem 
is usually encountered when the analyst has the opportunity to select the sample size 
at the outset of the experiment. 


Operating characteristic curves are given in Appendix Charts VIc and VId for the one- 
sided alternatives. If the alternative hypothesis is either H,: y > pọ or Hy: p. < uo, the abscissa 
scale on these charts is 


42M — Mol (9-22) 


o 
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Using the Computer 
Many statistics software packages will calculate sample sizes and type II error probabilities. To 
illustrate, here are some computations from Minitab for the propellant burning rate problem. 


Power and Sample Size 


]-Sample Z Test 
Testing mean — null (versus not — null) 
Calculating power for mean = null + difference 
Alpha = 0.05 Sigma = 2 

Sample Target Actual 


Difference Size Power Power 
1 43 0.9000 0.9064 


Power and Sample Size 


1-Sample Z Test 
Testing mean = null (versus not = null) 
Calculating power for mean = null + difference 
Alpha = 0.05 Sigma = 2 

Sample Target Actual 


Difference Size Power Power 
1 28 0.7500 0.7536 


Power and Sample Size 


l-Sample Z Test 
Testing mean = null (versus not = null) 
Calculating power for mean = null + difference 
Alpha = 0.05 Sigma = 2 
Sample 
Difference Size Power 
1 25 0.7054 





In the first part of the boxed display, we asked Minitab to work Example 9-3, that is, to find 
the sample size n that would allow detection of a difference from p = 50 of 1 centimeter per 
second with power of 0.9 and a — 0.05. The answer, n — 43, agrees closely with the calcu- 
lated value from Equation 9-19 in Example 9-3, which was n = 42. The difference is due to 
Minitab using a value of zg that has more than two decimal places. The second part of the com- 
puter output relaxes the power requirement to 0.75. Note that the effect 1s to reduce the 
required sample size to n = 28. The third part of the output is the solution to Example 9-4, 
where we wish to determine the type II error probability of (8) or the power = 1 — Q for the 
sample size n — 25. Note that Minitab computes the power to be 0.7054, which agrees closely 
with the answer obtained from the O.C. curve in Example 9-4. Generally, however, the com- 
puter calculations will be more accurate than visually reading values from an O.C. curve. 


9.2.5 Large-Sample Test 


We have developed the test procedure for the null hypothesis Hp: |. = o assuming that the pop- 
ulation is normally distributed and that o? is known. In many if not most practical situations o? 


298 


CHAPTER9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE 


will be unknown. Furthermore, we may not be certain that the population is well modeled by a 
normal distribution. In these situations if n is large (say n > 40) the sample standard deviation s 
can be substituted for o in the test procedures with little effect. Thus, while we have given a test 
for the mean of a normal distribution with known c^, it can be easily converted into a large- 
sample test procedure for unknown o? that is valid regardless of the form of the distribution 
of the population. This large-sample test relies on the central limit theorem just as the large- 
sample confidence interval on p. that was presented in the previous chapter did. Exact treatment 
of the case where the population is normal, o? is unknown, and n is small involves use of the 
t distribution and will be deferred until Section 9-3. 


9.2.6 Some Practical Comments on Hypothesis Tests 


The Eight-Step Procedure 

In Section 9-1.4 we described an eight-step procedure for statistical hypothesis testing. This 
procedure was illustrated in Example 9-2 and will be encountered many times in both this 
chapter and Chapter 10. In practice, such a formal and (seemingly) rigid procedure is not 
always necessary. Generally, once the experimenter (or decision maker) has decided on 
the question of interest and has determined the design of the experiment (that is, how the data 
are to be collected, how the measurements are to be made, and how many observations are 
required), only three steps are really required: 


1. Specify the test statistic to be used (such as Zp). 
2. Specify the location of the critical region (two-tailed, upper-tailed, or lower-tailed). 


3. Specify the criteria for rejection (typically, the value of a, or the P-value at which 
rejection should occur). 


These steps are often completed almost simultaneously in solving real-world problems, 
although we emphasize that it is important to think carefully about each step. That is why we 
present and use the eight-step process: it seems to reinforce the essentials of the correct 
approach. While you may not use it every time in solving real problems, it is a helpful frame- 
work when you are first learning about hypothesis testing. 


Statistical versus Practical Significance 

We noted previously that reporting the results of a hypothesis test in terms of a P-value is very 
useful because it conveys more information than just the simple statement “reject Hp” or “fail 
to reject Ho”. That is, rejection of Hp at the 0.05 level of significance is much more meaning- 
ful if the value of the test statistic is well into the critical region, greatly exceeding the 5% crit- 
ical value, than if it barely exceeds that value. 

Even a very small P-value can be difficult to interpret from a practical viewpoint when 
we are making decisions because, while a small P-value indicates statistical significance in 
the sense that Hp should be rejected in favor of Hi, the actual departure from Hj, that has been 
detected may have little (if any) practical significance (engineers like to say “engineering 
significance"). This is particularly true when the sample size n is large. 

For example, consider the propellant burning rate problem of Example 9-3 where we are 
testing Hp: ju = 50 centimeters per second versus H,: u # 50 centimeters per second with o 
= 2. If we suppose that the mean rate is really 50.5 centimeters per second, this is not a seri- 
ous departure from Hp: u = 50 centimeters per second in the sense that if the mean really is 
50.5 centimeters per second there is no practical observable effect on the performance of the 
aircrew escape system. In other words, concluding that ju = 50 centimeters per second when 
it 1s really 50.5 centimeters per second is an inexpensive error and has no practical signifi- 
cance. For a reasonably large sample size, a true value of w = 50.5 will lead to a sample x that 
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is close to 50.5 centimeters per second, and we would not want this value of x from the sam- 
ple to result in rejection of Hp. The following display shows the P-value for testing Hp: p = 50 
when we observe x = 50.5 centimeters per second and the power of the test at a = 0.05 when 
the true mean is 50.5 for various sample sizes n: 


sample Size P-value Power (at a = 0.05) 
n When x = 50.5 When True p = 50.5 

10 0.4295 0.1241 

23 0.2113 0.2396 

50 0.0767 0.4239 

100 0.0124 0.7054 

400 5.73 X 107” 0.9988 

1000 207 10 1.0000 


The P-value column in this display indicates that for large sample sizes the observed 
sample value of x = 50.5 would strongly suggest that Họ: ju = 50 should be rejected, even 
though the observed sample results imply that from a practical viewpoint the true mean does 
not differ much at all from the hypothesized value pọ = 50. The power column indicates that 
if we test a hypothesis at a fixed significance level a and even if there is little practical differ- 
ence between the true mean and the hypothesized value, a large sample size will almost 
always lead to rejection of Hj. The moral of this demonstration is clear: 


Be careful when interpreting the results from hypothesis testing when the sample size 


is large, because any small departure from the hypothesized value pty) will probably be 
detected, even when the difference is of little or no practical significance. 





EXERCISES FOR SECTION 9-2 


9-20. The mean water temperature downstream from a 
power plant cooling tower discharge pipe should be no more 
than 100°F Past experience has indicated that the standard 
deviation of temperature is 2°K The water temperature is 
measured on nine randomly chosen days, and the average 
temperature is found to be 98°F. 

(a) Should the water temperature be judged acceptable with 
a = 0.05? 

(b) What is the P-value for this test? 

(c) What is the probability of accepting the null hypothesis 
at a = 0.05 if the water has a true mean temperature of 
104 °F? 

9-21. Reconsider the chemical process yield data from 

Exercise 8-9. Recall that o = 3, yield is normally distributed 

and that n = 5 observations on yield are 91.6%, 88.75%, 90.8%, 

89.95%, and 91.3%. Use a = 0.05. 

(a) Is there evidence that the mean yield is not 90%? 

(b) What is the P-value for this test? 

(c) What sample size would be required to detect a true mean 
yield of 85% with probability 0.95? 


(d) What is the type II error probability if the true mean yield 
is 92%? 

(e) Compare the decision you made in part (c) with the 95% 
CI on mean yield that you constructed in Exercise 8-7. 


9-22. A manufacturer produces crankshafts for an automo- 

bile engine. The wear of the crankshaft after 100,000 miles 

(0.0001 inch) is of interest because it is likely to have an 

impact on warranty claims. A random sample of n = 15 shafts 

is tested and x = 2.78. It is known that ø = 0.9 and that wear 

is normally distributed. 

(a) Test Ho: w = 3 versus Ho: u 73 using a = 0.05. 

(b) What is the power of this test if ju = 3.25? 

(c) What sample size would be required to detect a true mean 
of 3.75 if we wanted the power to be at least 0.9? 


9-25. <A melting point test of n = 10 samples of a binder 
used in manufacturing a rocket propellant resulted in 
x = 154.2?F. Assume that melting point is normally distrib- 
uted with o = 1.5°F. 

(a) Test Ho: w = 155 versus Hp: u # 155 using a = 0.01. 
(b) What is the P-value for this test? 
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(c) What is the B-error if the true mean is p. = 150? 
(d) What value of n would be required if we want B < 0.1 
when u = 150? Assume that a = 0.01. 


9-24. The life in hours of a battery is known to be approxi- 

mately normally distributed, with standard deviation o = 1.25 

hours. A random sample of 10 batteries has a mean life of 

x = 40.5 hours. 

(a) Is there evidence to support the claim that battery life 
exceeds 40 hours? Use a = 0.05. 

(b) What is the P-value for the test in part (a)? 

(c) What is the B-error for the test in part (a) if the true mean 
life is 42 hours? 

(d) What sample size would be required to ensure that B does 
not exceed 0.10 if the true mean life is 44 hours? 

(e) Explain how you could answer the question in part (a) 
by calculating an appropriate confidence bound on life. 


9-25. An engineer who is studying the tensile strength of a 

steel alloy intended for use in golf club shafts knows that 

tensile strength is approximately normally distributed with 

o = 60 psi. A random sample of 12 specimens has a mean 

tensile strength of x — 3250 psi. 

(a) Test the hypothesis that mean strength is 3500 psi. Use 
a = 0.01. 

(b) What is the smallest level of significance at which you 
would be willing to reject the null hypothesis? 

(c) Explain how you could answer the question in part (a) 
with a two-sided confidence interval on mean tensile 
strength. 


9-26. Suppose that in Exercise 9-25 we wanted to reject the 
null hypothesis with probability at least 0.8 if mean strength 
|, = 3500. What sample size should be used? 


9-27.  Supercavitation is a propulsion technology for under- 
sea vehicles that can greatly increase their speed. It occurs 
above approximately 50 meters per second, when pressure 
drops sufficiently to allow the water to dissociate into water 
vapor, forming a gas bubble behind the vehicle. When the gas 
bubble completely encloses the vehicle, supercavitation is 
said to occur. Eight tests were conducted on a scale model of 
an undersea vehicle in a towing basin with the average ob- 
served speed x — 102.2 meters per second. Assume that speed 
is normally distributed with known standard deviation o = 
4 meters per second. 
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(a) Test the hypotheses Hp: jp. = 100 versus H,: w < 100 us- 
ing a = 0.05. 

(b) Compute the power of the test if the true mean speed is as 
low as 95 meters per second. 

(c) What sample size would be required to detect a true mean 
speed as low as 95 meters per second if we wanted the 
power of the test to be at least 0.85? 

(d) Explain how the question in part (a) could be answered by 
constructing a one-sided confidence bound on the mean 
speed. 


9-28. A bearing used in an automotive application is suppose 

to have a nominal inside diameter of 1.5 inches. A random sam- 

ple of 25 bearings is selected and the average inside diameter of 

these bearings is 1.4975 inches. Bearing diameter is known to be 

normally distributed with standard deviation o: = 0.01 inch. 

(a) Test the hypotheses Hy: u = 1.5 versus H,: w # 1.5 using 
a = 0.01. 

(b) Compute the power of the test if the true mean diameter is 
1.495 inches. 

(c) What sample size would be required to detect a true mean 
diameter as low as 1.495 inches if we wanted the power of 
the test to be at least 0.9? 

(d) Explain how the question in part (a) could be answered by 
constructing a two-sided confidence interval on the mean 
diameter. 


9-29. Medical researchers have developed a new artificial 

heart constructed primarily of titanium and plastic. The heart 

will last and operate almost indefinitely once it is implanted in 

the patient’s body, but the battery pack needs to be recharged 

about every four hours. A random sample of 50 battery packs 

is selected and subjected to a life test. The average life of these 

batteries is 4.05 hours. Assume that battery life is normally 

distributed with standard deviation øo = 0.2 hour. 

(a) Is there evidence to support the claim that mean battery 
life exceeds 4 hours? Use a = 0.05. 

(b) Compute the power of the test if the true mean battery life 
is 4.5 hours. 

(c) What sample size would be required to detect a true mean 
battery life of 4.5 hours if we wanted the power of the test 
to be at least 0.9? 

(d) Explain how the question in part (a) could be answered by 
constructing a one-sided confidence bound on the mean life. 


9-3 TESTS ON THE MEAN OF A NORMAL DISTRIBUTION, 


VARIANCE UNKNOWN 


9-3.1 Hypothesis Tests on the Mean 


We now consider the case of hypothesis testing on the mean of a population with unknown 
variance o°. The situation is analogous to Section 8-3, where we considered a confidence 
interval on the mean for the same situation. As in that section, the validity of the test procedure 
we will describe rests on the assumption that the population distribution is at least approximately 
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normal. The important result upon which the test procedure relies is that if Xj, X>, ..., X, 18 a 
random sample from a normal distribution with mean u and variance c^, the random variable 


has a ¢ distribution with n — 1 degrees of freedom. Recall that we used this result in Section 
8-3 to devise the ¢-confidence interval for u. Now consider testing the hypotheses 


Ho: p = po 
Hi: u F po 
We will use the test statistic 
X — Ho 
To TERES 9-23 
° S/Vn ias 
If the null hypothesis is true, Tọ has a ¢ distribution with n — 1 degrees of freedom. When we 
know the distribution of the test statistic when Hj, is true (this is often called the reference 
distribution or the null distribution), we can locate the critical region to control the type I 
error probability at the desired level. In this case we would use the / percentage points —£,/ ,. 
and 1,5 ,.., as the boundaries of the critical region so that we would reject Ho: p. = po if 


lo > ta/2n-1 orif 19 «€ —fuo 54-1 


where fp is the observed value of the test statistic Tọ. The test procedure is very similar to the 
test on the mean with known variance described in Section 9-2, except that Tọ is used as the 
test statistic instead of Z, and the ¢,_, distribution is used to define the critical region instead 
of the standard normal distribution. A summary of the test procedures for both two- and one- 
sided alternative hypotheses follows: 


The One- 
Sample t-Test Null hypothesis: Ho: u = po 


p d Wo 
S/Vn 


Test statistic: To = 


Alternative hypothesis Rejection criteria 
Ay: u £ po lo > lan-1 OF do < tann- 
Hi po ze Ho to > e 
Hi: e < po fg « nl 





Figure 9-8 shows the location of the critical region for these situations. 


a2 a2 a o 


—tyj2,n-1 0 to/2,n-1 O ta n -1 -ta n -1 O To 
(a) (b) (c) 
Figure 9-8 The reference distribution for Hp: w = pọ with critical region for (a) Hj: w Æ uo, (b) Hi: w > po, and 
(c) Hi: p < po. 
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EXAMPLE 9-6 


Figure 9-9. Normal 
probability plot of the 
coefficient of restitu- 
tion data from 
Example 9-6. 


The increased availability of light materials with high strength has revolutionized the design and 
manufacture of golf clubs, particularly drivers. Clubs with hollow heads and very thin faces can 
result in much longer tee shots, especially for players of modest skills. This is due partly to the 
"spring-like effect" that the thin face imparts to the ball. Firing a golf ball at the head of the club 
and measuring the ratio of the outgoing velocity of the ball to the incoming velocity can quantify 
this spring-like effect. The ratio of velocities is called the coefficient of restitution of the club. An 
experiment was performed in which 15 drivers produced by a particular club maker were selected 
at random and their coefficients of restitution measured. In the experiment the golf balls were 
fired from an air cannon so that the incoming velocity and spin rate of the ball could be precisely 
controlled. It is of interest to determine if there is evidence (with œ = 0.05) to support a claim that 
the mean coefficient of restitution exceeds 0.82. The observations follow: 


0.8411 0.8191 0.8182 0.8125 0.8750 
0.8580 0.8532 0.8483 0.8276 0.7983 
0.8042 0.8730 0.8282 0.8359 0.8660 


The sample mean and sample standard deviation are x = 0.83725 and s = 0.02456. The normal 
probability plot of the data in Fig. 9-9 supports the assumption that the coefficient of restitution 1s 
normally distributed. Since the objective of the experimenter is to demonstrate that the mean co- 
efficient of restitution exceeds 0.82, a one-sided alternative hypothesis is appropriate. 

The solution using the eight-step procedure for hypothesis testing is as follows: 


1. The parameter of interest is the mean coefficient of restitution, w. 





2. Ho: p = 0.82 
3. Hy: w 0.82. We want to reject Hp if the mean coefficient of restitution exceeds 0.82. 
4. a — 0.05 
5. The test statistic 1s 
m- X — Mo 
s/Vn 


6. Reject Ho IT lo > 19.05.14 — ].761 


Percentage 
O1 
O 





0.78 0.83 0.88 
Coefficient of restitution 
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7. Computations: Since x = 0.83725, s = 0.02456, uo = 0.82, and n = 15, we have 


. 0.83725 — 0.82 


ty = = = 242 
0 0.02456/V/15 


8. Conclusions: Since tọ = 2.72 > 1.761, we reject Hp and conclude at the 0.05 level of 
significance that the mean coefficient of restitution exceeds 0.82. 


Minitab will conduct the one-sample t-test. The output from this software package is in the 
following display: 


One-Sample T: COR 
Test of mu = 0.82 vs mu > 0.82 
Variable N Mean StDev SE Mean 


COR 15 0.83725 0.02456 0.00634 


Variable 95.0% Lower Bound T JE 
COR 0.82608 2T? 0.008 





Notice that Minitab computes both the test statistic Tọ and a 95% lower confidence bound for 
the coefficient of restitution. Because the 95% lower confidence bound exceeds 0.82, we 
would reject the hypothesis that Hj: ju. = 0.82 and conclude that the alternative hypothesis 
Hi: u > 0.82 is true. Minitab also calculates a P-value for the test statistic Tọ. In the next 
section we explain how this is done. 


9.3.2  P- Value for a t- Test 


The P-value for a t-test is just the smallest level of significance at which the null hypothesis 
would be rejected. That is, it is the tail area beyond the value of the test statistic tọ for a one- 
sided test or twice this area for a two-sided test. Because the t-table in Appendix Table IV 
contains only 10 critical values for each f£ distribution, computation of the exact P-value 
directly from the table is usually impossible. However, it is easy to find upper and lower 
bounds on the P-value from this table. 

To illustrate, consider the t-test based on 14 degrees of freedom in Example 9-6. The 
relevant critical values from Appendix Table IV are as follows: 


Critical Value: 0.258 0.692 1.345 1.761 2.145 2.624 2.977 3.326 3.787 4.140 
Tail Area: 0.40 0.25 0.10 O05 Á 0.025 0.01 0.005 0.0025 0.001 0.0005 


Notice that £j = 2.72 in Example 9-6, and that this is between two tabulated values, 2.624 and 
2.977. Therefore, the P-value must be between 0.01 and 0.005. These are effectively the up- 
per and lower bounds on the P-value. 

Example 9-6 is an upper-tailed test. If the test is lower-tailed, just change the sign of tọ and 
proceed as above. Remember that for a two-tailed test the level of significance associated with a 
particular critical value is twice the corresponding tail area in the column heading. This consider- 
ation must be taken into account when we compute the bound on the P-value. For example, sup- 
pose that £j = 2.72 for a two-tailed alternate based on 14 degrees of freedom. The value 
ty > 2.624 (corresponding to a = 0.02) and tọ < 2.977 (corresponding to a = 0.01), so the 
lower and upper bounds on the P-value would be 0.01 < P < 0.02 for this case. 
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Finally, most computer programs report P-values along with the computed value of the 
test statistic. Some hand-held calculators also have this capability. In Example 9-6, Minitab 
gave the P-value for the value tọ = 2.72 in Example 9-6 as 0.008. 


9-3.3 Choice of Sample Size 


EXAMPLE 9-7 


The type II error probability for tests on the mean of a normal distribution with unknown vari- 
ance depends on the distribution of the test statistic in Equation 9-23 when the null hypothe- 
sis Ho: |. = Wo is false. When the true value of the mean is ju = p + ò, the distribution for Tọ 
is called the noncentral ¢ distribution with n — 1 degrees of freedom and noncentrality pa- 
rameter 6 V/n/o. Note that if 8 = 0, the noncentral f distribution reduces to the usual central 
t distribution. Therefore, the type II error of the two-sided alternative (for example) would be 


B = P{—tann-1 = To S tapn-i|5 * 0] 
= Fe = 2 = don 


where Tọ denotes the noncentral t random variable. Finding the type II error probability B for 
the t-test involves finding the probability contained between two points of the noncentral t 
distribution. Because the noncentral ¢-random variable has a messy density function, this in- 
tegration must be done numerically. 

Fortunately, this ugly task has already been done, and the results are summarized in a se- 
ries of O.C. curves in Appendix Charts Vle, VIJ, VIg, and VIA that plot B for the t-test against 
a parameter d for various sample sizes n. Curves are provided for two-sided alternatives on 
Charts VIe and VIf. The abscissa scale factor d on these charts is defined as 





IM — Mol — [òl 
d= v (9-24) 
For the one-sided alternative y > po or u < uo, we use charts VIg and VIh with 
IM — Mol — [òl 
d = - xxm (9-25) 


We note that d depends on the unknown parameter o?. We can avoid this difficulty in sev- 
eral ways. In some cases, we may use the results of a previous experiment or prior information 
to make a rough initial estimate of o?. If we are interested in evaluating test performance after 
the data have been collected, we could use the sample variance s? to estimate o^. If there is no 
previous experience on which to draw in estimating o^, we then define the difference in the 
mean d that we wish to detect relative to o. For example, if we wish to detect a small difference 
in the mean, we might use a value of d = |8|/o = 1 (for example), whereas if we are interested 
in detecting only moderately large differences in the mean, we might select d = |6|/o = 2 (for 
example). That is, it is the value of the ratio |6|/o that is important in determining sample size, 
and if it is possible to specify the relative size of the difference in means that we are interested in 
detecting, then a proper value of d can usually be selected. 


Consider the golf club testing problem from Example 9-6. If the mean coefficient of restitu- 
tion exceeds 0.82 by as much as 0.02, is the sample size n = 15 adequate to ensure that Hp: |. 
— 0.82 will be rejected with probability at least 0.8? 

To solve this problem, we will use the sample standard deviation s — 0.02456 to estimate 
c. Then d = |8|/o = 0.02/0.02456 = 0.81. By referring to the operating characteristic 
curves in Appendix Chart VIg (for a = 0.05) with d = 0.81 and n = 15, we find that B = 0.10, 
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approximately. Thus, the probability of rejecting Hp: ju = 0.82 if the true mean exceeds this 
by 0.02 is approximately 1 — B = 1 — 0.10 = 0.90, and we conclude that a sample size of 
n = 15 is adequate to provide the desired sensitivity. 


Minitab will also perform power and sample size computations for the one-sample t-test. 
Below are several calculations based on the golf club testing problem: 


Power and Sample Size 


]-Sample t Test 

Testing mean = null (versus > null) 
Calculating power for mean — null 4 difference 
Alpha = 0.05 Sigma = 0.02456 


Sample 
Difference Size Power 
0.02 I5 0.9117 


Power and Sample Size 


]-Sample t Test 
Testing mean = null (versus > null) 
Calculating power for mean = null + difference 


Alpha = 0.05 Sigma = 0.02456 


Sample 
Difference Size Power 
0.01 15 0.4425 


Power and Sample Size 


l-Sample t Test 

Testing mean = null (versus > null) 
Calculating power for mean = null + difference 
Alpha = 0.05 Sigma = 0.02456 


Sample Target ^ Actual 
Difference Size Power Power 
0.01 39 0.8000 0.8029 





In the first portion of the computer output, Minitab reproduces the solution to Example 9-7, veri- 
fying that a sample size of n = 15 is adequate to give power of at least 0.8 if the mean coefficient 
of restitution exceeds 0.82 by at least 0.02. In the middle section of the output, we asked Minitab 
to compute the power if the difference in u and pọ = 0.82 we wanted to detect was 0.01. Notice 
that with n = 15, the power drops considerably to 0.4425. The final portion of the output is the 
sample size required to give a power of at least 0.8 if the difference between u and uo of interest 
is actually 0.01. A much larger n is required to detect this smaller difference. 


9.3.4 Likelihood Ratio Approach to Development of Test Procedures 
(CD Only) 


EXERCISES FOR SECTION 9-3 


9.30. An article in the ASCE Journal of Energy Engineer- building material. Five samples of the material were tested in 
ing (1999, Vol. 125, pp. 59—75) describes a study of the ther- a structure, and the average interior temperature (°C) reported 


mal inertia properties of autoclaved aerated concrete used as a. Was as follows: 23.01, 22.22, 22.04, 22.62, and 22.59. 
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(a) Test the hypotheses Hp: w = 22.5 versus H,: w # 22.5, 
using a = 0.05. Find the P-value. 

(b) Is there evidence to support the assumption that interior 
temperature is normally distributed? 

(c) Compute the power of the test if the true mean interior 
temperature is as high as 22.75. 

(d) What sample size would be required to detect a true mean 
interior temperature as high as 22.75 if we wanted the 
power of the test to be at least 0.9? 

(e) Explain how the question in part (a) could be answered by 
constructing a two-sided confidence interval on the mean 
interior temperature. 


9-31. A 1992 article in the Journal of the American Medical 
Association (“A Critical Appraisal of 98.6 Degrees E the Upper 
Limit of the Normal Body Temperature, and Other Legacies of 

Carl Reinhold August Wundrlich") reported body temperature, 

gender, and heart rate for a number of subjects. The body tem- 

peratures for 25 female subjects follow: 97.8, 97.2, 97.4, 97.6, 

97.8, 97.9, 98.0, 98.0, 98.0, 98.1, 98.2, 98.3, 98.3, 98.4, 98.4, 

98.4, 98.5, 98.6, 98.6, 98.7, 98.8, 98.8, 98.9, 98.9, and 99.0. 

(a) Test the hypotheses Hp: w = 98.6 versus Hj: u # 98.6, 
using a = 0.05. Find the P-value. 

(b) Compute the power of the test if the true mean female 
body temperature is as low as 98.0. 

(c) What sample size would be required to detect a true mean 
female body temperature as low as 98.2 if we wanted the 
power of the test to be at least 0.9? 

(d) Explain how the question in part (a) could be answered by 
constructing a two-sided confidence interval on the mean 
female body temperature. 

(e) Is there evidence to support the assumption that female 
body temperature is normally distributed? 


9-32. Cloud seeding has been studied for many decades as 

a weather modification procedure (for an interesting study of 

this subject, see the article in Technometrics by Simpson, 

Alsen, and Eden, “A Bayesian Analysis of a Multiplicative 

Treatment Effect in Weather Modification", Vol. 17, pp. 161— 

166). The rainfall in acre-feet from 20 clouds that were se- 

lected at random and seeded with silver nitrate follows: 18.0, 

30.7 .19.5, 2L L 22,5, 13,8, 318,234 ,21,2, 27.9. 31.9, 2. 

25.0, 24.7, 26.9, 21.8, 29.2, 34.8, 26.7, and 31.6. 

(a) Can you support a claim that mean rainfall from seeded 
clouds exceeds 25 acre-feet? Use a = 0.01. 

(b) Is there evidence that rainfall is normally distributed? 

(c) Compute the power of the test if the true mean rainfall is 
27 acre-feet. 

(d) What sample size would be required to detect a true mean 
rainfall of 27.5 acre-feet if we wanted the power of the test 
to be at least 0.9? 

(e) Explain how the question in part (a) could be answered by 
constructing a one-sided confidence bound on the mean 
diameter. 


9-33. The sodium content of thirty 300-gram boxes of organic 
corn flakes was determined. The data (in milligrams) are as 
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follows: 131.15, 130.69, 130.91, 129.54, 129.64, 128.77, 130.72, 
128.33, 128.24, 129.65, 130.14, 129.29, 128.71, 129.00, 129.39, 
130.42, 129.53, 130.12, 129.78, 130.92, 131.15, 130.69, 130.91, 
129.54, 129.64, 128.77, 130.72, 128.33, 128.24, and 129.65. 

(a) Can you support a claim that mean sodium content of this 
brand of cornflakes is 130 milligrams? Use a = 0.05. 

(b) Is there evidence that sodium content is normally distrib- 
uted? 

(c) Compute the power of the test if the true mean sodium 
content is 130.5 miligrams. 

(d) What sample size would be required to detect a true mean 
sodium content of 130.1 milligrams if we wanted the 
power of the test to be at least 0.75? 

(e) Explain how the question in part (a) could be answered by 
constructing a two-sided confidence interval on the mean 
sodium content. 


9-34. Reconsider the tire testing experiment described in 

Exercise 8-22. 

(a) The engineer would like to demonstrate that the mean life 
of this new tire is in excess of 60,000 kilometers. Formu- 
late and test appropriate hypotheses, and draw conclu- 
sions using a = 0.05. 

(b) Suppose that if the mean life is as long as 61,000 kilome- 
ters, the engineer would like to detect this difference with 
probability at least 0.90. Was the sample size n = 16 used 
in part (a) adequate? Use the sample standard deviation s 
as an estimate of o in reaching your decision. 


9-35. Reconsider the Izod impact test on PVC pipe described 
in Exercise 8-23. Suppose that you want to use the data from this 
experiment to support a claim that the mean impact strength 
exceeds the ASTM standard (foot-pounds per inch). Formulate 
and test the appropriate hypotheses using a = 0.05. 


9-36.  Reconsider the television tube brightness experiment 
in Exercise 8-24. Suppose that the design engineer believes 
that this tube will require 300 microamps of current to pro- 
duce the desired brightness level. Formulate and test an 
appropriate hypothesis using a = 0.05. Find the P-value for 
this test. State any necessary assumptions about the underly- 
ing distribution of the data. 


9-57. Consider the baseball coefficient of restitution data 

first presented in Exercise 8-79. 

(a) Does the data support the claim that the mean coefficient 
of restitution of baseballs exceeds 0.635? Use a — 0.05. 

(b) What is the P-value of the test statistic computed in part (a)? 

(c) Compute the power of the test if the true mean coefficient 
of restitution is as high as 0.64. 

(d) What sample size would be required to detect a true mean 
coefficient of restitution as high as 0.64 if we wanted the 
power of the test to be at least 0.75? 

9-38. Consider the dissolved oxygen concentration at TVA 

dams first presented in Exercise 8-81. 

(a) Test the hypotheses Hp: ju = 4 versus Hj: y # 4. Use 
a = 0.01. 
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(b) What is the P-value of the test statistic computed in part (a)? 

(c) Compute the power of the test if the true mean dissolved 
oxygen concentration is as low as 3. 

(d) What sample size would be required to detect a true mean 
dissolved oxygen concentration as low as 2.5 if we 
wanted the power of the test to be at least 0.9? 


9-39. Consider the cigar tar content data first presented in 

Exercise 8-82. 

(a) Can you support a claim that mean tar content exceeds 
1.5? Use a = 0.05 

(b) Whatis the P-value of the test statistic computed in part (a)? 

(c) Compute the power of the test if the true mean tar content 
is 1.6. 

(d) What sample size would be required to detect a true mean 
tar content of 1.6 if we wanted the power of the test to be 
at least 0.8? 


9-40. Exercise 6-22 gave data on the heights of female 

engineering students at ASU. 

(a) Can you support a claim that mean height of female engi- 
neering students at ASU is 65 inches? Use a = 0.05 

(b) What is the P-value of the test statistic computed in part (a)? 

(c) Compute the power of the test if the true mean height is 
62 inches. 


(d) What sample size would be required to detect a true mean 
height of 64 inches if we wanted the power of the test to be 
at least 0.8? 

9-41. Exercise 6-24 presented data on the concentration of 

suspended solids in lake water. 

(a) Test the hypotheses Hp: ju = 55 versus Hj: u # 55, use 
a = 0.05. 

(b) What is the P-value of the test statistic computed in part (a)? 

(c) Compute the power of the test if the true mean concentra- 
tion is as low as 50. 

(d) What sample size would be required to detect a true mean 
concentration as low as 50 if we wanted the power of the 
test to be at least 0.9? 

9-42. Exercise 6-25 describes testing golf balls for an over- 

all distance standard. 

(a) Can you support a claim that mean distance achieved by 
this particular golf ball exceeds 280 yards? Use a = 0.05. 

(b) What is the P-value of the test statistic computed in part (a)? 

(c) Compute the power of the test if the true mean distance is 
290 yards. 

(e) What sample size would be required to detect a true mean 
distance of 290 yards if we wanted the power of the test to 
be at least 0.8? 


9-4 HYPOTHESIS TESTS ON THE VARIANCE AND STANDARD 
DEVIATION OF A NORMAL POPULATION 


Sometimes hypothesis tests on the population variance or standard deviation are needed. 
When the population is modeled by a normal distribution, the tests and intervals described in 


this section are applicable. 


9-4.1 The Hypothesis Testing Procedures 


Suppose that we wish to test the hypothesis that the variance of a normal population o? equals 
a specified value, say o$, or equivalently, that the standard deviation ø is equal to oy. Let X, 
X>,..., X, bea random sample of n observations from this population. To test 


we will use the test statistic: 


Hy o^ = o$ 


(9-26) 
Hee +a 





If the null hypothesis Hp: o^ = o$ is true, the test statistic X5 defined in Equation 9-27 
follows the chi-square distribution with n — 1 degrees of freedom. This is the reference 
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f (x) 


2 
O XI -a/2,n-1 


2 
Xn-1 


(a) 


f(x) Tx) 
CN Xs 


a/2 a 


2 2 2 
Xai2,n-1 * O Xunoi ~ O X asd T 


(5) (c) 


Figure 9-10 Reference distribution for the test of Hy: o? = o$ with critical region values for (a) Hj: o? # o4, 
(b) Ha > oå, and (c) Hi: 0? o5 
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distribution for this test procedure. Therefore, we calculate x5, the value of the test statistic Xĝ, 


and the null hypothesis Hj: o? = o5 would be rejected if 


2 2 2 2 
Xo 7 Xa/2,n-1 Orit x9 = XT cua ed 


where X52.n-1 and Xi-a/2n-1 are the upper and lower 1000/2 percentage points of the chi- 
square distribution with n — 1 degrees of freedom, respectively. Figure 9-10(a) shows the 
critical region. 

The same test statistic 1s used for one-sided alternative hypotheses. For the one-sided 
hypothesis 


leo =o, 


(9-28) 
Hpo SG, 
we would reject Hy if x5 > ne whereas for the other one-sided hypothesis 
Hy o? = o$ 
abe. (9-29) 


Heo <0 


we would reject H, if x5 < AU emos The one-sided critical regions are shown in Figure 
9-10(b) and (c). 


An automatic filling machine is used to fill bottles with liquid detergent. A random sample of 
20 bottles results in a sample variance of fill volume of s? = 0.0153 (fluid ounces)’. If the 
variance of fill volume exceeds 0.01 (fluid ounces)", an unacceptable proportion of bottles 
will be underfilled or overfilled. Is there evidence in the sample data to suggest that the man- 
ufacturer has a problem with underfilled or overfilled bottles? Use a = 0.05, and assume that 
fill volume has a normal distribution. 

Using the eight-step procedure results in the following: 


1. The parameter of interest is the population variance o. 


2. Hy o0? = 0.01 
3. H: o > 001 
4. a= 0.05 
5. The test statistic is 
4 um 1)s? 
Xo ~ 2 
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6. Reject Ho if X6 > X60s19 = 30.14. 


7. Computations: 


, 19(0.0153) - 2507 
X 0o €^ 


8. Conclusions: Since Xo = 29.07 < X0.05,19 = 30.14, we conclude that there is no 
strong evidence that the variance of fill volume exceeds 0.01 (fluid ounces)”. 


Using Appendix Table III, it is easy to place bounds on the P-value of a chi-square test. 
From inspection of the table, we find that X6.10.19 — 27.20 and X0.05,19 — 30.14. Since 
277.20 « 29.07 « 30.14, we conclude that the P-value for the test in Example 9-8 is in the 
interval 0.05 « P « 0.10. The actual P-value is P = 0.0649. (This value was obtained from 
a calculator.) 


9.4.2  D-Error and Choice of Sample Size 


EXAMPLE 9-9 


Operating characteristic curves for the chi-square tests in Section 9-4.] are provided in 
Appendix Charts VIi through VIn for a = 0.05 and a = 0.01. For the two-sided alternative 
hypothesis of Equation 9-26, Charts VIi and VIj plot Q against an abscissa parameter 


A= — (9-30) 


for various sample sizes n, where o denotes the true value of the standard deviation. Charts 
VIk and VU are for the one-sided alternative H;: g^ > 05. while Charts VIm and VIn are for 
the other one-sided alternative H,: 07 < o. In using these charts, we think of ø as the value 
of the standard deviation that we want to detect. 

These curves can be used to evaluate the Q-error (or power) associated with a particu- 
lar test. Alternatively, they can be used to design a test—that is, to determine what sample 
size is necessary to detect a particular value of o that differs from the hypothesized 
value oy. 


Consider the bottle-filling problem from Example 9-8. If the variance of the filling process 
exceeds 0.01 (fluid ounces)?, too many bottles will be underfilled. Thus, the hypothesized 
value of the standard deviation is og = 0.10. Suppose that if the true standard deviation of the 
filling process exceeds this value by 25%, we would like to detect this with probability at least 
0.8. Is the sample size of n — 20 adequate? 

To solve this problem, note that we require 


" c _ 0.125 "m 
|. 99 010 ` 


This is the abscissa parameter for Chart VIA. From this chart, with n = 20 and À = 1.25, we 
find that B = 0.6. Therefore, there is only about a 40% chance that the null hypothesis will be 
rejected if the true standard deviation is really as large as o. = 0.125 fluid ounce. 

To reduce the (-error, a larger sample size must be used. From the operating characteris- 
tic curve with B = 0.20 and à = 1.25, we find that n = 75, approximately. Thus, if we want 
the test to perform as required above, the sample size must be at least 75 bottles. 
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EXERCISES FOR SECTION 9-4 


9-45. Consider the rivet holes from Exercise 8-35. If the 

standard deviation of hole diameter exceeds 0.01 millimeters, 

there is an unacceptably high probability that the rivet will not 
fit. Recall that n = 15 and s = 0.008 millimeters. 

(a) Is there strong evidence to indicate that the standard devi- 
ation of hole diameter exceeds 0.01 millimeters? Use a = 
0.01. State any necessary assumptions about the underly- 
ing distribution of the data. 

(b) Find the P-value for this test. 

(c) If o is really as large as 0.0125 millimeters, what sample size 
will be required to defect this with power of at least 0.8? 


9-44. Recall the sugar content of the syrup in canned peaches 

from Exercise 8-36. Suppose that the variance is thought to be 

o? = 18 (milligrams)". A random sample of n = 10 cans yields 

a sample standard deviation of s — 4.8 milligrams. 

(a) Test the hypothesis Hy: o? = 18 versus H,: o^ # 18 using 
a = 0.05. 

(b) What is the P-value for this test? 

(c) Discuss how part (a) could be answered by constructing a 
95% two-sided confidence interval for o. 


9-45. Consider the tire life data in Exercise 8-22. 

(a) Can you conclude, using a = 0.05, that the standard devia- 
tion of tire life exceeds 200 kilometers? State any necessary 
assumptions about the underlying distribution of the data. 

(b) Find the P-value for this test. 


9-46. Consider the Izod impact test data in Exercise 8-23. 

(a) Test the hypothesis that o = 0.10 against an alternative 
specifying that o # 0.10, using a = 0.01, and draw a 
conclusion. State any necessary assumptions about the 
underlying distribution of the data. 

(b) What is the P-value for this test? 

(c) Could the question in part (a) have been answered by 
constructing a 99% two-sided confidence interval for o?? 


9-47. Reconsider the percentage of titanium in an alloy used 
in aerospace castings from Exercise 8-39. Recall that s = 0.37 
and n = 51. 

(a) Test the hypothesis Hy: o = 0.25 versus H,: o # 0.25 
using a = 0.05. State any necessary assumptions about 
the underlying distribution of the data. 

(b) Explain how you could answer the question in part (a) by 
constructing a 9596 two-sided confidence interval for o. 


9-48. Consider the hole diameter data in Exercise 8-35. 
Suppose that the actual standard deviation of hole diameter 
exceeds the hypothesized value by 50%. What is the probabil- 
ity that this difference will be detected by the test described in 
Exercise 9-43? 

9-49. Consider the sugar content in Exercise 9-44. Suppose 
that the true variance is o? — 40. How large a sample would be 
required to detect this difference with probability at least 0.90? 


9-5 TESTS ON A POPULATION PROPORTION 


It is often necessary to test hypotheses on a population proportion. For example, suppose that 
a random sample of size n has been taken from a large (possibly infinite) population and that 
X(- n) observations in this sample belong to a class of interest. Then PÊ = X/nisa point esti- 
mator of the proportion of the population p that belongs to this class. Note that n and p are the 
parameters of a binomial distribution. Furthermore, from Chapter 7 we know that the sam- 
pling distribution of Pis approximately normal with mean p and variance p(1 — p)/n, if p is 
not too close to either 0 or 1 and if n 1s relatively large. Typically, to apply this approximation 
we require that np and n(1 — p) be greater than or equal to 5. We will give a large-sample test 
that makes use of the normal approximation to the binomial distribution. 


9.5.1  Large-Sample Tests on a Proportion 


In many engineering problems, we are concerned with a random variable that follows the 
binomial distribution. For example, consider a production process that manufactures items 
that are classified as either acceptable or defective. It is usually reasonable to model the oc- 
currence of defectives with the binomial distribution, where the binomial parameter p repre- 
sents the proportion of defective items produced. Consequently, many engineering decision 
problems include hypothesis testing about p. 


We will consider testing 


Ho: p = po 
Hi: p # po 


(9-31) 


EXAMPLE 9-10 
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An approximate test based on the normal approximation to the binomial will be given. As 
noted above, this approximate procedure will be valid as long as p is not extremely close to 
zero or one, and if the sample size is relatively large. Let X be the number of observations in 
a random sample of size n that belongs to the class associated with p. Then, if the null 
hypothesis Hop: p = po is true, we have X ~ N[npo, npo(1 — po)], approximately. To test 
Hs: p = po, calculate the test statistic 


X — npo 


vnpo(l — po) 





and reject Hy: p = po if 
Zo a> Za/2 Or Zo < —Za/2 


Note that the standard normal distribution is the reference distribution for this test statistic. 
Critical regions for the one-sided alternative hypotheses would be constructed in the usual manner. 


A semiconductor manufacturer produces controllers used in automobile engine applications. 
The customer requires that the process fallout or fraction defective at a critical manufacturing 
step not exceed 0.05 and that the manufacturer demonstrate process capability at this level of 
quality using a = 0.05. The semiconductor manufacturer takes a random sample of 200 
devices and finds that four of them are defective. Can the manufacturer demonstrate process 
capability for the customer? 

We may solve this problem using the eight-step hypothesis-testing procedure as follows: 


1. The parameter of interest is the process fraction defective p. 

2. Ho: P 0.05 
H: p < 0.05 
This formulation of the problem will allow the manufacturer to make a strong claim 
about process capability if the null hypothesis Hj: p = 0.05 is rejected. 


a = 0.05 
The test statistic is (from Equation 9-32) 


X — npo 


V npo(1 = Po) 


Zo = 


where x = 4, n = 200, and p, = 0.05. 
6. Reject Ho: p = 0.05 if Zo < —_ Z0.05 — — 1.645 


7. Computations: The test statistic 1s 


NE a 200(0.05) — 
70 200005095) 
8. Conclusions: Since zọ = —1.95 < —Zo 95 = —1.645, we reject Ho and conclude that the 


process fraction defective p is less than 0.05. The P-value for this value of the test statistic 
zo is P = 0.0256, which is less than a = 0.05. We conclude that the process is capable. 
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Another form of the test statistic Zọ in Equation 9-32 is occasionally encountered. Note 
that if X is the number of observations in a random sample of size n that belongs to a class of 
interest, then P — X/n is the sample proportion that belongs to that class. Now divide both 
numerator and denominator of Zọ in Equation 9-32 by n, giving 





X/n — po 
Zo = 
V po(l — po)/n 
or 
x 
7 = ED (9-33) 


V po(l — po)/n 


This presents the test statistic in terms of the sample proportion instead of the number of items 
X in the sample that belongs to the class of interest. 

Statistical software packages usually provide the one sample Z-test for a proportion. The 
Minitab output for Example 9-10 follows. 


Test and CI for One Proportion 
Test of p — 0.05 vs p « 0.05 


Sample X N Sample p 95.0% Upper Bound Z-Value  P-Value 
1 4 200 0.020000 0.036283 = 95 0.026 
* NOTE * The normal approximation may be inaccurate for small samples. 





Notice that both the test statistic (and accompanying P-value) and the 95% one-sided upper 
confidence bound are displayed. The 9596 upper confidence bound is 0.036283, which is less 
than 0.05. This is consistent with rejection of the null hypothesis H,: p = 0.05. 


9.5.2 Small-.Sample Tests on a Proportion (CD Only) 


9-5.3 Type II Error and Choice of Sample Size 


It is possible to obtain closed-form equations for the approximate B-error for the tests in 
Section 9-5.1. Suppose that p is the true value of the population proportion. The approximate 
D-error for the two-sided alternative H,: p # po is 


B = »(^ —p + Za? V Poll — Po 7 »(^ — P — zy Poll — rU) mu 
Vp(1 = p)/n Vp(1 = p)/n 


If the alternative is H,: p < po, 


P 02 7 P — Za Vp«l — pm) 


Val = pym VEM 


B = 
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whereas if the alternative is H,: p > po, 


B = »(^ — P + 2aV poll — t (9-36) 


Vp(1l — p)/n 


These equations can be solved to find the approximate sample size n that gives a test of level 
a that has a specified B risk. The sample size equations are 


"n E V po(1 EE Do) + Zzg v p(1l -2 


P Po 





for the two-sided alternative and 


e Bi 1) = E ee me WIE) — e 


P Po 





for a one-sided alternative. 


Consider the semiconductor manufacturer from Example 9-10. Suppose that its process fall- 
out is really p = 0.03. What is the B-error for a test of process capability that uses n = 200 
and a — 0.05? 

The B-error can be computed using Equation 9-35 as follows: 


0.05 — 0.03 — (1.645)/0.05(0.95)/200 


pope V0.03(1 — 0.03)/200 


= 1 — $(-044) = 0.67 


Thus, the probability is about 0.7 that the semiconductor manufacturer will fail to con- 
clude that the process is capable if the true process fraction defective is p = 0.03 (3%). That 
is, the power of the test against this particular alternative is only about 0.3. This appears to be 
a large B-error (or small power), but the difference between p = 0.05 and p = 0.03 is fairly 
small, and the sample size n = 200 is not particularly large. 

Suppose that the semiconductor manufacturer was willing to accept a Q-error as large as 
0.10 if the true value of the process fraction defective was p — 0.03. If the manufacturer con- 
tinues to use a = 0.05, what sample size would be required? 

The required sample size can be computed from Equation 9-38 as follows: 


1.645/0.05(0.95) + 1.28/0.03(0.97) 2 
0.03 — 0.05 


n = 
= 832 


where we have used p = 0.03 in Equation 9-38. Note that n = 832 is a very large sample size. 
However, we are trying to detect a fairly small deviation from the null value pọ = 0.05. 
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Minitab will also perform power and sample size calculations for the one-sample Z-test on a 
proportion. Output from Minitab for the engine controllers tested in Example 9-10 follows. 


Power and Sample Size 


Test for One Proportion 
Testing proportion = 0.05 (versus < 0.05) 
Alpha = 0.05 


Alternative Sample 
Proportion Size Power 
3.00E-02 200 0.3287 


Power and Sample Size 


Test for One Proportion 
Testing proportion = 0.05 (versus < 0.05) 
Alpha = 0.05 


Alternative Sample Target Actual 
Proportion Size Power Power 
3.00E-02 833 0.9000 0.9001 


Power and Sample Size 


Test for One Proportion 
Testing proportion = 0.05 (versus < 0.05) 
Alpha = 0.05 


Alternative Sample Target Actual 
Proportion Size Power Power 
3.00E-02 561 0.7500 0.7503 





The first part of the output shows the power calculation based on the situation described in 
Example 9-11, where the true proportion is really 0.03. The power calculation from Minitab 
agrees with the results from Equation 9-35 in Example 9-11. The second part of the output 
computes the sample size necessary to give a power of 0.9 (B = 0.1) if p = 0.03. Again, the 
results agree closely with those obtained from Equation 9-38. The final portion of the display 
shows the sample size that would be required if p — 0.03 and the power requirement is re- 
laxed to 0.75. Notice that the sample size of n = 561 is still quite large because the difference 
between p — 0.05 and p — 0.03 1s fairly small. 


EXERCISES FOR SECTION 9-5 


9-50. In a random sample of 85 automobile engine crank- p — 0.15, how large would the sample size have to be for us 
shaft bearings, 10 have a surface finish roughness that exceeds to have a probability of correctly rejecting the null hypothe- 
the specifications. Does this data present strong evidence that sis of 0.9? 

the proportion of crankshaft bearings exhibiting excess sur- 9.52. Reconsider the integrated circuits described in 
face roughness exceeds 0.10? State and test the appropriate Exercise S48 

hypotheses using a = 0.05. (a) Use the data to test Hp: p = 0.05 versus H,: p # 0.05. Use 
9-5]. Continuation of Exercise 9-50. If it is really the a = 0.05. 

situation that p = 0.15, how likely is it that the test proce- (b) Find the P-value for the test. 


dure in Exercise 9-50 will not reject the null hypothesis? If 


9-53. Consider the defective circuit data in Exercise 8-48. 

(a) Do the data support the claim that the fraction of defective 
units produced is less than 0.05, using a = 0.05? 

(b) Find the P-value for the test. 


9-54. An article in Fortune (September 21, 1992) claimed 
that nearly one-half of all engineers continue academic studies 
beyond the B.S. degree, ultimately receiving either an M.S. or 

a Ph.D. degree. Data from an article in Engineering Horizons 

(Spring 1990) indicated that 117 of 484 new engineering 

graduates were planning graduate study. 

(a) Are the data from Engineering Horizons consistent with 
the claim reported by Fortune? Use a = 0.05 in reaching 
your conclusions. 

(b) Find the P-value for this test. 

(c) Discuss how you could have answered the question in part 
(a) by constructing a two-sided confidence interval on p. 


9-55. A manufacturer of interocular lenses is qualifying a 
new grinding machine and will qualify the machine if the per- 
centage of polished lenses that contain surface defects does 
not exceed 2%. A random sample of 250 lenses contains six 
defective lenses. 
(a) Formulate and test an appropriate set of hypotheses to de- 
termine if the machine can be qualified. Use a = 0.05. 
(b) Find the P-value for the test in part (a). 
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9-56. A researcher claims that at least 10% of all football 

helmets have manufacturing flaws that could potentially cause 

injury to the wearer. A sample of 200 helmets revealed that 16 

helmets contained such defects. 

(a) Does this finding support the researcher's claim? Use 
a = 0.01. 

(b) Find the P-value for this test. 


9-57. Arandom sample of 500 registered voters in Phoenix 

is asked if they favor the use of oxygenated fuels year-round 

to reduce air pollution. If more than 315 voters respond posi- 

tively, we will conclude that at least 60% of the voters favor 

the use of these fuels. 

(a) Find the probability of type I error if exactly 60% of the 
voters favor the use of these fuels. 

(b) What is the type II error probability B if 75% of the voters 
favor this action? 


9-58. The advertized claim for batteries for cell phones is set 
at 48 operating hours, with proper charging procedures. A study 
of 5000 batteries is carried out and 15 stop operating prior to 48 
hours. Do these experimental results support the claim that less 
than 0.2 percent of the company's batteries will fail during the 
advertized time period, with proper charging procedures? Use a 
hypothesis-testing procedure with o. — 0.01. 


9.6 SUMMARY TABLE OF INFERENCE PROCEDURES 


FOR A SINGLE SAMPLE 


The table in the end papers of this book (inside front cover) presents a summary of all the 
single-sample inference procedures from Chapters 8 and 9. The table contains the null 
hypothesis statement, the test statistic, the various alternative hypotheses and the criteria 
for rejecting Ho, and the formulas for constructing the 100(1 — a)% two-sided confidence 


interval. 


9-7 ‘TESTING FOR GOODNESS OF FIT 


The hypothesis-testing procedures that we have discussed in previous sections are designed 
for problems in which the population or probability distribution is known and the hypotheses 
involve the parameters of the distribution. Another kind of hypothesis is often encountered: 
we do not know the underlying distribution of the population, and we wish to test the hypoth- 
esis that a particular distribution will be satisfactory as a population model. For example, we 
might wish to test the hypothesis that the population is normal. 

We have previously discussed a very useful graphical technique for this problem called 
probability plotting and illustrated how it was applied in the case of a normal distribution. 
In this section, we describe a formal goodness-of-fit test procedure based on the chi-square 


distribution. 
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EXAMPLE 9-12 


The test procedure requires a random sample of size n from the population whose proba- 
bility distribution is unknown. These n observations are arranged in a frequency histogram, 
having K bins or class intervals. Let O; be the observed frequency in the ith class interval. From 
the hypothesized probability distribution, we compute the expected frequency in the ith class 
interval, denoted £;. The test statistic is 





It can be shown that, if the population follows the hypothesized distribution, X2 has, approx- 
imately, a chi-square distribution with k — p — 1 degrees of freedom, where p represents the 
number of parameters of the hypothesized distribution estimated by sample statistics. This ap- 
proximation improves as 7 increases. We would reject the hypothesis that the distribution of 
the population is the hypothesized distribution if the calculated value of the test statistic 
M0 = XS gea 

One point to be noted in the application of this test procedure concerns the magnitude 
of the expected frequencies. If these expected frequencies are too small, the test statistic X5 
will not reflect the departure of observed from expected, but only the small magnitude of 
the expected frequencies. There is no general agreement regarding the minimum value of 
expected frequencies, but values of 3, 4, and 5 are widely used as minimal. Some writers 
suggest that an expected frequency could be as small as 1 or 2, so long as most of them ex- 
ceed 5. Should an expected frequency be too small, it can be combined with the expected 
frequency in an adjacent class interval. The corresponding observed frequencies would then 
also be combined, and k would be reduced by 1. Class intervals are not required to be of 
equal width. 

We now give two examples of the test procedure. 


A Poisson Distribution 
The number of defects in printed circuit boards is hypothesized to follow a Poisson distribu- 
tion. A random sample of n = 60 printed boards has been collected, and the following num- 
ber of defects observed. 


Number of Observed 


Defects Frequency 
0 32 
1 15 
2 9 
3 4 


The mean of the assumed Poisson distribution in this example is unknown and must be 
estimated from the sample data. The estimate of the mean number of defects per board is the 
sample average, that is, (322 - 0 + 15-1 +9-2 + 4- 3)/60 = 0.75. From the Poisson 
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distribution with parameter 0.75, we may compute p,, the theoretical, hypothesized probabil- 
ity associated with the ith class interval. Since each class interval corresponds to a particular 
number of defects, we may find the p; as follows: 


g "(DS 


TH — P(X = 0) = 00 — 0.472 
e 079.75)! 

posee x. ceo 
eh OTS) 

pa ~ P(X = 2) = 03 — 0.133 


p4 = P(X = 3) = 1 — (p, + p + py) = 0.041 


The expected frequencies are computed by multiplying the sample size n = 60 times the 
probabilities p; That is, E; = np;. The expected frequencies follow: 


Number of Expected 
Defects Probability Frequency 

0 0.472 28.32 

1 0.354 21.24 

2 0.133 7.98 

3 (or more) 0.041 2.46 


Since the expected frequency in the last cell is less than 3, we combine the last two cells: 


Number of Observed Expected 


Defects Frequency Frequency 
0 32 28.32 
1 15 21.24 
2 (or more) 13 10.44 


The chi-square test statistic in Equation 9-39 will have k — p — 1 = 3 — 1 — 1 = I degree 
of freedom, because the mean of the Poisson distribution was estimated from the data. 

The eight-step hypothesis-testing procedure may now be applied, using a = 0.05, as 
follows: 

1. The variable of interest is the form of the distribution of defects in printed circuit boards. 
Ho: The form of the distribution of defects is Poisson. 
H: The form of the distribution of defects is not Poisson. 
a = 0.05 


The test statistic 1s 


S ue c I. 
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EXAMPLE 9-13 


Reject Ho if x5 > x6051 = 3.84. 


Computations: 


, (32 — 28.32)? (15 — 21.24) (13 — 1044) 
X6 = 8 te = 204 
28:32 21.24 10.44 


8. Conclusions: Since x? = 2.94 < Xo — 3.84, we are unable to reject the null hypothesis 
that the distribution of defects in printed circuit boards is Poisson. The P-value for the 
test is P — 0.0864. (This value was computed using an HP-48 calculator.) 


A Continuous Distribution 

A manufacturing engineer is testing a power supply used in a notebook computer and, using 
a = 0.05, wishes to determine whether output voltage is adequately described by a normal dis- 
tribution. Sample estimates of the mean and standard deviation of x — 5.04 V and s — 0.08 V 
are obtained from a random sample of n — 100 units. 

A common practice in constructing the class intervals for the frequency distribution used 
in the chi-square goodness-of-fit test is to choose the cell boundaries so that the expected fre- 
quencies E; = np; are equal for all cells. To use this method, we want to choose the cell bound- 
aries ao, 41, ..., a, for the k cells so that all the probabilities 


p; = P(agj:. 1 S XS qj) = | feas 


aj-| 


are equal. Suppose we decide to use k = 8 cells. For the standard normal distribution, the inter- 
vals that divide the scale into eight equally likely segments are [0, 0.32), [0.32, 0.675) [0.675, 
1.15), [1.15, ©) and their four “mirror image" intervals on the other side of zero. For each inter- 
val p; = 1/8 = 0.125, so the expected cell frequencies are E; = np;— 100(0.125) = 12.5. The 
complete table of observed and expected frequencies 1s as follows: 


Class Observed Expected 
Interval Frequency o; Frequency £; 

x « 4.948 12 12.5 
4.948 = x < 4.986 14 12.3 
4.986 x x < 5.014 12 12.5 
5.014 = x < 5.040 13 12:3 
5.040 x x « 5.066 12 12:3 
5.066 x x « 5.094 11 12:3 
5.094 = x « 5.132 12 12.5 
51322x 14 12:3 
Totals 100 100 


The boundary of the first class interval is x — 1.15s = 4.948. The second class interval is 
[x — 1.15s, x — 0.6755) and so forth. We may apply the eight-step hypothesis-testing proce- 
dure to this problem. 

1. The variable of interest is the form of the distribution of power supply voltage. 


2. Hg The form of the distribution is normal. 
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Hi: The form of the distribution is nonnormal. 
a = 0.05 


The test statistic 1s 


Since two parameters in the normal distribution have been estimated, the chi-square 
statistic above will have k —p —1=8-—2-— 1 =5 degrees of freedom. 
Therefore, we will reject Hy if x6 > X6.0s.5 = 11.07. 


Computations: 
3 (o; — Ej) 
2 l l 
X — 
 (12—125y (14 — 12.5) — (14 — 12,5)" 
i 12.5 12.5 12.5 
= 0.64 


8. Conclusions: Since x? = 0.64 < X0955 = 11.07, we are unable to reject Ho, and there 
is no strong evidence to indicate that output voltage is not normally distributed. The 
P-value for the chi-square statistic xc = 0.64 is P = 0.9861. 


EXERCISES FOR SECTION 9-7? 


9-59, Consider the following frequency table of observa- 
tions on the random variable X. 


Values 0 l 2 3 4 
Observed Frequency 24 30 3l 11 4 


(a) Based on these 100 observations, is a Poisson distribution 
with a mean of 1.2 an appropriate model? Perform a good- 
ness-of-fit procedure with a = 0.05. 

(b) Calculate the P-value for this test. 


9-60. Let X denote the number of flaws observed on a 
large coil of galvanized steel. Seventy-five coils are in- 
spected and the following data were observed for the values 
of X: 


Values 1 2 3 4 3 6 7 8 
Observed 
Frequency l 11 8 13 11 12 10 9 


(a) Does the assumption of the Poisson distribution seem ap- 
propriate as a probability model for this data? Use a = 0.01. 

(b) Calculate the P-value for this test. 

9-61. The number of calls arriving at a switchboard 

from noon to 1 PM during the business days Monday through 

Friday is monitored for six weeks (1.e., 30 days). Let X be 


defined as the number of calls during that one-hour period. 
The relative frequency of calls was recorded and reported as 


Value 5 6 8 9 10 
Relative 

Frequency 0.067 0.067 0.100 0.133 0.200 
Value 11 12 13 14 15 
Relative 

Frequency 0.133 0.133 0.067 0.033 0.067 


(a) Does the assumption of a Poisson distribution seem appro- 
priate as a probability model for this data? Use a = 0.05. 
(b) Calculate the P-value for this test. 


9-62. Consider the following frequency table of observa- 
tions on the random variable X: 


Values 0 ] 2 3 4 
Frequency 4 21 10 13 2 


(a) Based on these 50 observations, is a binomial distribution 
with n = 6 and p = 0.25 an appropriate model? Perform 
a goodness-of-fit procedure with a = 0.05. 

(b) Calculate the P-value for this test. 
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9-63. Define X as the number of underfilled bottles from a 
. E Vehicles Vehicles 
filling operation in a carton of 24 bottles. Sixty cartons are 
donddfollowiueob Y "e per Observed per Observed 
inspected and the following observations on X are recorded: Minute Frequency Minute Frequency 
Values 1 2 3 40 14 53 102 
Frequency 23 12 1 41 24 54 96 
42 5/ 55 90 
(a) Based on these 75 observations, is a binomial distribution 
l 43 111 56 8l 
an appropriate model? Perform a goodness-of-fit proce- 
dure with a = 0.05. ki d A qe 
(b) Calculate the P-value for this test. 45 256 58 64 
9-64. The number of cars passing eastbound through the in- 46 296 59 61 
tersection of Mill and University Avenues has been tabulated 47 378 60 39 
by a group of civil engineering students. They have obtained 48 250 61 50 
the data in the adjacent table: | mM 49 185 62 42 
(a) Does the assumption of a Poisson distribution seem 50 171 63 29 
appropriate as a probability model for this process? Use 
a = 0.05. 51 150 64 18 
(b) Calculate the P-value for this test. 22 110 65 15 


9.8 CONTINGENCY TABLE TESTS 


Many times, the n elements of a sample from a population may be classified according to two 
different criteria. It is then of interest to know whether the two methods of classification are 
statistically independent; for example, we may consider the population of graduating engi- 
neers, and we may wish to determine whether starting salary is independent of academic dis- 
ciplines. Assume that the first method of classification has r levels and that the second method 
has c levels. We will let O; be the observed frequency for level i of the first classification 
method and level j on the second classification method. The data would, in general, appear as 
shown in Table 9-2. Such a table is usually called an r X c contingency table. 

We are interested in testing the hypothesis that the row-and-column methods of classifi- 
cation are independent. If we reject this hypothesis, we conclude there is some interaction be- 
tween the two criteria of classification. The exact test procedures are difficult to obtain, but an 
approximate test statistic is valid for large n. Let p; be the probability that a randomly selected 


element falls in the ijth cell, given that the two classifications are independent. Then p; = u,v, 


Table 9-2 An r X c Contingency Table 


Columns 
(e 
1 Oie 
2 O5. 


Rows 





EXAMPLE 9-14 


Table 9-3 Observed Data for Example 9-14 


Job Classification 


Salaried workers 
Hourly workers 


Totals 
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where u; is the probability that a randomly selected element falls in row class i and v; is the 
probability that a randomly selected element falls in column class j. Now, assuming inde- 
pendence, the estimators of u; and v; are 


A 1 £ 
nE 
j=l 
„n le 
ye Oj (9-40) 
i=] 
Therefore, the expected frequency of each cell is 
A 1 C r 
Ey = nu; 7j = n O;j Oi (9-41) 
j=l i=] 
Then, for large n, the statistic 
r € (O;- E; 
2 ij ij 
Xo = (9-42) 
! i-l j=l Ej 


has an approximate chi-square distribution with (r — 1)(c — 1) degrees of freedom if the null 
hypothesis is true. Therefore, we would reject the hypothesis of independence if the observed 
value of the test statistic x? exceeded X cedi 


A company has to choose among three pension plans. Management wishes to know whether 
the preference for plans is independent of job classification and wants to use a = 0.05. The 
opinions of a random sample of 500 employees are shown in Table 9-3. 

To find the expected frequencies, we must first compute = (340/500) = 0.68, à; = 
(160/500) = 0.32, $, = (200/500) = 0.40, >, = (200/500) = 0.40, and ® = (100/500) = 
0.20. The expected frequencies may now be computed from Equation 9-41. For example, the 
expected number of salaried workers favoring pension plan | is 


The expected frequencies are shown in Table 9-4. 


The eight-step hypothesis-testing procedure may now be applied to this problem. 


1. The variable of interest is employee preference among pension plans. 


2. Hp: Preference is independent of salaried versus hourly job classification. 


Table 9-4 Expected Frequencies for Example 9-14 


Pension Plan 


Job Classification 1 2 3 


Pension Plan 


1 D 3 Totals Totals 
160 140 40 340 Salaried workers 136 136 68 340 

40 60 60 160 Hourly workers 64 64 32 160 
200 200 100 500 Totals 200 200 100 500 
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3. H: Preference is not independent of salaried versus hourly job classification. 
a = 0.05 


The test statistic 1s 


6. Sincer = 2 and c = 3, the degrees of freedom for chi-square are (r — 1)(c — 1) = 
(1)(2) = 2, and we would reject Hy if x5 > X6955 = 5.99. 


7. Computations: 


j 
(160 — 136) " (140 — 136)? " (40 — 68) F (40 — 64) 
136 136 68 64 
F (60 — 64) N (60 —32) _ ion 
64 32. o 

8. Conclusions: Since x5 = 49.63 > X0.052 = 5.99, we reject the hypothesis of inde- 
pendence and conclude that the preference for pension plans is not independent of 
job classification. The P-value for x5 = 49.63 is P = 1.671 X 10 !!. (This value 
was computed using a hand-held calculator.) Further analysis would be necessary to 
explore the nature of the association between these factors. It might be helpful to 
examine the table of observed minus expected frequencies. 


Using the two-way contingency table to test independence between two variables of 
classification in a sample from a single population of interest is only one application of con- 
tingency table methods. Another common situation occurs when there are r populations of 
interest and each population is divided into the same c categories. A sample is then taken from 
the ith population, and the counts are entered in the appropriate columns of the ith row. In this 
situation we want to investigate whether or not the proportions in the c categories are the same 
for all populations. The null hypothesis in this problem states that the populations are homo- 
geneous with respect to the categories. For example, when there are only two categories, such 
as success and failure, defective and nondefective, and so on, the test for homogeneity 1s really 
a test of the equality of r binomial parameters. Calculation of expected frequencies, determi- 
nation of degrees of freedom, and computation of the chi-square statistic for the test for ho- 
mogeneity are identical to the test for independence. 


EXERCISES FOR SECTION 9-8 


9-65. A company operates four machines three shifts each 


day. From production records, the following data on the num- . Machines .— — 

ber of breakdowns are collected: Bonita 4 ee DEC ME 
1 41 20 12 16 
2 3l Li 9 14 


3 I5 17 16 10 


Test the hypothesis (using a = 0.05) that breakdowns are 
independent of the shift. Find the P-value for this test. 


9-66. Patients in a hospital are classified as surgical or med- 
ical. A record is kept of the number of times patients require 
nursing service during the night and whether or not these 
patients are on Medicare. The data are presented here: 


Patient Category 

Medicare Surgical Medical 
Yes 46 52 
No 36 43 


Test the hypothesis (using a = 0.01) that calls by surgical- 
medical patients are independent of whether the patients are 
receiving Medicare. Find the P-value for this test. 


9-67. Grades in a statistics course and an operations re- 
search course taken simultaneously were as follows for a 
group of students. 


Operation Research Grade 


Statistics Grade A B C Other 
A 25 6 17 13 
B 17 16 15 6 
C 18 4 18 10 
Other 10 8 11 20 


Are the grades in statistics and operations research related? 
Use a = 0.01 in reaching your conclusion. What is the 
P-value for this test? 

9-68. An experiment with artillery shells yields the follow- 
ing data on the characteristics of lateral deflections and 
ranges. Would you conclude that deflection and range are in- 
dependent? Use a = 0.05. What is the P-value for this test? 


Lateral Deflection 


Range (yards) Left Normal Right 
0—1,999 6 14 8 

2,000—5,999 9 11 4 

6,000-1 1,999 8 17 6 


9-69. A study is being made of the failures of an electronic 
component. There are four types of failures possible and two 
mounting positions for the device. The following data have 
been taken: 
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Failure Type 
Mounting Position A B C D 
1 22 46 18 9 
2 4 17 6 12 


Would you conclude that the type of failure is independent of the 
mounting position? Use a = 0.01. Find the P-value for this test. 
9-70. Arandom sample of students is asked their opinions on 
a proposed core curriculum change. The results are as follows. 


Opinion 
Class Favoring Opposing 
Freshman 120 80 
Sophomore 70 130 
Junior 60 70 
Senior 40 60 


Test the hypothesis that opinion on the change is independent of 
class standing. Use a = 0.05. What is the P-value for this test? 


Supplemental Exercises 


9-71. A manufacturer of semiconductor devices takes a ran- 
dom sample of size n of chips and tests them, classifying each 
chip as defective or nondefective. Let X; = 0 if the chip is non- 
defective and X; — 1 if the chip is defective. The sample frac- 
tion defective 1s 


n 


~ tht +X, 





i n 


What are the sampling distribution, the sample mean, and 

sample variance estimates of p when 

(a) The sample size is n — 50? 

(b) The sample size is n = 80? 

(c) The sample size is n — 100? 

(d) Compare your answers to parts (a)-(c) and comment on 
the effect of sample size on the variance of the sampling 
distribution. 


9-72. Consider the situation of Exercise 9-76. After collecting 
a sample, we are interested in testing Hy: p = 0.10 versus 
Hi: p # 0.10 with a = 0.05. For each of the following situa- 
tions, compute the p-value for this test: 

(a) n = 50, p = 0.095 


(b) n = 100, p = 0.095 
(c) n — 500, p — 0.095 
(d) n = 1000, p = 0.095 


(e) Comment on the effect of sample size on the observed 
P-value of the test. 
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9-73. An inspector of flow metering devices used to admin- 

ister fluid intravenously will perform a hypothesis test to 

determine whether the mean flow rate is different from the flow 

rate setting of 200 milliliters per hour. Based on prior 

information the standard deviation of the flow rate is assumed 

to be known and equal to 12 milliliters per hour. For each of the 

following sample sizes, and a fixed a = 0.05, find the probabil- 

ity of a type II error if the true mean is 205 milliliters per hour. 

(a) n = 20 

(b) n = 50 

(c) n = 100 

(d) Does the probability of a type II error increase or decrease 
as the sample size increases? Explain your answer. 

9-74. Suppose that in Exercise 9-73, the experimenter had 

believed that o = 14. For each of the following sample sizes, 

and a fixed a = 0.05, find the probability of a type II error if 

the true mean is 205 milliliters per hour. 


(a) n = 20 
(b) n = 50 
(c) n = 100 


(d) Comparing your answers to those in Exercise 9-73, does 
the probability of a type II error increase or decrease with 
the increase in standard deviation? Explain your answer. 


9-75. The marketers of shampoo products know that cus- 

tomers like their product to have a lot of foam. A manufacturer 

of shampoo claims that the foam height of his product exceeds 

200 millimeters. It is known from prior experience that the 

standard deviation of foam height is 8 millimeters. For each of 

the following sample sizes, and a fixed a = 0.05, find the 

power of the test if the true mean is 204 millimeters. 

(a) n = 20 

(b) n = 50 

(c) n — 100 

(d) Does the power of the test increase or decrease as the sam- 
ple size increases? Explain your answer. 


9-76. Suppose we wish to test the hypothesis Hy: p. = 85 
versus the alternative Hj: u > 85 where o = 16. Suppose that 
the true mean is p = 86 and that in the practical context of the 
problem this is not a departure from p = 85 that has practical 
significance. 

(a) For a test with a = 0.01, compute B for the sample sizes 
n = 25,100, 400, and 2500 assuming that u = 86. 

(b) Suppose the sample average is x = 86. Find the P-value 
for the test statistic for the different sample sizes speci- 
fied in part (a). Would the data be statistically significant 
ata = 0.01? 

(c) Comment on the use of a large sample size in this problem. 


9-77. The cooling system in a nuclear submarine consists of 

an assembly of welded pipes through which a coolant is circu- 

lated. Specifications require that weld strength must meet or 

exceed 150 psi. 

(a) Suppose that the design engineers decide to test the 
hypothesis Hy: w~ = 150 versus H,: p > 150. Explain 
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why this choice of alternative hypothesis is better than 
Hi: w < 150. 

(b) A random sample of 20 welds results in x = 153.7 psi and 
s = 11.3 psi. What conclusions can you draw about the 
hypothesis in part (a)? State any necessary assumptions 
about the underlying distribution of the data. 


9-78. Suppose we are testing Hy: p = 0.5 versus Hy: p # 0.5. 

Suppose that p is the true value of the population proportion. 

(a) Using a = 0.05, find the power of the test for n = 100, 
150, and 300 assuming that p — 0.6. Comment on the 
effect of sample size on the power of the test. 

(b) Using a = 0.01, find the power of the test for n = 100, 
150, and 300 assuming that p — 0.6. Compare your an- 
swers to those from part (a) and comment on the effect of 
a on the power of the test for different sample sizes. 

(c) Using a = 0.05, find the power of the test for n = 100, as- 
suming p — 0.08. Compare your answer to part (a) and 
comment on the effect of the true value of p on the power 
of the test for the same sample size and a level. 

(d) Using a = 0.01, what sample size is required if p = 0.6 
and we want B = 0.05? What sample is required if 
p = 0.8 and we want B = 0.05? Compare the two sam- 
ple sizes and comment on the effect of the true value of 
p on sample size required when f is held approximately 
constant. 


9-79. Consider the television picture tube brightness exper- 

iment described in Exercise 8-24. 

(a) For the sample size n = 10, do the data support the 
claim that the standard deviation of current is less than 
20 microamps? 

(b) Suppose instead of n = 10, the sample size was 51. 
Repeat the analysis performed in part (a) using n = 51. 

(c) Compare your answers and comment on how sample size 
affects your conclusions drawn in parts (a) and (b). 


9-80. Consider the fatty acid measurements for the diet 

margarine described in Exercise 8-25. 

(a) For the sample size n — 6, using a two-sided alternative 
hypothesis and a = 0.01, test Hy: o? = 1.0. 

(b) Suppose instead of n = 6, the sample size was n = 51. 
Repeat the analysis performed in part (a) using n = 51. 

(c) Compare your answers and comment on how sample size 
affects your conclusions drawn in parts (a) and (b). 


9-81. A manufacturer of precision measuring instruments 

claims that the standard deviation in the use of the instruments 

is at most 0.00002 millimeter. An analyst, who is unaware of 
the claim, uses the instrument eight times and obtains a sam- 
ple standard deviation of 0.00001 millimeter. 

(a) Confirm using a test procedure and an a level of 0.01 that 
there is insufficient evidence to support the claim that the 
standard deviation of the instruments is at most 0.00002. 
State any necessary assumptions about the underlying dis- 
tribution of the data. 


(b) Explain why the sample standard deviation, s — 0.00001, 
is less than 0.00002, yet the statistical test procedure re- 
sults do not support the claim. 


9-82. A biotechnology company produces a therapeutic 
drug whose concentration has a standard deviation of 4 grams 
per liter. A new method of producing this drug has been pro- 
posed, although some additional cost is involved. Management 
will authorize a change in production technique only if the 
standard deviation of the concentration in the new process is 
less than 4 grams per liter. The researchers chose n = 10 and 
obtained the following data in grams per liter. Perform the nec- 
essary analysis to determine whether a change in production 
technique should be implemented. 


16.628 16.630 
16.622 16.631 
16.627 16.624 
16.623 16.622 
16.618 16.626 


9-83. Consider the 40 observations collected on the number 
of nonconforming coil springs in production batches of size 
50 given in Exercise 6-79. 

(a) Based on the description of the random variable and these 
40 observations, is a binomial distribution an appropriate 
model? Perform a goodness-of-fit procedure with a = 0.05. 

(b) Calculate the P-value for this test. 


9-84. Consider the 20 observations collected on the number 
of errors in a string of 1000 bits of a communication channel 
given in Exercise 6-80. 

(a) Based on the description of the random variable and these 
20 observations, is a binomial distribution an appropriate 
model? Perform a goodness-of-fit procedure with a = 0.05. 

(b) Calculate the P-value for this test. 


9-85. Consider the spot weld shear strength data in Exercise 
6-23. Does the normal distribution seem to be a reasonable 
model for these data? Perform an appropriate goodness-of-fit 
test to answer this question. 


9-86. Consider the water quality data in Exercise 6-24. 

(a) Do these data support the claim that mean concentration 
of suspended solids does not exceed 50 parts per million? 
Use a = 0.05. 

(b) What is the P-value for the test in part (a)? 

(c) Does the normal distribution seem to be a reasonable 
model for these data? Perform an appropriate goodness- 
of-fit test to answer this question. 


9-87. Consider the golf ball overall distance data in 

Exercise 6-25. 

(a) Do these data support the claim that the mean overall dis- 
tance for this brand of ball does not exceed 270 yards? 
Use a = 0.05. 

(b) What is the P-value for the test in part (a)? 
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(c) Do these data appear to be well modeled by a normal dis- 
tribution? Use a formal goodness-of-fit test in answering 
this question. 


9-88. Consider the baseball coefficient of restitution data 

in Exercise 8-79. If the mean coefficient of restitution ex- 

ceeds 0.635, the population of balls from which the sample 
has been taken will be too “lively” and considered unaccept- 
able for play. 

(a) Formulate an appropriate hypothesis testing procedure to 
answer this question. 

(b) Test these hypotheses using the data in Exercise 8-79 and 
draw conclusions, using a = 0.01. 

(c) Find the P-value for this test. 

(d) In Exercise 8-79(b), you found a 99% confidence interval 
on the mean coefficient of restitution. Does this interval, 
or a one-sided CI, provide additional useful information to 
the decision maker? Explain why or why not. 


9-89. Consider the dissolved oxygen data in Exercise 8-81. 
Water quality engineers are interested in knowing whether 
these data support a claim that mean dissolved oxygen con- 
centration is 2.5 milligrams per liter. 

(a) Formulate an appropriate hypothesis testing procedure to 
investigate this claim. 

(b) Test these hypotheses, using a = 0.05, and the data from 
Exercise 8-81. 

(c) Find the P-value for this test. 

(d) In Exercise 8-81(b) you found a 95% CI on the mean dis- 
solved oxygen concentration. Does this interval provide 
useful additional information beyond that of the hypothe- 
sis testing results? Explain your answer. 


9-90. The mean pull-off force of an adhesive used in man- 

ufacturing a connector for an automotive engine application 

should be at least 75 pounds. This adhesive will be used un- 
less there is strong evidence that the pull-off force does not 
meet this requirement. A test of an appropriate hypothesis is 

to be conducted with sample size » = 10 and a = 0.05. 

Assume that the pull-off force is normally distributed, and o 

is not known. 

(a) If the true standard deviation is o = 1, what is the risk that 
the adhesive will be judged acceptable when the true mean 
pull-off force is only 73 pounds? Only 72 pounds? 

(b) What sample size is required to give a 90% chance of 
detecting that the true mean is only 72 pounds when o — 1? 

(c) Rework parts (a) and (b) assuming that o = 2. How much 
impact does increasing the value of o have on the answers 
you obtain? 
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CHAPTER9 TESTS OF HYPOTHESES FORA SINGLE SAMPLE 


MIND-EXPANDING EXERCISES 


9-91. Suppose that we wish to test Ho: ju = pọ versus 

Hı: u # Bo, where the population is normal with 

known c. Let 0 < e < a, and define the critical region 

so that we will reject Hi il zo -~ Ze O if a ez 

where z, is the value of the usual test statistic for these 

hypotheses. 

(a) Show that the probability of type I error for this test 
is Q. 

(b) Suppose that the true mean is pj, = [ug + ò. Derive 
an expression for B for the above test. 


9-92. Derive an expression for B for the test on the 
variance of a normal distribution. Assume that the two- 
sided alternative is specified. 


9-93. When X,, X, ... , X, are independent Poisson 
random variables, each with parameter A, and n is large, 
the sample mean X has an approximate normal distribu- 
tion with mean X and variance A/n. Therefore, 


ES 


á VA/n 


has approximately a standard normal distribution. Thus 
we can test Ho: X = X, by replacing A in Z by Ay. When X; 
are Poisson variables, this test is preferable to the large- 
sample test of Section 9-2.5, which would use S/ V/n in 
the denominator, because it is designed just for the 
Poisson distribution. Suppose that the number of open cir- 
cuits on a semiconductor wafer has a Poisson distribution. 
Test data for 500 wafers indicate a total of 1038 opens. 
Using a = 0.05, does this suggest that the mean number 
of open circuits per wafer exceeds 2.0? 


IMPORTANT TERMS AND CONCEPTS 


In the E-book, click on any 
term or concept below to 
go to that subject. 

Connection between 


Null hypothesis 

One- and two-sided 
alternative hypotheses 

Operating characteristic 


hypothesis tests curves 
and confidence Power of the test 
intervals P-value 

Critical region for a test Reference distribution 
statistic for a test statistic 


9-94. When X., 5,..., X, is a random sample from a 
normal distribution and n is large, the sample standard 
deviation has approximately a normal distribution with 
mean c and variance o*/(2n). Therefore, a large-sample 
test for Hy: © = o, can be based on the statistic 


SR c 


V o$/ 2n) 





Use this result to test Hp: o = 10 versus H,: o < 10 for 
the golf ball overall distance data in Exercise 6-25. 


9-95, Continuation of Exercise 9-94. Using the 
results of the previous exercise, find an approximately 
unbiased estimator of the 95 percentile 0 = u + 1.6450. 
From the fact that X and S are independent random 
variables, find the standard error of 8. How would you 
estimate the standard error? 


9-96. Continuation of Exercises 9-94 and 9-95. 
Consider the golf ball overall distance data in Exercise 
6-25. We wish to investigate a claim that the 95 per- 
centile of overall distance does not exceed 285 yards. 
Construct a test statistic that can be used for testing the 
appropriate hypotheses. Apply this procedure to the data 
from Exercise 6-25. What are your conclusions? 


9-97. Let X,, X>,...,X, be a sample from an exponen- 
tial distribution with parameter à. It can be shown that 
2A y ;-, X; has a chi-square distribution with 2n degrees 
of freedom. Use this fact to devise a test statistic and 
critical region for Hy: N = A, versus the three usual 
alternatives. 


Sample size determina- 
tion for hypothesis 


tests Test statistic 
Significance level of a Type I and type II 

test errors 
Statistical hypotheses CD MATERIAL 


Statistical versus practi- 
cal significance 
Test for goodness of fit 


Likelihood ratio test 


Test for homogeneity 
Test for independence 


9.3.4 Likelihood Ratio Approach to Development of Test Procedures 


(CD Only) 


Hypothesis testing is one of the most important techniques of statistical inference. Throughout 
this book we present many applications of hypothesis testing. While we have emphasized a 
heuristic development, many of these hypothesis-testing procedures can be developed using a 
general principle called the likelihood ratio principle. Tests developed by this method often 
turn out to be "best" test procedures in the sense that they minimize the type II error probabil- 
ity B among all tests that have the same type I error probability a. 

The likelihood ratio principle is easy to illustrate. Suppose that the random variable X has 
a probability distribution that is described by an unknown parameter 0, say, f(x, 0). We wish 
to test the hypothesis Hp: 0 is in O versus H,: 0 is in Q,, where and OQ, are disjoint sets of 
values (such as Hp: w = O versus H,: p < 0). Let Xj, X, ..., X, be the observations in a ran- 
dom sample. The joint distribution of these sample observations is 


J i esde 0) — T 00/8 "eT 00,9] exec TB) 


Recall from our discussion of maximum likelihood estimation in Chapter 7 that the likeli- 
hood function, say (0), is just this joint distribution considered as a function of the parameter 
0. The likelihood ratio principle for test construction consists of the following steps: 


1. Find the largest value of the likelihood for any 0 in Qy. This is done by finding the 
maximum likelihood estimator of 0 restricted to values within Qg and by substituting 
this value of 0 back into the likelihood function. This results in a value of the likeli- 
hood function that we will call L(Qọ). 


2. Find the largest value of the likelihood for any 0 in . Call this the value of the like- 
lihood function L(,). 


3. Form the ratio 





This ratio À is called the likelihood ratio test statistic. 

The test procedure calls for rejecting the null hypothesis Ho when the value of this ratio X 
is small, say, whenever A < k, where k is a constant. Thus, the likelihood ratio principle re- 
quires rejecting Hy when L(Q,,) is much larger than L(Q)), which would indicate that the sam- 
ple data are more compatible with the alternative hypothesis H, than with the null hypothesis 
Hp. Usually, the constant k would be selected to give a specified value for a, the type I error 
probability. 

These ideas can be illustrated by a hypothesis-testing problem that we have studied 
before—that of testing whether the mean of a normal population has a specified value uo. 
This is the one-sample t-test of Section 9-3. Suppose that we have a sample of n observations 
from a normal population with unknown mean p and unknown variance o°, say, X}, X5, ... , X;. 
We wish to test the hypothesis Hp: ju = wo versus H,: y Æ pọ. The likelihood function of the 
sample is 


9-] 


9-2 


and the values of Q, and Q, are Qo = oand O, = (p: — < u < ©}, respectively. The values 
of u and o? that maximize L in OQ, are the usual maximum likelihood estimates for u and o: 


1 n 


A 


(2m/n) X (x, — xy 


To maximize L in Qy we simply set u = ju and then find the value of o? that maximizes L. 
This value is found to be 


which gives 


The likelihood ratio 1s 





Now since 


we may write X as 
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9-3 


(x — Lo)” 
s^/n 
t-distribution with n — 1 degrees of freedom when the null hypothesis Hp: ju. = p is true. So 

we may write the value of the likelihood ratio X as 


Notice that | | = (^ is the square of the value of a random variable that has the 


n/2 
1 


= TF [Pn 1) 


It is easy to find the value for the constant k that would lead to rejection of the null hypothe- 
sis Hy. Since we reject Ho if X < k, this implies that small values of X support the alternative 
hypothesis. Clearly, X will be small when 1? is large. So instead of specifying k we can spec- 
ify a constant c and reject Hy: p = pọ if £^ > c. The critical values of t would be the extreme 
values, either positive or negative, and if we wish to control the type I error probability at a, 
the critical region in terms of t would be 


t < —laj2an-1 and t > Lo /2.n-1 


or, equivalently, we would reject Hy: p = po if £7 c = f nl Therefore, the likelihood 
ratio test for Hp: ju = po versus H,: y pois the familiar single-sample t-test. 

The procedure employed in this example to find the critical region for the likelihood ratio 
X is used often. That is, typically, we can manipulate A to produce a condition that is equiva- 
lent to X < k, but one that is simpler to use. 

The likelihood ratio principle is a very general procedure. Most of the tests presented in 
this book that utilize the ¢, chi-square, and F-distributions for testing means and variances of 
normal distributions are likelihood ratio tests. The principle can also be used in cases where 
the observations are dependent, or even in cases where their distributions are different. 
However, the likelihood function can be very complicated in some of these situations. To use 
the likelihood principle we must specify the form of the distribution. Without such a specifi- 
cation, it is impossible to write the likelihood function, and so if we are unwilling to assume a 
particular probability distribution, the likelihood ratio principle cannot be used. This could 
lead to the use of the nonparametric test procedures discussed in Chapter 15. 


9.5.2 Small-Sample Tests on a Proportion (CD Only) 


Tests on a proportion when the sample size n is small are based on the binomial distribution, 
not the normal approximation to the binomial. To illustrate, suppose we wish to test 


Ho: P = po 
Hy: p < po 
Let X be the number of successes in the sample. A lower-tail rejection region would be used. 


That is, we would reject H, if x S c, where c is the critical value. When Họ is true, X has a 
binomial distribution with parameters n and po; therefore, 


P(Type I error) = P(reject Hy when Hy is true) 
= P|X xc when Xis Bin(n, po)| 
B 


9.4 


where B(c; n, po) is the cumulative binomial distribution. To find the critical value for a given 
a, we would select the largest c satisfying B(c; ni, Po) - a. The type II error calculation is 
straightforward. Let p, be an alternative value of p, with p, < po. If p = pi, X is Bin (n, pj). 
Therefore 


B = P(Type II error when p = pi) 
= P|X > c when X is Bin (n, pı) | 
= 1 — B(c; n, pi) 


where B(c; n, p,) is the cumulative binomial distribution. 

Test procedures for the other one-sided alternative H,: p > po and the two-sided alternative 
Hy: p £ pọ are constructed in a similar fashion. For H,: p > po the critical region has the form 
x = c, where we would choose the smallest value of c satisfying 1 — B(c — 1, n, po) S a. For the 
two-sided case, the critical region consists of both large and small values. Because c is an integer, 
it usually isn’t possible to define the critical region to obtain exactly the desired value of a. 

To illustrate the procedure, let's reconsider the situation of Example 9-10, where we wish 
to test Hp: p = 0.05 verses H,: p < 0.05. Suppose now that the sample size is n = 100 and 
we wish to use a = 0.05. Now from the cumulative binomial distribution with n = 50 and 
p = 0.05, we find that B(0; 100, 0.05) = 0.0059, B(1; 100, 0.05) = 0.0371, and B(2; 100, 
0.05) = 0.1183 (Minitab will generate these cumulative binomial probabilities). Since B(1; 
100, 0.05) = 0.0371 = 0.05 and 5B(2; 100, 0.05) = 0.1183 > 0.05, we would select c = 1. 
Therefore the null hypothesis will be rejected if x = 1. The exact significance level for this 
test is a = 0.0371. To calculate the power of the test, suppose that p, = 0.03. Now 


pel py) 

= ] — B(1; 100, 0.03) 
] — 0.1946 
— 0.8054 


| 


and the power of the test is only 0.1946. This is a fairly small power because p, is close to po. 
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10-6.3 B-Error and Choice of Sample Size 
10-6.4 Confidence Interval for p, — p; 


SUMMARY TABLE FOR INFERENCE 
PROCEDURES FOR TWO SAMPLES 


After careful study of this chapter, you should be able to do the following: 


1. Structure comparative experiments involving two samples as hypothesis tests 


2. Test hypotheses and construct confidence intervals on the difference in means of two normal 
distributions 
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3. Test hypotheses and construct confidence intervals on the ratio of the variances or standard 
deviations of two normal distributions 

4. Test hypotheses and construct confidence intervals on the difference in two population proportions 

5. Use the P-value approach for making decisions in hypotheses tests 

6. Compute power, type II error probability, and make sample size decisions for two-sample tests on 
means, variances, and proportions 


7. Explain and use the relationship between confidence intervals and hypothesis tests 


CD MATERIAL 


8. Use the Fisher-Irwin test to compare two population proportions when the normal approxima- 
tion to the binomial distribution does not apply 


Answers for many odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e- Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e- Text immediately following the section they accompany. 


10-1 INTRODUCTION 


The previous chapter presented hypothesis tests and confidence intervals for a single popula- 
tion parameter (the mean p, the variance o°, or a proportion p). This chapter extends those 
results to the case of two independent populations. ; 

The general situation is shown in Fig. 10-1. Population 1 has mean M1 and variance 9 1, 
while population 2 has mean M2 and variance 92. Inferences will be based on two random 
samples of sizes nl and n2, respectively. That is, X11, X12, p, Xin, is a random sample of nl 
observations from population 1, and X21, X22,p, Jn is a random sample of n2 observations 
from population 2. Most of the practical applications of the procedures in this chapter arise in 
the context of simple comparative experiments in which the objective is to study the differ- 
ence in the parameters of the two populations. 


10-2 INFERENCE FOR A DIFFERENCE IN MEANS OF TWO 
NORMAL DISTRIBUTIONS, VARIANCES KNOWN 


Figure 10-1 Two 
independent popula- 
tions. 


In this section we consider statistical inferences on the difference in means u; — p of two 
normal distributions, where the variances o 1 and ob are known. The assumptions for this sec- 
tion are summarized as follows. 


Population 1 Population 2 
2 2 
g O2 
Sample 1: Sample 2: 


Xil Xizr- Xini X21, X227.: Xon, 
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Assumptions 


Xy, Xiz ..., Xin, 18 a random sample from population 1. 


X51, X55, ..., Xon, 18 a random sample from population 2. 
The two populations represented by X, and X, are independent. 


Both populations are normal. 





A logical point estimator of u, — p; is the difference in sample means X, — X>. Based 
on the properties of expected values 


and the variance of X, — X» is 


2 co £wesluyno0i,e 
VX 2 X>) = V(Xi) T V(X;) B Hi Mm 


Based on the assumptions and the preceding results, we may state the following. 


The quantity 





has a N(0, 1) distribution. 





This result will be used to form tests of hypotheses and confidence intervals on p; — po. 
Essentially, we may think of p, — ji as a parameter 0, and its estimator is © = X, — X, 
with variance af = oi/n, + o$/n,. If 0, is the null hypothesis value specified for 0, the test 


statistic will be (© — 05)/o6 . Notice how similar this is to the test statistic for a single mean 
used in Equation 9-8 of Chapter 9. 


10-2.1 Hypothesis Tests for a Difference in Means, Variances Known 


We now consider hypothesis testing on the difference in the means p, — p; of two normal 
populations. Suppose that we are interested in testing that the difference in means p, — p; is 
equal to a specified value Ay. Thus, the null hypothesis will be stated as Hy: pj — p = Apo. 
Obviously, in many cases, we will specify A, = 0 so that we are testing the equality of two 
means (1.e., Hp: y; = u2). The appropriate test statistic would be found by replacing p, — p; 
in Equation 10-1 by A,, and this test statistic would have a standard normal distribution under 
Ho. That is, the standard normal distribution is the reference distribution for the test statistic. 
Suppose that the alternative hypothesis is H,: uj, — p- ~ Ap. Now, a sample value of x, — x, 
that is considerably different from A, is evidence that H, is true. Because Z, has the N(0, 1) 
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EXAMPLE 10-1 


distribution when Hj, is true, we would take —z,/ and z,/? as the boundaries of the critical re- 
gion just as we did in the single-sample hypothesis-testing problem of Section 9-2.1. This 
would give a test with level of significance a. Critical regions for the one-sided alternatives 
would be located similarly. Formally, we summarize these results below. 


Alternative Hypotheses Rejection Criterion 


Hi: p — M2 t Ap Zo > 2Zy/2 OF Zo < —Za/2 
Hyg — m > Ao Z0 > Za 
Hyg — p < Ao Zg € —Zq 





A product developer is interested in reducing the drying time of a primer paint. Two formula- 
tions of the paint are tested; formulation 1 is the standard chemistry, and formulation 2 has a 
new drying ingredient that should reduce the drying time. From experience, it is known that 
the standard deviation of drying time is 8 minutes, and this inherent variability should be un- 
affected by the addition of the new ingredient. Ten specimens are painted with formulation 1, 
and another 10 specimens are painted with formulation 2; the 20 specimens are painted in 
random order. The two sample average drying times are x; = 121 minutes and x, = 112 
minutes, respectively. What conclusions can the product developer draw about the effective- 
ness of the new ingredient, using a = 0.05? 
We apply the eight-step procedure to this problem as follows: 


1. The quantity of interest is the difference in mean drying times, pu, — p», and Ay = 0. 
Ho: by — po = 0, or Ho: py = po. 

Hi: u; > pu». We want to reject Ho if the new ingredient reduces mean drying time. 
a — 0.05 


The test statistic 1s 


v ew oe opu 


where of = o2 = (8)? = 64 and n, = m = 10. 
6. Reject Ho: | = M2 if zo > 1.645 = Z0.05- 
7. Computations: Since x, = 121 minutes and x; = 112 minutes, the test statistic 1s 


121 — 112 
-—— 252 
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8. Conclusion: Since zọ = 2.52 > 1.645, we reject Hy: u; = p, at the a = 0.05 level 
and conclude that adding the new ingredient to the paint significantly reduces the 
drying time. Alternatively, we can find the P-value for this test as 


P-value = 1 — 42.52) = 0.0059 
Therefore, Hp: jj, = p would be rejected at any significance level a = 0.0059. 


When the population variances are unknown, the sample variances s? and s2 can be substituted 
into the test statistic Equation 10-2 to produce a large-sample test for the difference in means. 
This procedure will also work well when the populations are not necessarily normally distrib- 
uted. However, both n, and n, should exceed 40 for this large-sample test to be valid. 


10-2.2 Choice of Sample Size 


Use of Operating Characteristic Curves 

The operating characteristic curves in Appendix Charts VIa, VIb, VIc, and VId may be used 
to evaluate the type II error probability for the hypotheses in the display (10-2). These curves 
are also useful in determining sample size. Curves are provided for a = 0.05 and a = 0.01. 
For the two-sided alternative hypothesis, the abscissa scale of the operating characteristic 
curve in charts Vla and VIb is d, where 


[pog — ig A] = I^ — Aol 


Vot + o$ Vor o$ 


d = (10-3) 


and one must choose equal sample sizes, say, n = n, = ny. The one-sided alternative hypothe- 
ses require the use of Charts VIc and VId. For the one-sided alternatives H,: jj, — p> > Ag or 
Hi:p4 — W < A, the abscissa scale is also given by 


J= |wi — pa — Al _ I^ — Ao 


Vot + o$ Vor o$ 


It is not unusual to encounter problems where the costs of collecting data differ substantially 
between the two populations, or where one population variance 1s much greater than the other. 
In those cases, we often use unequal sample sizes. If n, # m, the operating characteristic curves 
may be entered with an equivalent value of n computed from 


E op +o — (10-4) 
o1/n -+ 05/n» 


If n, # m, and their values are fixed in advance, Equation 10-4 is used directly to calculate n, 
and the operating characteristic curves are entered with a specified d to obtain B. If we are 
given d and it is necessary to determine n, and n, to obtain a specified B, say, B*, we guess at 
trial values of n, and n,, calculate n in Equation 10-4, and enter the curves with the specified 
value of d to find B. If B = B*, the trial values of n, and n, are satisfactory. If B # B%*, 
adjustments to n, and n, are made and the process is repeated. 
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EXAMPLE 10-2 





Consider the paint drying time experiment from Example 10-1. If the true difference in mean 
drying times is as much as 10 minutes, find the sample sizes required to detect this difference 
with probability at least 0.90. 

The appropriate value of the abscissa parameter is (since Ay = 0, and A = 10) 


2 10 
cd = 0.88 


V o1 + o3 82 + 8? 








and since the detection probability or power of the test must be at least 0.9, with a = 0.05, we 
find from Appendix Chart VIc that n = n, = m = ll. 


Sample Size Formulas 

Itis also possible to obtain formulas for calculating the sample sizes directly. Suppose that the null 
hypothesis Hp: pı — po = A, is false and that the true difference in means is w; — p = A, 
where A > A). One may find formulas for the sample size required to obtain a specific value 
of the type II error probability B for a given difference in means A and level of significance a. 


For the two-sided alternative hypothesis with significance level a, the sample size 
n; = n, = n required to detect a true difference in means of A with power at least 
Tous 


(Zao + zg) (o1 + 3) 
f |^ (A-AY (10-5) 








This approximation is valid when b(—z,» — (A — Ao) Vn/ Vo? + o2) is small compared to B. 


For a one-sided alternative hypothesis with significance level a, the sample size 
n; = M, = n required to detect a true difference in means of A(7z£A,) with power 
at least 1 — Bus 


| (Za t zg) (07 + 04) 
(A= Aol 


(10-6) 


The derivation of Equations 10-5 and 10-6 closely follows the single-sample case in Section 
9-2.3. For example, to obtain Equation 10-6, we first write the expression for the B-error for 
the two-sided alternate, which 1s 


B= DY) a2 -= ez P| an- a 
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EXAMPLE 10-3 


where A is the true difference in means of interest. Then by following a procedure similar 
to that used to obtain Equation 9-17, the expression for B can be obtained for the case 
where n = n, = n. 


To illustrate the use of these sample size equations, consider the situation described in 
Example 10-1, and suppose that if the true difference in drying times is as much as 10 min- 
utes, we want to detect this with probability at least 0.90. Under the null hypothesis, Ay = 0. 
We have a one-sided alternative hypothesis with A = 10, a = 0.05 (so z, = Zoos = 1.645), 
and since the power is 0.9, B = 0.10 (so Zg = Zo 19 = 1.28). Therefore we may find the re- 
quired sample size from Equation 10-6 as follows: 


O (Za + zg)(oi- 05) (1.645 + 128Y[(8P + (8?] _ " 
5 (à-N (10 — 0)? 7 


This is exactly the same as the result obtained from using the O.C. curves. 


10-2.3 Identifying Cause and Effect 


Engineers and scientists are often interested in comparing two different conditions to deter- 
mine whether either condition produces a significant effect on the response that is observed. 
These conditions are sometimes called treatments. Example 10-1 illustrates such a situation; 
the two different treatments are the two paint formulations, and the response is the drying 
time. The purpose of the study is to determine whether the new formulation results in a 
significant effect—reducing drying time. In this situation, the product developer (the experi- 
menter) randomly assigned 10 test specimens to one formulation and 10 test specimens to the 
other formulation. Then the paints were applied to the test specimens in random order until all 
20 specimens were painted. This is an example of a completely randomized experiment. 

When statistical significance is observed in a randomized experiment, the experimenter can 
be confident in the conclusion that it was the difference in treatments that resulted in the differ- 
ence in response. That is, we can be confident that a cause-and-effect relationship has been found. 

Sometimes the objects to be used in the comparison are not assigned at random to the 
treatments. For example, the September 1992 issue of Circulation (a medical journal pub- 
lished by the American Heart Association) reports a study linking high iron levels in the body 
with increased risk of heart attack. The study, done in Finland, tracked 1931 men for five years 
and showed a statistically significant effect of increasing iron levels on the incidence of heart 
attacks. In this study, the comparison was not performed by randomly selecting a sample of 
men and then assigning some to a “low iron level" treatment and the others to a “high iron 
level" treatment. The researchers just tracked the subjects over time. Recall from Chapter 1 
that this type of study is called an observational study. 

It is difficult to identify causality in observational studies, because the observed statisti- 
cally significant difference in response between the two groups may be due to some other 
underlying factor (or group of factors) that was not equalized by randomization and not due to 
the treatments. For example, the difference in heart attack risk could be attributable to the dif- 
ference in iron levels, or to other underlying factors that form a reasonable explanation for the 
observed results—such as cholesterol levels or hypertension. 

The difficulty of establishing causality from observational studies is also seen in the 
smoking and health controversy. Numerous studies show that the incidence of lung cancer and 
other respiratory disorders is higher among smokers than nonsmokers. However, establishing 
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cause and effect here has proven enormously difficult. Many individuals had decided to 
smoke long before the start of the research studies, and many factors other than smoking could 
have a role in contracting lung cancer. 


10-2.4 Confidence Interval on a Difference in Means, 
Variances Known 


Definition 


The 100(1 — a)% confidence interval on the difference in two means p, — p» when the vari- 
ances are known can be found directly from results given previously in this section. Recall 
that X11; X12, ..., Xin, 1s a random sample ofn, observations from the first population and X;,, 
AX», ..., Xon, 18a random sample ofn, observations from the second population. The difference 
in sample means X, — X; is a point estimator of u; — po, and 


Z = 2 2 
01 05 
Mc -- Mp 
N] n 


has a standard normal distribution if the two populations are normal or is approximately stan- 
dard normal if the conditions of the central limit theorem apply, respectively. This implies that 
P(—zyy) sS Ze Zaj2) =] — Q, Or 


P| An T3 —3 Sala 
91,92 
ni n» 


This can be rearranged as 


ES = 01 o$ = = oF SE 
Pp. = Ag = ay» n, n, © M T Bo S4 Ad t Za m m =] ox 


Therefore, the 100(1 — a)% confidence interval for p; — p 1s defined as follows. 


If x and x, are the means of independent random samples of sizes n, and n, from 
two independent normal populations with known variances o1 and o5, respectively, 
a 100(1 — a)% confidence interval for p; — p; is 


gj op 


S p T M SX — %2 E Zant n, T w (10-7) 


where z, /; is the upper a/2 percentage point of the standard normal distribution. 





The confidence level 1 — a is exact when the populations are normal. For nonnormal popu- 
lations, the confidence level is approximately valid for large sample sizes. 


10-2 INFERENCE FOR A DIFFERENCE IN MEANS OF TWO NORMAL DISTRIBUTIONS, VARIANCES KNOWN 335 


EXAMPLE 10-4 


Tensile strength tests were performed on two different grades of aluminum spars used in 
manufacturing the wing of a commercial transport aircraft. From past experience with the spar 
manufacturing process and the testing procedure, the standard deviations of tensile strengths 
are assumed to be known. The data obtained are as follows: n, = 10, x, = 87.6, o, = 1, 
n, = 12, x; = 74.5, and o; = 1.5. If u; and p, denote the true mean tensile strengths for the 
two grades of spars, we may find a 90% confidence interval on the difference in mean strength 
IL; — p» as follows: 








- Of | 0j MEE ol o 
AY 79. Faye i dH» Spy — Wg =X — Xt Zan nm nm 
87.6 — 74.5 — 1.645 (Y Es E < = 8976 — 74.5 + 1.645 (1°) 4 (L5) 
10 LE CI i . 10 D 


Therefore, the 9096 confidence interval on the difference in mean tensile strength (in kilo- 
grams per square millimeter) is 


12.22 € pw, — p» S 13.98 (in kilograms per square millimeter) 


Notice that the confidence interval does not include zero, implying that the mean 
strength of aluminum grade 1 (p) exceeds the mean strength of aluminum grade 2 (2). In 
fact, we can state that we are 90% confident that the mean tensile strength of aluminum 
grade 1 exceeds that of aluminum grade 2 by between 12.22 and 13.98 kilograms per 
square millimeter. 


Choice of Sample Size 

If the standard deviations o, and o» are known (at least approximately) and the two sample 
sizes n, and n, are equal (n, = n; = n, say), we can determine the sample size required so that 
the error in estimating w; — po by x, — x5 will be less than E at 100(1 — a)% confidence. The 
required sample size from each population is 





Remember to round up if 7 is not an integer. This will ensure that the level of confidence does 
not drop below 100(1 — a)%. 


One-Sided Confidence Bounds 
One-sided confidence bounds on p, — p, may also be obtained. A 100(1 — a)% upper- 
confidence bound on p, — p is 
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and a 100(1 — a)% lower-confidence bound is 





EXERCISES FOR SECTION 10-2 


10-1. Two machines are used for filling plastic bottles with 
a net volume of 16.0 ounces. The fill volume can be assumed 
normal, with standard deviation o, = 0.020 and o; = 0.025 
ounces. A member of the quality engineering staff suspects 
that both machines fill to the same mean net volume, whether 
or not this volume is 16.0 ounces. A random sample of 10 bot- 
tles is taken from the output of each machine. 


Machine 1 Machine 2 
16.03 16.01 16.02 16.03 
16.04 15.96 15.97 16.04 
16.05 15.98 15.96 16.02 
16.05 16.02 16.01 16.01 
16.02 15.99 15.99 16.00 


(a) Do you think the engineer is correct? Use a = 0.05. 

(b) What is the P-value for this test? 

(c) What is the power of the test in part (a) for a true differ- 
ence in means of 0.04? 

(d) Find a 95% confidence interval on the difference in 
means. Provide a practical interpretation of this interval. 

(e) Assuming equal sample sizes, what sample size should be 
used to assure that B = 0.05 if the true difference in 
means is 0.04? Assume that a = 0.05. 


10-2. Two types of plastic are suitable for use by an elec- 
tronics component manufacturer. The breaking strength of this 
plastic is important. It is known that o, = o, = 1.0 psi. From 
a random sample of size n, = 10 and n, = 12, we obtain 
x, = 162.5 and x; = 155.0. The company will not adopt plas- 
tic 1 unless its mean breaking strength exceeds that of plastic 
2 by at least 10 psi. Based on the sample information, should 
it use plastic 1? Use a = 0.05 in reaching a decision. 


10-3. Reconsider the situation in Exercise 10-2. Suppose 
that the true difference in means is really 12 psi. Find the 
power of the test assuming that a = 0.05. If it is really impor- 
tant to detect this difference, are the sample sizes employed in 
Exercise 10-2 adequate, in your opinion? 

10-4. The burning rates of two different solid-fuel propel- 
lants used in aircrew escape systems are being studied. It is 
known that both propellants have approximately the same 
standard deviation of burning rate; that is o, = o; = 3 
centimeters per second. Two random samples of n, — 20 





(10-10) 


and n, = 20 specimens are tested; the sample mean burn- 

ing rates are x; = 18 centimeters per second and x, = 24 

centimeters per second. 

(a) Test the hypothesis that both propellants have the same 
mean burning rate. Use a = 0.05. 

(b) What is the P-value of the test in part (a)? 

(c) What is the B-error of the test in part (a) if the true differ- 
ence in mean burning rate is 2.5 centimeters per second? 

(d) Construct a 95% confidence interval on the difference in 
means p, — >. What is the practical meaning of this 
interval? 


10-5. Two machines are used to fill plastic bottles with 
dishwashing detergent. The standard deviations of fill volume 
are known to be o, = 0.10 fluid ounces and o; = 0.15 fluid 
ounces for the two machines, respectively. Two random sam- 
ples of n, = 12 bottles from machine 1 and n, = 10 bottles 
from machine 2 are selected, and the sample mean fill vol- 
umes are x, = 30.87 fluid ounces and x, = 30.68 fluid 
ounces. Assume normality. 

(a) Construct a 90% two-sided confidence interval on the 
mean difference in fill volume. Interpret this interval. 

(b) Construct a 9596 two-sided confidence interval on the mean 
difference in fill volume. Compare and comment on the 
width of this interval to the width of the interval in part (a). 

(c) Construct a 95% upper-confidence interval on the mean 
difference in fill volume. Interpret this interval. 


10-6. Reconsider the situation described in Exercise 10-5. 

(a) Test the hypothesis that both machines fill to the same 
mean volume. Use a = 0.05. 

(b) What is the P-value of the test in part (a)? 

(c) If the B-error of the test when the true difference in fill 
volume 1s 0.2 fluid ounces should not exceed 0.1, what 
sample sizes must be used? Use a = 0.05. 


10-7. Two different formulations of an oxygenated motor fuel 

are being tested to study their road octane numbers. The variance 

of road octane number for formulation 1 is o1 = 1.5, and for 

formulation 2 it is oł = 1.2. Two random samples of size n, = 15 

and n, = 20 are tested, and the mean road octane numbers 

observed are x, = 89.6 and x, = 92.5. Assume normality. 

(a) Construct a 95% two-sided confidence interval on the 
difference in mean road octane number. 

(b) If formulation 2 produces a higher road octane number 
than formulation 1, the manufacturer would like to detect 
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it. Formulate and test an appropriate hypothesis, using 
a = 0.05. 
(c) What is the P-value for the test you conducted in part (b)? 


10-8. Consider the situation described in Exercise 10-4. What 
sample size would be required in each population if we wanted 
the error in estimating the difference in mean burning rates to be 
less than 4 centimeters per second with 9995 confidence? 


10-9. Consider the road octane test situation described in 
Exercise 10-7. What sample size would be required in each pop- 
ulation if we wanted to be 95% confident that the error in esti- 
mating the difference in mean road octane number is less than 1? 


10-10. A polymer is manufactured in a batch chemical 
process. Viscosity measurements are normally made on each 
batch, and long experience with the process has indicated that 
the variability in the process is fairly stable with o = 20. 
Fifteen batch viscosity measurements are given as follows: 
724, 718, 776, 760, 745, 759, 795, 756, 742, 740, 761, 749, 
739, 747, 742. A process change is made which involves 
switching the type of catalyst used in the process. Following 
the process change, eight batch viscosity measurements are 
taken: 735, 775, 729, 755, 783, 760, 738, 780. Assume that 
process variability is unaffected by the catalyst change. Find a 
90% confidence interval on the difference in mean batch vis- 
cosity resulting from the process change. 


10-11. The concentration of active ingredient in a liquid 
laundry detergent is thought to be affected by the type of cata- 
lyst used in the process. The standard deviation of active con- 
centration is known to be 3 grams per liter, regardless of the 
catalyst type. Ten observations on concentration are taken 
with each catalyst, and the data follow: 


Catalyst 1: 57.9, 66.2, 65.4, 65.4, 65.2, 62.6, 67.6, 63.7, 
67.2, 71.0 

Catalyst 2: 66.4, 71.7, 70.3, 69.3, 64.8, 69.6, 68.6, 69.4, 65.3, 
68.8 


(a) Find a 95% confidence interval on the difference in mean 
active concentrations for the two catalysts. 

(b) Is there any evidence to indicate that the mean active con- 
centrations depend on the choice of catalyst? Base your 
answer on the results of part (a). 


10-12. Consider the polymer batch viscosity data in 

Exercise 10-10. If the difference in mean batch viscosity is 

10 or less, the manufacturer would like to detect it with a 

high probability. 

(a) Formulate and test an appropriate hypothesis using a = 
0.10. What are your conclusions? 

(b) Calculate the P-value for this test. 

(c) Compare the results of parts (a) and (b) to the length of the 
90% confidence interval obtained in Exercise 10-10 and 
discuss your findings. 


10-13. For the laundry detergent problem in Exercise 10-11, 
test the hypothesis that the mean active concentrations are the 
same for both types of catalyst. Use a = 0.05. What is the 
P-value for this test? Compare your answer to that found in 
part (b) of Exercise 10-11, and comment on why they are the 
same or different. 


10-14. Reconsider the laundry detergent problem in 
Exercise 10-11. Suppose that the true mean difference in ac- 
tive concentration is 5 grams per liter. What is the power of the 
test to detect this difference if a = 0.05? If this difference is 
really important, do you consider the sample sizes used by the 
experimenter to be adequate? 

10-15. Consider the polymer viscosity data in Exercise 10- 
10. Does the assumption of normality seem reasonable for 
both samples? 

10-16. Consider the concentration data in Exercise 10-11. 
Does the assumption of normality seem reasonable? 


10-3 INFERENCE FOR THE DIFFERENCE IN MEANS OF TWO 
NORMAL DISTRIBUTIONS, VARIANCES UNKNOWN 


We now extend the results of the previous section to the difference in means of the two distribu- 
tions in Fig. 10-1 when the variances of both distributions o1 and o5 are unknown. If the sam- 
ple sizes n, and n, exceed 40, the normal distribution procedures in Section 10-2 could be used. 
However, when small samples are taken, we will assume that the populations are normally dis- 
tributed and base our hypotheses tests and confidence intervals on the f distribution. This nicely 
parallels the case of inference on the mean of a single sample with unknown variance. 


10-3.1 Hypotheses Tests for a Difference in Means, Variances Unknown 


We now consider tests of hypotheses on the difference in means p, — p, of two normal 
distributions where the variances o1 and o5 are unknown. A t-statistic will be used to test these 
hypotheses. As noted above and in Section 9-3, the normality assumption is required to 
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develop the test procedure, but moderate departures from normality do not adversely affect 
the procedure. Two different situations must be treated. In the first case, we assume that the 
variances of the two normal distributions are unknown but equal; that is, o? = o2 = o?. In the 
second, we assume that o? and o2 are unknown and not necessarily equal. 

Case 1: o? = 02 = o° 

Suppose we have two independent normal populations with unknown means u; and po, and 


unknown but equal variances, o? = o? = o?. We wish to test 


Ho: pı — pa = Ag 


Ay: wy — ga # Ao (10-11) 
Let Xj, Xi», ... , Xin, be a random sample of n, observations from the first population and 
Xy1, X5,..., Xon, be a random sample of n, observations from the second population. 


Let Xi, X5. S2, and S$ be the sample means and sample variances, respectively. Now the ex- 


pected value of the difference in sample means X; — X, is E(X, — X;) = pa — Mo, SO 
X, — X, is an unbiased estimator of the difference in means. The variance of X, — X» is 


= = | oO oo [1 l 


It seems reasonable to combine the two sample variances $1 and S$ to form an estimator 
of o°. The pooled estimator of o? is defined as follows. 


The pooled estimator of o^, denoted by 57, is defined by 


(m — DSt + (m - 0S 
Ny a a 2 


S} = (10-12) 





It is easy to see that the pooled estimator s; can be written as 


s2 = inre] 


54 
: L + —*+——_ 3 = wSt + (1— w)S3 


n^n —2. n + ny — 2 


where 0 < w < 1. Thus S% is a weighted average of the two sample variances S? and 55, 
where the weights w and 1 — w depend on the two sample sizes n, and n,. Obviously, ifn, = 
n, = n, w = 0.5 and S; is just the arithmetic average of S? and S$. If n, = 10 and n, = 20 
(say), w = 0.32 and 1 — w = 0.68. The first sample contributes n, — 1 degrees of freedom 
to S; and the second sample contributes n, — 1 degrees of freedom. Therefore, S; has 
n, t n, — 2 degrees of freedom. 

Now we know that 


has a N(0, 1) distribution. Replacing o by S, gives the following. 
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Definition: 
The Two-Sample 
or Pooled t-Test* 


EXAMPLE 10-5 








Given the assumptions of this section, the quantity 


(10-13) 


has a ¢ distribution with n, + n, — 2 degrees of freedom. 


The use of this information to test the hypotheses in Equation 10-11 is now straightfor- 
ward: simply replace p, — p by Ao, and the resulting test statistic has a f distribution with 
n, t n; — 2 degrees of freedom under Hy: p, — p; = Ap. Therefore, the reference distribu- 
tion for the test statistic is the ¢ distribution with n, + n; — 2 degrees of freedom. The location 
of the critical region for both two- and one-sided alternatives parallels those in the one-sample 
case. Because a pooled estimate of variance is used, the procedure is often called the pooled 
t-test. 


Null hypothesis: 


Test statistic: Jy = OO (10-14) 


Alternative Hypothesis Rejection Criterion 
Ay: by — W * Ag to > Íg/2 n 4 n;-2 OF 


lo E m lon, 2 


Hj: pı — Ma > Ag fo = Íu ni n,-2 


Hi: pı — W < Ag ly a 


Two catalysts are being analyzed to determine how they affect the mean yield of a chemical 
process. Specifically, catalyst 1 is currently in use, but catalyst 2 is acceptable. Since catalyst 
2 is cheaper, it should be adopted, providing it does not change the process yield. A test is run 
in the pilot plant and results in the data shown in Table 10-1. Is there any difference between 
the mean yields? Use a = 0.05, and assume equal variances. 

The solution using the eight-step hypothesis-testing procedure is as follows: 


1. The parameters of interest are u; and w,, the mean process yield using catalysts 
1 and 2, respectively, and we want to know if p; — p = 0. 


2. Hy: pı — W = 0, or Hg: by = po 





*While we have given the development of this procedure for the case where the sample sizes could be different, there 
is an advantage to using equal sample sizes n, = n; = n. When the sample sizes are the same from both populations, 
the t-test is more robust to the assumption of equal variances. Please see Section 10-3.2 on the CD. 
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Table 10-1 Catalyst Yield Data, Example 10-5 





Observation 
Number Catalyst 1 Catalyst 2 
1 91.50 89.19 
2 94.18 90.95 
3 92.18 90.46 
4 95.39 93.2] 
5 91.79 97.19 
6 89.07 97.04 
7 94.72 91.07 
8 89.21 92.75 
x, = 92.255 X) = 92.733 
Ss, = 2.39 S = 2.98 
Hip 7 go 
a = 0.05 
The test statistic is 
Ap C X 
iy = 1 2 
ENS 
Sp ni T ; 


Reject Ho if to > Í0.025,14 = 2.145 or if fg < — Í0.025,14 = — 2.145. 


7. Computations: From Table 10-1 we have x, = 92.255,5, = 2.39, n, = 8, x5 = 92.733, 
S, = 2.98, and n; = 8. Therefore 


; _ (m= Ds + (m - 08 (039) + 72.98) — 


= O I = 130 
"p iy PE CEN MED, 
s, = V/7.30 = 2.70 
and 
n-o 92.255 — 92.733 
p = — uL See = -0.35 
2.70 23 + E 2.70 2 + : 
l ny Ny 8 8 


8. Conclusions: Since —2.145 < tọ = —0.35 < 2.145, the null hypothesis cannot be 
rejected. That is, at the 0.05 level of significance, we do not have strong evidence to 
conclude that catalyst 2 results in a mean yield that differs from the mean yield when 
catalyst 1 is used. 


A P-value could also be used for decision making in this example. From Appendix Table IV 
we find that / 49.14 = 0.258 and f9.5 14 = 0.692. Therefore, since 0.258 < 0.35 < 0.692, we 
conclude that lower and upper bounds on the P-value are 0.50 « P < 0.80. Therefore, since 
the P-value exceeds a = 0.05, the null hypothesis cannot be rejected. 
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The Minitab two-sample t-test and confidence interval procedure for Example 10-5 
follows: 


Two-Sample T-Test and CI: Cat 1, Cat 2 


Two-sample T for Cat 1 vs Cat 2 


N Mean StDev SE Mean 
Cat 1 8 92.26 2.39 0.84 
Cat 2 8 92.73 2.99 ell 


Difference = mu Cat 1 — mu Cat 2 

Estimate for difference: —0.48 

95% CI for difference: (—3.37, 2.42) 

T-Test of difference = 0 (vs not = ): T-Value = —0.35 P-Value = 0.730 DF = 14 
Both use Pooled StDev = 2.70 





Notice that the numerical results are essentially the same as the manual computations in 
Example 10-5. The P-value is reported as P = 0.73. The two-sided CI on p, — w, is also 
reported. We will give the computing formula for the CI in Section 10-3.3. Figure 10-2 shows 
the normal probability plot of the two samples of yield data and comparative box plots. The 
normal probability plots indicate that there is no problem with the normality assumption. 
Furthermore, both straight lines have similar slopes, providing some verification of the as- 
sumption of equal variances. The comparative box plots indicate that there is no obvious dif- 
ference in the two catalysts, although catalyst 2 has slightly greater sample variability. 


Case 2: o? # o2 

In some situations, we cannot reasonably assume that the unknown variances o and o5 are 
equal. There is not an exact t-statistic available for testing Hg: |j, — po = Ag in this case. 
However, if Hp: p; — po = Ao is true, the statistic 
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(a) (b) 


Figure 10-2 Normal probability plot and comparative box plot for the catalyst yield data in Example 10-5. 
(a) Normal probability plot, (b) Box plots. 
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EXAMPLE 10-6 


(10-15) 


S2 2 
+ 2) 
n» 


(5/12) 


=l 


(10-16) 





Therefore, if of ~ o5, the hypotheses on differences in the means of two normal distributions are 
tested as in the equal variances case, except that 7) is used as the test statistic and n, + n, — 2 is 
replaced by v in determining the degrees of freedom for the test. 


Arsenic concentration in public drinking water supplies is a potential health risk. An article in 
the Arizona Republic (Sunday, May 27, 2001) reported drinking water arsenic concentrations 
in parts per billion (ppb) for 10 methropolitan Phoenix communities and 10 communities in 
rural Arizona. The data follow: 


Metro Phoenix (x, = 12.5, s, = 7.63) Rural Arizona (x, = 27.5, s2 = 15.3) 
Phoenix, 3 Rimrock, 48 
Chandler, 7 Goodyear, 44 

Gilbert, 25 New River, 40 
Glendale, 10 Apachie Junction, 38 
Mesa, 15 Buckeye, 33 

Paradise Valley, 6 Nogales, 21 

Peoria, 12 Black Canyon City, 20 
Scottsdale, 25 Sedona, 12 

Tempe, 15 Payson, 1 

Sun City, 7 Casa Grande, 18 


We wish to determine it there is any difference in mean arsenic concentrations between met- 
ropolitan Phoenix communities and communities in rural Arizona. Figure 10-3 shows a nor- 
mal probability plot for the two samples of arsenic concentration. The assumption of normal- 
ity appears quite reasonable, but since the slopes of the two straight lines are very different, it 
is unlikely that the population variances are the same. 

Applying the eight-step procedure gives the following: 


1. The parameters of interest are the mean arsenic concentrations for the two geographic 
regions, say, ju, and p», and we are interested in determining whether w; — w = 0. 


2. Ho: pı — w = 0, or Ho: py = p 
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Figure 10-3 Normal 
probability plot of the 
arsenic concentration 
data from Example 
10-6. 
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Hy: p 7 W 
a = 0.05 (say) 


The test statistic is 


m 2 2 


51 $2 
Ny n» 


The degrees of freedom on fj are found from Equation 10-16 as 


E n 2) [o n edi 
"o Gin . (slUmy  [(7.633/10]? . (asay(uop C^ 


n,—]1 ny —1 9 9 


Therefore, using a = 0.05, we would reject Hy: u; = po if fg > 1902513 = 2.160 or if 
fo < — Í9.025,13 = —2.160 


Computations: Using the sample data we find 








5 S X1 — X5 u 135 TS = 977 
i st oS (7.63)? (15.3) ) 
— p — 4 
nj Ny 10 10 
Conclusions: Because tj = —2.77 < 1992513 = —2.160, we reject the null hypoth- 


esis. Therefore, there is evidence to conclude that mean arsenic concentration in the 
drinking water in rural Arizona is different from the mean arsenic concentration in 
metropolitan Phoenix drinking water. Furthermore, the mean arsenic concentration 
is higher in rural Arizona communities. The P-value for this test is approximately 
P — 0.016. 
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The Minitab output for this example follows: 


Two-Sample T-Test and CI: PHX, RuralAZ 


Two-sample T for PHX vs RuralAZ 


N Mean StDev SE Mean 
PHX 10 12:559 7.63 2A 


RuralAZ 10 DES) 153 4.9 


Difference = mu PHX — mu RuralAZ 

Estimate for difference: — 15.00 

95% CI for difference: (~ 26.71, =3.29) 

T-Test of difference = 0 (vs not = ): T-Value = —2.77  P-Value = 0.016 DF = 13 





The numerical results from Minitab exactly match the calculations from Example 10-6. Note 
that a two-sided 95% CI on p, — p; is also reported. We will discuss its computation in 
Section 10-3.4; however, note that the interval does not include zero. Indeed, the upper 95% 
of confidence limit is —3.29 ppb, well below zero, and the mean observed difference is 
X39, 125 — 175 = —15 ppb. 


10-3.2 More about the Equal Variance Assumption (CD Only) 


10-3.3 Choice of Sample Size 


EXAMPLE 10-7 


The operating characteristic curves in Appendix Charts Vle, VIf, VIg, and VIA are used to 
evaluate the type II error for the case where of = ob = o°. Unfortunately, when o7 # o5, the 
distribution of T is unknown if the null hypothesis is false, and no operating characteristic 
curves are available for this case. 

For the two-sided alternative Hi: u; — u» = A # Ap, when o? = 0$ = o? and n, = n, = 
n, Charts VIe and VIf are used with 
_ [A - Al 


a 20 


(10-17) 
where A is the true difference in means that is of interest. To use these curves, they must be 
entered with the sample size n“ = 2n — 1. For the one-sided alternative hypothesis, we use 
Charts VIg and VIA and define d and A as in Equation 10-17. It is noted that the parameter d 
is a function of c, which is unknown. As in the single-sample t-test, we may have to rely on a 
prior estimate of o or use a subjective estimate. Alternatively, we could define the differences 
in the mean that we wish to detect relative to o. 


Consider the catalyst experiment in Example 10-5. Suppose that, if catalyst 2 produces a mean 
yield that differs from the mean yield of catalyst 1 by 4.0%, we would like to reject the null 
hypothesis with probability at least 0.85. What sample size is required? 

Using s, — 2.70 as a rough estimate of the common standard deviation o, we have 
d = |A|/2e = |4.0|/[(2)(2.70)] = 0.74. From Appendix Chart VIe with d = 0.74 and B = 
0.15, we find n° = 20, approximately. Therefore, since n* = 2n — 1, 


n+1 2041 
2 2 








n= = 10.5 = 11(say) 


and we would use sample sizes of n, = n; =n = 11. 
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Minitab will also perform power and sample size calculations for the two-sample t-test (equal 
variances). The output from Example 10-7 is as follows: 


Power and Sample Size 


2-Sample t Test 
Testing mean ] = mean 2 (versus not =) 
Calculating power for mean 1 = mean 2 + difference 


Alpha = 0.05 Sigma = 2.7 
sample Target Actual 


Difference Size Power Power 
4 10 0.8500 0.8793 





The results agree fairly closely with the results obtained from the O.C. curve. 


10-3.4 Confidence Interval on the Difference in Means 


Case 1: o4 = o$ = o? 
To develop the confidence interval for the difference in means p, — u, when both variances 
are equal, note that the distribution of the statistic 


pa e (10-18) 


is the ¢ distribution with n, + n; — 2 degrees of freedom. Therefore P(— t,o n,4n,-2 S T S 
ty /2.n,+n,-2) = 1 — a. Now substituting Equation 10-18 for T and manipulating the quan- 
tities inside the probability statement will lead to the 100(1 — a)% confidence interval on 
Hi ~~ Bo 


Definition 
If x), X>, s? and s$ are the sample means and variances of two random samples of 
sizes n, and n, respectively, from two independent normal populations with un- 
known but equal variances, then a 100(1 — a)% confidence interval on the differ- 
ence in means p; — p; is 


X| — X2 — ly/2,n,+n,-2 Sp 


E E 1 l 
S M 7 M SX — X2 F la/2,n+m-2 o F 75 (Ger) 


where s, = V[(n, — 1)st + (n5 — 1)s$]/(n, + ny — 2) is the pooled estimate 


of the common population standard deviation, and ta/2,n,+m,—2 is the upper a/2 
percentage point of the ¢ distribution with n, + n, — 2 degrees of freedom. 
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EXAMPLE 10-8 


An article in the journal Hazardous Waste and Hazardous Materials (Vol. 6, 1989) reported 
the results of an analysis of the weight of calcium in standard cement and cement doped 
with lead. Reduced levels of calcium would indicate that the hydration mechanism in the 
cement is blocked and would allow water to attack various locations in the cement struc- 
ture. Ten samples of standard cement had an average weight percent calcium of x, — 90.0, 
with a sample standard deviation of s, — 5.0, while 15 samples of the lead-doped cement 
had an average weight percent calcium of x, = 87.0, with a sample standard deviation of 
Sy = 4.0. 

We will assume that weight percent calcium is normally distributed and find a 95% con- 
fidence interval on the difference in means, p, — ps, for the two types of cement. Furthermore, 
we will assume that both normal populations have the same standard deviation. 

The pooled estimate of the common standard deviation is found using Equation 10-12 as 
follows: 


9(5.0 + 14(4.0) 
10+ 15-2 
= 19.52 


Therefore, the pooled standard deviation estimate is s, = V19.52 = 4.4. The 95% confi- 
dence interval is found using Equation 10-19: 


1 1 
Xu eG i Se m = <7, — y — I 
X1 — X2 — 10.025,33 SP4] m, T m = Ma T Ma SXi = X2 fogosos SpA m, T m 
or upon substituting the sample values and using to.025,23 = 2.069, 


90.0 — 87.0 — 2.069(4.4) 


l 
< 90.0 — 87.0 + 2.069(44) 10 Jd: 15 


which reduces to 


Notice that the 95% confidence interval includes zero; therefore, at this level of confidence we 
cannot conclude that there 1s a difference in the means. Put another way, there is no evidence 
that doping the cement with lead affected the mean weight percent of calcium; therefore, we 
cannot claim that the presence of lead affects this aspect of the hydration mechanism at the 
95% level of confidence. 


Case 2: o4 # o% 
In many situations it is not reasonable to assume that o? = o5. When this assumption is un- 


warranted, we may still find a 100(1 — a)% confidence interval on jy, — p, using the fact 
that T = LX, — X; — (wy — u3)]/ V St/n, + S$/n; is distributed approximately as ¢ with 
degrees of freedom v given by Equation 10-16. The CI expression follows. 
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Definition 
If X,, X», sî, and s2 are the means and variances of two random samples of sizes n, and 
n, respectively, from two independent normal populations with unknown and unequal 
variances, an approximate 100(1 — a)% confidence interval on the difference in 


means p, — p 1s 


si 


Xp 


la/2,v n, 


Mq — Bz = Xp — X2 F fgy2y (10-20) 


where v is given by Equation 10-16 and t,» , is the upper a/2 percentage point of the 
t distribution with v degrees of freedom. 


EXERCISES FOR SECTION 10-3 


10-17. The diameter of steel rods manufactured on two dif- 

ferent extrusion machines is being investigated. Two random 

samples of sizes n, = 15 and n, = 17 are selected, and the 

sample means and sample variances are x, = 8.73,57 = 0.35, 

X, = 8.68, and sj = 0.40, respectively. Assume that of = o$ 

and that the data are drawn from a normal distribution. 

(a) Is there evidence to support the claim that the two ma- 
chines produce rods with different mean diameters? Use 
a = 0.05 in arriving at this conclusion. 

(b) Find the P-value for the t-statistic you calculated in 
part (a). 

(c) Construct a 95% confidence interval for the difference in 
mean rod diameter. Interpret this interval. 


10-18. An article in Fire Technology investigated two dif- 
ferent foam expanding agents that can be used in the nozzles 
of fire-fighting spray equipment. A random sample of five ob- 
servations with an aqueous film-forming foam (AFFF) had a 
sample mean of 4.7 and a standard deviation of 0.6. A random 
sample of five observations with alcohol-type concentrates 
(ATC) had a sample mean of 6.9 and a standard deviation 0.8. 
Find a 95% confidence interval on the difference in mean 
foam expansion of these two agents. Can you draw any con- 
clusions about which agent produces the greatest mean foam 
expansion? Assume that both populations are well represented 
by normal distributions with the same standard deviations. 


10-19. Two catalysts may be used in a batch chemical 
process. Twelve batches were prepared using catalyst 1, re- 
sulting in an average yield of 86 and a sample standard devia- 
tion of 3. Fifteen batches were prepared using catalyst 2, and 
they resulted in an average yield of 89 with a standard devia- 
tion of 2. Assume that yield measurements are approximately 
normally distributed with the same standard deviation. 
(a) Is there evidence to support a claim that catalyst 2 pro- 
duces a higher mean yield than catalyst 1? Use a = 0.01. 
(b) Find a 95% confidence interval on the difference in mean 
yields. 





10-20. The deflection temperature under load for two dif- 
ferent types of plastic pipe is being investigated. Two random 
samples of 15 pipe specimens are tested, and the deflection 
temperatures observed are as follows (in °F): 


Type 1: 206, 188, 205, 187, 194, 193, 207, 185, 189, 213, 
192, 210, 194, 178, 205. 

Type 2: 177, 197, 206, 201, 180, 176, 185, 200, 197, 192, 
198, 188, 189, 205, 192. 


(a) Construct box plots and normal probability plots for the 
two samples. Do these plots provide support of the as- 
sumptions of normality and equal variances? Write a prac- 
tical interpretation for these plots. 

(b) Do the data support the claim that the deflection tempera- 
ture under load for type 2 pipe exceeds that of type 1? In 
reaching your conclusions, use a = 0.05. 

(c) Calculate a P-value for the test in part (b). 

(d) Suppose that if the mean deflection temperature for type 2 
pipe exceeds that of type 1 by as much as 5°F, it is important 
to detect this difference with probability at least 0.90. Is the 
choice of n, = m = 15 in part (a) of this problem adequate? 

10-21. In semiconductor manufacturing, wet chemical etch- 

ing is often used to remove silicon from the backs of wafers 

prior to metalization. The etch rate is an important characteris- 
tic in this process and known to follow a normal distribution. 

Two different etching solutions have been compared, using two 

random samples of 10 wafers for each solution. The observed 

etch rates are as follows (in mils per minute): 


Solution 1 Solution 2 
9.9 10.6 10.2 10.0 
9.4 10.3 10.6 10.2 
9.3 10.0 10.7 10.7 
9.6 10.3 10.4 10.4 


10.2 10.1 10.5 10.3 
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(a) Do the data support the claim that the mean etch rate is the 
same for both solutions? In reaching your conclusions, use 
a = 0.05 and assume that both population variances are 
equal. 

(b) Calculate a P-value for the test in part (a). 

(c) Find a 95% confidence interval on the difference in mean 
etch rates. 

(d) Construct normal probability plots for the two samples. 
Do these plots provide support for the assumptions of nor- 
mality and equal variances? Write a practical interpreta- 
tion for these plots. 

10-22. Two suppliers manufacture a plastic gear used in a 

laser printer. The impact strength of these gears measured in 

foot-pounds is an important characteristic. A random sample 

of 10 gears from supplier 1 results in x, = 290 and s, = 12, 

while another random sample of 16 gears from the second 

supplier results in x, = 321 and s, = 22. 

(a) Is there evidence to support the claim that supplier 2 pro- 
vides gears with higher mean impact strength? Use a = 
0.05, and assume that both populations are normally dis- 
tributed but the variances are not equal. 

(b) What is the P-value for this test? 

(c) Do the data support the claim that the mean impact 
strength of gears from supplier 2 is at least 25 foot-pounds 
higher than that of supplier 1? Make the same assump- 
tions as in part (a). 

10-23.  Reconsider the situation in Exercise 10-22, part (a). 

Construct a confidence interval estimate for the difference in 

mean impact strength, and explain how this interval could 

be used to answer the question posed regarding supplier- 
to-supplier differences. 

10-24. A photoconductor film is manufactured at a nominal 

thickness of 25 mils. The product engineer wishes to increase 

the mean speed of the film, and believes that this can be 
achieved by reducing the thickness of the film to 20 mils. 

Eight samples of each film thickness are manufactured in a pi- 

lot production process, and the film speed (in microjoules per 

square inch) is measured. For the 25-mil film the sample data 
result is x, = 1.15 and s, = 0.11, while for the 20-mil film, 
the data yield x; — 1.06 and s, — 0.09. Note that an increase 

in film speed would lower the value of the observation in mi- 

crojoules per square inch. 

(a) Do the data support the claim that reducing the film thick- 
ness increases the mean speed of the film? Use a = 0.10 
and assume that the two population variances are equal 
and the underlying population of film speed is normally 
distributed. 

(b) What is the P-value for this test? 

(c) Find a 95% confidence interval on the difference in the 
two means. 

10-25. The melting points of two alloys used in formulating 

solder were investigated by melting 21 samples of each material. 

The sample mean and standard deviation for alloy 1 was 

x, = 420°F and s, = 4°F, while for alloy 2 they were 


x, = 426°F, and s, = 3°F Do the sample data support the claim 
that both alloys have the same melting point? Use a = 0.05 and 
assume that both populations are normally distributed and have 
the same standard deviation. Find the P-value for the test. 


10-26. Referring to the melting point experiment in 
Exercise 10-25, suppose that the true mean difference in 
melting points is 3°F How large a sample would be required 
to detect this difference using an a = 0.05 level test with 
probability at least 0.9? Use o, = o, = 4 as an initial esti- 
mate of the common standard deviation. 


10-27. Two companies manufacture a rubber material in- 
tended for use in an automotive application. The part will be 
subjected to abrasive wear in the field application, so we 
decide to compare the material produced by each company in 
a test. Twenty-five samples of material from each company 
are tested in an abrasion test, and the amount of wear after 

1000 cycles is observed. For company 1, the sample mean and 

standard deviation of wear are x; = 20 milligrams/1000 

cycles and s, = 2 milligrams/1000 cycles, while for company 

2 we obtain x; = 15 milligrams/1000 cycles and s, = 8 mil- 

ligrams/1000 cycles. 

(a) Do the data support the claim that the two companies pro- 
duce material with different mean wear? Use a = 0.05, 
and assume each population is normally distributed but 
that their variances are not equal. 

(b) What is the P-value for this test? 

(c) Do the data support a claim that the material from com- 
pany 1 has higher mean wear than the material from com- 
pany 2? Use the same assumptions as in part (a). 


10-28. The thickness of a plastic film (in mils) on a sub- 
strate material is thought to be influenced by the temperature 
at which the coating is applied. A completely randomized ex- 
periment is carried out. Eleven substrates are coated at 125°F, 
resulting in a sample mean coating thickness of x, = 103.5 
and a sample standard deviation of s, — 10.2. Another 13 sub- 
strates are coated at 150°F, for which x; = 99.7 and s, = 20.1 
are observed. It was originally suspected that raising the 
process temperature would reduce mean coating thickness. Do 
the data support this claim? Use a = 0.01 and assume that the 
two population standard deviations are not equal. Calculate an 
approximate P-value for this test. 


10-29. Reconsider the coating thickness experiment in 
Exercise 10-28. How could you have answered the question 
posed regarding the effect of temperature on coating thickness 
by using a confidence interval? Explain your answer. 


10-30. Reconsider the abrasive wear test in Exercise 10-27. 
Construct a confidence interval that will address the questions 
in parts (a) and (c) in that exercise. 


10-31. The overall distance traveled by a golf ball is tested 
by hitting the ball with Iron Byron, a mechanical golfer with a 
swing that is said to emulate the legendary champion, Byron 
Nelson. Ten randomly selected balls of two different brands 
are tested and the overall distance measured. The data follow: 


brand 1: 2/5,280,287, 271,283,211,21/9, 275, 263,267 
Brand 2: 258, 244, 260, 265, 273, 281, 271, 270, 263, 268 


(a) Is there evidence that overall distance is approximately 
normally distributed? Is an assumption of equal variances 
justified? 

(b) Test the hypothesis that both brands of ball have equal 
mean overall distance. Use a = 0.05. 

(c) What is the P-value of the test statistic in part (b)? 

(d) What is the power of the statistical test in part (b) to detect 
a true difference in mean overall distance of 5 yards? 

(e) What sample size would be required to detect a true dif- 
ference in mean overall distance of 3 yards with power of 
approximately 0.75? 

(f) Construct a 9596 two-sided CI on the mean difference in 
overall distance between the two brands of golf balls. 

10-32. In Example 9-6 we described how the "spring-like 

effect" in a golf club could be determined by measuring the 

coefficient of restitution (the ratio of the outbound velocity to 
the inbound velocity of a golf ball fired at the clubhead). 

Twelve randomly selected drivers produced by two 
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clubmakers are tested and the coefficient of restitution meas- 
ured. The data follow: 


Club 1: 0.8406, 0.8104, 0.8234, 0.8198, 0.8235, 0.8562, 
0.8123, 0.7976, 0.8184, 0.8265, 0.7773, 0.7871 


Club 2: 0.8305, 0.7905, 0.8352, 0.8380, 0.8145, 0.8465, 
0.8244, 0.8014, 0.8309, 0.8405, 0.8256, 0.8476 


(a) Is there evidence that coefficient of restitution 1s approxi- 
mately normally distributed? Is an assumption of equal 
variances justified? 

(b) Test the hypothesis that both brands of ball have equal 
mean coefficient of restitution. Use a = 0.05. 

(c) What is the P-value of the test statistic in part (b)? 

(d) What is the power of the statistical test in part (b) to detect 
a true difference in mean coefficient of restitution of 0.2? 

(e) What sample size would be required to detect a true dif- 
ference in mean coefficient of restitution of 0.1 with 
power of approximately 0.8? 

(f) Construct a 95% two-sided CI on the mean difference in co- 
efficient of restitution between the two brands of golf clubs. 


A special case of the two-sample t-tests of Section 10-3 occurs when the observations on 
the two populations of interest are collected in pairs. Each pair of observations, say (Xi; 
X5;), is taken under homogeneous conditions, but these conditions may change from one 
pair to another. For example, suppose that we are interested in comparing two different 
types of tips for a hardness-testing machine. This machine presses the tip into a metal spec- 
imen with a known force. By measuring the depth of the depression caused by the tip, the 
hardness of the specimen can be determined. If several specimens were selected at random, 
half tested with tip 1, half tested with tip 2, and the pooled or independent t-test in Section 
10-3 was applied, the results of the test could be erroneous. The metal specimens could 
have been cut from bar stock that was produced in different heats, or they might not 
be homogeneous in some other way that might affect hardness. Then the observed differ- 
ence between mean hardness readings for the two tip types also includes hardness differ- 


ences between specimens. 


A more powerful experimental procedure is to collect the data in pairs—that 1s, to make 
two hardness readings on each specimen, one with each tip. The test procedure would then 
consist of analyzing the differences between hardness readings on each specimen. If there is 
no difference between tips, the mean of the differences should be zero. This test procedure is 


called the paired t-test. 


Let (Xi X51), (X15, X55), ..., (Xin X2,) be a set of n paired observations where we assume 
that the mean and variance of the population represented by X, are u and o1, and the mean 
and variance of the population represented by X; are u, and o$. Define the differences be- 
tween each pair of observations as D; = Xy — X5, J = 1,2,..., n. The D,’s are assumed to be 


normally distributed with mean 


Mp = E(X — X) = EX) — E(X2) = ba — po 
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The Paired 
t- lest 


EXAMPLE 10-9 





and variance o7, so testing hypotheses about the difference between p, and w, can be 


accomplished by performing a one-sample t-test on wp. Specifically, testing Hp: pj — ps = 
A, against H,: p; — p» # Ag is equivalent to testing 

Ho: up = Ao 

Hi: Lp F Ao (10-21) 


The test statistic is given below. 


Null hypothesis: Hg: wp = Ao 


ELO 
—OSp/NVn 


Test statistic: To (10-22) 


Alternative Hypothesis Rejection Region 


Hy: up = Ag iy TOT T I 
Hi: up > Ao iy c s =I 
Hy up < Ao ty < fa, n- 


In Equation 10-22, D is the sample average of the n differences D,, D>,..., D,, and Sp is the 
sample standard deviation of these differences. 


An article in the Journal of Strain Analysis (1983, Vol. 18, No. 2) compares several methods 
for predicting the shear strength for steel plate girders. Data for two of these methods, the 
Karlsruhe and Lehigh procedures, when applied to nine specific girders, are shown in Table 
10-2. We wish to determine whether there is any difference (on the average) between the two 
methods. 

The eight-step procedure is applied as follows: 


1. The parameter of interest is the difference in mean shear strength between the two 
methods, say, up = p, — p» = O. 


pA Hp: Mp = 0 


Table 10-2 Strength Predictions for Nine Steel Plate Girders 
(Predicted Load/Observed Load) 


Girder Karlsruhe Method Lehigh Method Difference d; 


S1/1 1.186 1.061 0.119 
S2/1 1.151 0.992 0.159 
S3/1 1.522 1.063 0.259 
S4/1 1:339 1.062 0.277 
S5/1 1.200 1.065 0.138 
S2/1 1.402 1.178 0.224 
52/2 1.365 1.037 0.328 
S2/3 Fee fo 1.086 0.451 


S2/4 1:559 1.052 0.507 
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Hi: Wp + 0 
a = 0.05 
5. The test statistic is 
__ d 
j Sp/ Vn 
6. Reject Ho if fo > 10 .025,8 = 2.306 or if fo < — l0.025,8 = — 2.306. 


Computations: The sample average and standard deviation of the differences d; are 


d= 0.2736 and sp = 0.1356, so the test statistic is 


"" d | 0.2736 -— 
° sp/Vn  01356/V9 ` 





8. Conclusions: Since t = 6.05 > 2.306, we conclude that the strength prediction 
methods yield different results. Specifically, the data indicate that the Karlsruhe 
method produces, on the average, higher strength predictions than does the Lehigh 
method. The P-value for tọ = 6.05 is P = 0.0002, so the test statistic is well into the 
critical region. 


Paired Versus Unpaired Comparisons 

In performing a comparative experiment, the investigator can sometimes choose between the 
paired experiment and the two-sample (or unpaired) experiment. If n measurements are to be 
made on each population, the two-sample t-statistic is 


X,—X,—4 
T, = 1 2 = 
SpA n * mn 


which would be compared to ¢,,,_,, and of course, the paired t-statistic is 





D mm Ao 
To = 
Sp/ Vn 
which is compared to t,—;. Notice that since 
u n D, n (Xj; — Xo) Ww Xn E X. u u 
J I 2j Vj 2j 
dd D. mx lub Lu 
j=l 7-1 j=l 77 


the numerators of both statistics are identical. However, the denominator of the two-sample 
t-test 1s based on the assumption that X, and X, are independent. In many paired experiments, 
a strong positive correlation p exists between X, and X5. Then it can be shown that 


V(D) = V(X, — X; — Ao) 
V(X) T V(X») = cov (X, X2) 
- 207(1 — p) 
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Definition 


assuming that both populations X, and X, have identical variances o°. Furthermore, S/n 
estimates the variance of D. Whenever there is positive correlation within the pairs, the de- 
nominator for the paired t-test will be smaller than the denominator of the two-sample t-test. 
This can cause the two-sample t-test to considerably understate the significance of the data if 
it 1s incorrectly applied to paired samples. 

Although pairing will often lead to a smaller value of the variance of X, — X5, it does 
have a disadvantage—namely, the paired t-test leads to a loss of n — 1 degrees of freedom in 
comparison to the two-sample t-test. Generally, we know that increasing the degrees of free- 
dom of a test increases the power against any fixed alternative values of the parameter. 

So how do we decide to conduct the experiment? Should we pair the observations or not? 
Although there is no general answer to this question, we can give some guidelines based on 
the above discussion. 


1. Ifthe experimental units are relatively homogeneous (small o) and the correlation 
within pairs 1s small, the gain in precision attributable to pairing will be offset by the 
loss of degrees of freedom, so an independent-sample experiment should be used. 


2. Ifthe experimental units are relatively heterogeneous (large o) and there is large pos- 
itive correlation within pairs, the paired experiment should be used. Typically, this 
case occurs when the experimental units are the same for both treatments; as in 
Example 10-9, the same girders were used to test the two methods. 


Implementing the rules still requires judgment, because o and p are never known precisely. 
Furthermore, if the number of degrees of freedom is large (say, 40 or 50), the loss of n — 1 of 
them for pairing may not be serious. However, if the number of degrees of freedom is small 
(say, 10 or 20), losing half of them 1s potentially serious if not compensated for by increased 
precision from pairing. 


A Confidence Interval for up 
To construct the confidence interval for tp = p — p», note that 


T= D — up 
Sp/Vn 

follows a t distribution with n — 1 degrees of freedom. Then, since P(—ta/2n-1 = T = 
ta/2n-1) = 1 — a, we can substitute for T in the above expression and perform the necessary 
steps to isolate [up = p — p between the inequalities. This leads to the following 100(1 — a)% 
confidence interval on p; — pz. 


If d and sp are the sample mean and standard deviation of the difference of n random 
pairs of normally distributed measurements, a 100(1 — a)% confidence interval on 
the difference in means pp = p4 — Py is 


d — fa. n-1Sp/ V = Wp = d + te/2,n—-1Sp/ Vn (10-23) 


where f,/2,,-1 is the upper a/2% point of the ¢-distribution with n — 1 degrees of 
freedom. 





EXAMPLE 10-10 
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Table 10-3 Time in Seconds to Parallel Park Two 


Automobiles 
Automobile Difference 
Subject 1x) 2(x»;) (dj) 
1 37.0 17.8 19.2 
2 25.8 20.2 5.6 
3 16.2 16.8 —0.6 
4 24.2 41.4 =i 
5 22.0 21.4 0.6 
6 33.4 38.4 =5 
7 23.8 16.8 7.0 
8 58.2 32.2 26.0 
9 33.6 27.8 5.8 
10 24.4 2:2 ]:2 
11 23.4 29.6 =62 
12 21.2 20.6 0.6 
13 36.2 22:2 4.0 
14 29.8 53.8 —24.0 





This confidence interval is also valid for the case where o? # 0%, because så estimates 02, = 
V(X, — X,). Also, for large samples (say, n = 30 pairs), the explicit assumption of normality 
is unnecessary because of the central limit theorem. 


The journal Human Factors (1962, pp. 375-380) reports a study in which n = 14 subjects 
were asked to parallel park two cars having very different wheel bases and turning radii. The 
time in seconds for each subject was recorded and is given in Table 10-3. From the column of 
observed differences we calculate d = 1.21 and sp = 12.68. The 90% confidence interval for 


Mp = M4 — p» is found from Equation 9-24 as follows: 
d — to.05,135p/ Vn <ppsdt to.05,135p/ Vn 
1.21 — 1.771(12.68)/V14 < up < 121  1.771(12.68)/ VÀ 


2490. = 5,2 901 


Notice that the confidence interval on wp includes zero. This implies that, at the 90% level of con- 
fidence, the data do not support the claim that the two cars have different mean parking times u; 
and p». That is, the value up = u; — w = 01s not inconsistent with the observed data. 


EXERCISES FOR SECTION 10-4 


10-33. 


Consider the shear strength experiment described in 


must be normal? Use a normal probability plot to investigate 


Example 10-9. Construct a 95% confidence interval on the 
difference in mean shear strength for the two methods. Is the 
result you obtained consistent with the findings in Example 
10-9? Explain why. 

10-34. Reconsider the shear strength experiment described 
in Example 10-9. Do each of the individual shear strengths 
have to be normally distributed for the paired t-test to be ap- 
propriate, or is it only the difference in shear strengths that 


the normality assumption. 


10-35. Consider the parking data in Example 10-10. Use 
the paired t-test to investigate the claim that the two types of 
cars have different levels of difficulty to parallel park. Use 
a = 0.10. Compare your results with the confidence interval 
constructed in Example 10-10 and comment on why they are 
the same or different. 
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10-36. Reconsider the parking data in Example 10-10. 
Investigate the assumption that the differences in parking 
times are normally distributed. 


10-37. The manager of a fleet of automobiles is testing two 
brands of radial tires and assigns one tire of each brand at ran- 
dom to the two rear wheels of eight cars and runs the cars un- 
til the tires wear out. The data (in kilometers) follow. Find a 
99% confidence interval on the difference in mean life. Which 
brand would you prefer, based on this calculation? 


Car Brand 1 Brand 2 
l 36,925 34,318 
2 45,300 42,280 
3 36,240 35,500 
4 32,100 31,950 
5 37,210 38,015 
6 48,360 47,800 
7 38,200 37,810 
8 33,500 33,215 


10-38. A computer scientist is investigating the usefulness 
of two different design languages in improving programming 
tasks. Twelve expert programmers, familiar with both lan- 
guages, are asked to code a standard function in both lan- 
guages, and the time (in minutes) is recorded. The data follow: 


Time 
Design Design 
Language Language 
Programmer 1 D 
1 17 18 
2 16 14 
3 21 19 
4 14 11 
5 18 23 
6 24 21 
7 16 10 
8 14 13 
9 21 19 
10 23 24 
11 13 15 
12 18 20 


(a) Find a 95% confidence interval on the difference in mean 
coding times. Is there any indication that one design lan- 
guage is preferable? 

(b) Is the assumption that the difference in coding time is nor- 
mally distributed reasonable? Show evidence to support 
your answer. 


10-39. Fifteen adult males between the ages of 35 and 50 
participated in a study to evaluate the effect of diet and ex- 
ercise on blood cholesterol levels. The total cholesterol was 
measured in each subject initially and then three months af- 
ter participating in an aerobic exercise program and switch- 
ing to a low-fat diet. The data are shown in the accompany- 
ing table. Do the data support the claim that low-fat diet and 
aerobic exercise are of value in producing a mean reduction 
in blood cholesterol levels? Use a = 0.05. 


Blood Cholesterol Level 


Subject Before After 
l 265 229 
2 240 231 
3 258 221 
4 295 240 
5 251 238 
6 245 241 
7 287 234 
8 314 256 
9 260 247 

10 279 239 

11 283 246 

12 240 218 

13 238 219 

14 225 226 

15 247 233 


10-40. An article in the Journal of Aircraft (Vol. 23, 1986, 
pp. 859—864) describes a new equivalent plate analysis 
method formulation that is capable of modeling aircraft 
structures such as cranked wing boxes and that produces 
results similar to the more computationally intensive finite 
element analysis method. Natural vibration frequencies for 
the cranked wing box structure are calculated using both 
methods, and results for the first seven natural frequencies 
follow: 


Finite Equivalent 
Element Plate, 
Freq. Cycle/s Cycle/s 
1 14.58 14.76 
2 48.52 49.10 
3 97.22 99.99 
4 113.99 117.53 
5 174.73 181.22 
6 212.72 220.14 
7 277.38 294.80 
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(a) Do the data suggest that the two methods prove the same 
mean value for natural vibration frequency? Use a — 
0.05. 

(b) Find a 95% confidence interval on the mean difference 
between the two methods. 


10-41. Ten individuals have participated in a diet-modifica- 
tion program to stimulate weight loss. Their weight both be- 
fore and after participation in the program is shown in the fol- 
lowing list. Is there evidence to support the claim that this 
particular diet-modification program is effective in producing 
a mean weight reduction? Use a — 0.05. 


Subject Before After 
1 195 187 
2 213 195 
3 247 221 
4 201 190 
5 187 175 
6 210 197 
7 215 199 
8 246 221 
9 294 278 

10 310 285 


10-42. Two different analytical tests can be used to deter- 
mine the impurity level in steel alloys. Eight specimens 
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OF TWO NORMAL POPULATIONS 


are tested using both procedures, and the results are shown in 
the following tabulation. Is there sufficient evidence to con- 
clude that both tests give the same mean impurity level, 
using a = 0.01? 


Specimen Test 1 Test 2 
1 12 1.4 
2 1.3 LJ 
3 1.5 1.5 
4 1.4 1.3 
5 17 2.0 
6 1.8 Pm 
7 1.4 1.7 
8 1.3 1.6 


10-43. Consider the weight-loss data in Exercise 10-41. Is 
there evidence to support the claim that this particular diet- 
modification program will result in a mean weight loss of at 
least 10 pounds? Use a = 0.05. 


10-44. Consider the weight-loss experiment in Exercise 
10-41. Suppose that, if the diet-modification program results 
in mean weight loss of at least 10 pounds, it is important to de- 
tect this with probability of at least 0.90. Was the use of 10 
subjects an adequate sample size? If not, how many subjects 
should have been used? 


We now introduce tests and confidence intervals for the two population variances shown in 
Fig. 10-1. We will assume that both populations are normal. Both the hypothesis-testing and 
confidence interval procedures are relatively sensitive to the normality assumption. 


10-5.1 The F Distribution 


Suppose that two independent normal populations are of interest, where the population means 
and variances, say, p4, 07, p», and o2, are unknown. We wish to test hypotheses about the 
equality of the two variances, say, Hy: o1 = 03. Assume that two random samples of size n 
from population 1 and of size n, from population 2 are available, and let $7 and S$ be the sam- 
ple variances. We wish to test the hypotheses 


Ho: 01 = 5 


Hy: 0; to (10-24) 


The development of a test procedure for these hypotheses requires a new probability 
distribution, the F distribution. The random variable F is defined to be the ratio of two 
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Definition 


independent chi-square random variables, each divided by its number of degrees of free- 
dom. That 1s, 


Wh 
pu (10-25) 


where W and Y are independent chi-square random variables with u and v degrees of freedom, 
respectively. We now formally state the sampling distribution of F. 


Let W and Y be independent chi-square random variables with u and v degrees of 
freedom, respectively. Then the ratio 


_ Wru 
CE (10-26) 


has the probability density function 


v 


u/2 
z) uet 


0 « x « o (10-27) 


and is said to follow the F distribution with u degrees of freedom in the numerator 
and v degrees of freedom in the denominator. It is usually abbreviated as F’, ,. 





The mean and variance of the F distribution are u = v/(v — 2) for v > 2, and 


2v^(u + v — 2) 
u(v — 2)°(v — 4y 


Z2 





v>4 


Two F distributions are shown in Fig. 10-4. The F random variable is nonnegative, and the 
distribution is skewed to the right. The F distribution looks very similar to the chi-square dis- 
tribution; however, the two parameters u and v provide extra flexibility regarding shape. 

The percentage points of the F distribution are given in Table V of the Appendix. Let 
Jou be the percentage point of the F distribution, with numerator degrees of freedom u and 
denominator degrees of freedom v such that the probability that the random variable F ex- 
ceeds this value is 


oO 


PE > faus) = | fear =e 


Soui 


This is illustrated in Fig. 10-5. For example, if u = 5 and v = 10, we find from Table V of the 
Appendix that 


PUR > fo.05,5,10) = P(F 5,10 > 3.33) = 0.05 
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u-5,v=15 fx) 


0 2 4 6 8 10 3 fi-a u,v fa, u,v x 


Figure 10-4 Probability density functions of Figure 10-5 Upper and lower percentage 
two F distributions. points of the F distribution. 


That is, the upper 5 percentage point of F's 10 iS Jo.05,5,10 = 3.33. 
Table V contains only upper-tail percentage points (for selected values of f, ,,, for a — 
0.25) of the F distribution. The lower-tail percentage points fj. ,,,, can be found as follows. 


(10-28) 





For example, to find the lower-tail percentage point f, os ; ;o, note that 


l l 


= — — ————0211 
Jo.95,5.10 foosios 4.74 


10-5.2 Development of the F Distribution (CD Only) 
10-5.3 Hypothesis Tests on the Ratio of Two Variances 


A hypothesis-testing procedure for the equality of two variances is based on the following result. 


Let X11, Xiz ..., Xin, Þe a random sample from a normal population with mean p; and 
variance o^, and let X, X5», ..., Xn, be a random sample from a second normal pop- 
ulation with mean p, and variance 05. Assume that both normal populations are 
independent. Let S? and S$ be the sample variances. Then the ratio 


_ Si/oi 
25/55 


has an F distribution with n, — 1 numerator degrees of freedom and n) — 1 denom- 
inator degrees of freedom. 
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This result is based on the fact that (n, — 1)S%7/o7 is a chi-square random variable with n, — 1 
degrees of freedom, that (n; — 1)5$/03 is a chi-square random variable with n) — 1 degrees 
of freedom, and that the two normal populations are independent. Clearly under the null 
hypothesis Hy: o? = 0} the ratio Fy = S1/S5 has an F, —1n,-1 distribution. This is the basis of 
the following test procedure. 


Null hypothesis: blo a = S 
Si 


(sj 


Test statistic: Fo (10-29) 


Alternative Hypotheses Rejection Criterion 
DE, 2 
Hi0 a= 057 WO = oq e EI 
dA 2 
Hi: 0i > 03 D Sar 
D 2 
Hoi- o; D Aa a 


Oxide layers on semiconductor wafers are etched in a mixture of gases to achieve the proper 
thickness. The variability in the thickness of these oxide layers is a critical characteristic of the 
wafer, and low variability is desirable for subsequent processing steps. Two different mixtures 
of gases are being studied to determine whether one is superior in reducing the variability of 
the oxide thickness. Twenty wafers are etched in each gas. The sample standard deviations of 
oxide thickness are s, = 1.96 angstroms and s, = 2.13 angstroms, respectively. Is there any 
evidence to indicate that either gas is preferable? Use a = 0.05. 

The eight-step hypothesis-testing procedure may be applied to this problem as follows: 


1. The parameters of interest are the variances of oxide thickness o1 and o5. We will 
assume that oxide thickness is a normal random variable for both gas mixtures. 


2. H0, =o 
3. Ayo; ro 
4. a= 0.05 
5. The test statistic is given by Equation 10-29: 
s 
um 


6. Since n, = m = 20, we will reject Hg: of = 04 if fo > fo.025,19,19 = 2.53 or 


if fo < fo.975,19,19 = 1/fo.025,19,19 = 1/2.53 = 0.40. 
7. Computations: Since s? = (1.96)? = 3.84 and s = (2.13)° = 4.54, the test statistic is 


8. Conclusions: Since fo975 1919 = 0.40 < fo = 0.85 < foo551919 = 2.53, we cannot 
reject the null hypothesis Hy: o? = o% at the 0.05 level of significance. Therefore, 
there is no strong evidence to indicate that either gas results in a smaller variance of 
oxide thickness. 
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We may also find a P-value for the F-statistic in Example 10-11. Since fo 59.199 = 1.00, 
the computed value of the test statistic f, = s1/s5 = 3.84/4.54 = 0.85 is nearer the lower 
tail of the F distribution than the upper tail. The probability that an F-random variable with 
19 numerator and denominator degrees of freedom is less than 0.85 is 0.3634. Since it 1s ar- 
bitrary which population is identified as “one,” we could have computed the test statistic as 
fo = 4.54/3.84 = 1.18. The probability that an F-random variable with 19 numerator and 
denominator degrees of freedom exceeds 1.18 is 0.3610. Therefore, the P-value for the test 
statistic fọ = 0.85 is the sum of these two probabilities, or P = 0.3634 + 0.3610 = 0.7244. 
Since the P-value exceeds 0.05, the null hypothesis Hy: o? = of cannot be rejected. (The 
probabilities given above were computed using a hand-held calculator.) 


10-5.4  Q-Error and Choice of Sample Size 


EXAMPLE 10-12 


Appendix Charts VIo, VIp, VIg, and VIr provide operating characteristic curves for the F-test 
given in Section 10-5.1 for a = 0.05 and a = 0.01, assuming that n, = n; = n. Charts VIo 
and VIp are used with the two-sided alternate hypothesis. They plot B against the abscissa 
parameter 

01 


A= (10-30) 


02 
for various 7, = n; = n. Charts VIq and VIr are used for the one-sided alternative hypotheses. 


For the semiconductor wafer oxide etching problem in Example 10-11, suppose that one gas 
resulted in a standard deviation of oxide thickness that 1s half the standard deviation of oxide 
thickness of the other gas. If we wish to detect such a situation with probability at least 0.80, 
is the sample size n, = n, = 20 adequate? 

Note that if one standard deviation is half the other, 


By referring to Appendix Chart VIo with n, = n; = n = 20 and à = 2, we find that B = 0.20. 
Therefore, if B = 0.20, the power of the test (which is the probability that the difference in 
standard deviations will be detected by the test) is 0.80, and we conclude that the sample sizes 
n; = n; = 20 are adequate. 


10-5.5 Confidence Interval on the Ratio of Two Variances 


To find the confidence interval on 01/05, recall that the sampling distribution of 


_ 85/05 


— Si/oi 





is an F with n; — 1 and n, — 1 degrees of freedom. Therefore, P(fi-45, 44-1 F = 
Áa2m-1,n,-1) = 1 — a. Substitution for F and manipulation of the inequalities will lead to 
the 100(1 — a)% confidence interval for o1/o$. 
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EXAMPLE 10-13 


If 5? and sô are the sample variances of random samples of sizes n, and n», respec- 
tively, from two independent normal populations with unknown variances o and o5, 
then a 100(1 — «)% confidence interval on the ratio 07/02 is 


51 Oj si 


T =a ah = ot = a CGE 
S2 Un ED 


where fy55,—15,-1 2d fi—o/2n,-1,-1 are the upper and lower a/2 percentage 
points of the F distribution with n; — 1 numerator and », — 1 denominator degrees 
of freedom, respectively. A confidence interval on the ratio of the standard deviations 
can be obtained by taking square roots in Equation 10-31. 





A company manufactures impellers for use in Jet-turbine engines. One of the operations 
involves grinding a particular surface finish on a titanium alloy component. Two different 
grinding processes can be used, and both processes can produce parts at identical mean sur- 
face roughness. The manufacturing engineer would like to select the process having the 
least variability in surface roughness. A random sample of n; = 11 parts from the first 
process results in a sample standard deviation s, = 5.1 microinches, and a random sample 
of n, — 16 parts from the second process results in a sample standard deviation of s, — 4.7 
microinches. We will find a 90% confidence interval on the ratio of the two standard devi- 
ations, 01/0». 

Assuming that the two processes are independent and that surface roughness is normally 
distributed, we can use Equation 10-31 as follows: 

si oj 


ees 


si 
Jois 55-5 2 
52 o3 


> f0.05,15,10 
52 

5.1) o? (5.1) 
( | 939 « 91 «f ) 
(4.7) o (4.7) 








2.85 
or upon completing the implied calculations and taking square roots, 
01 
0.678 < — < 1.887 
02 


Notice that we have used Equation 10-28 to find foosi1s19 = 1/fo.0s.10,15 = 1/2.54 = 0.39. 
Since this confidence interval includes unity, we cannot claim that the standard deviations of 
surface roughness for the two processes are different at the 90% level of confidence. 


EXERCISES FOR SECTION 10-5 


10-45. For an F distribution, find the following: (C) foot2010 (d) fo.75,.7,15 
(a) foossio (©) fo.10,24,9 (€) fo.s0,10,12  (£) fo.99,20,10 
(C) Jo.os,s,15 — (d) fo.75,5,10 10-47. Two chemical companies can supply a raw material. 
(€) fo9o249 — (f) fo.9s,8,15 The concentration of a particular element in this material is 
10-46. For an F distribution, find the following: important. The mean concentration for both suppliers is the 


(a) Jo.25,7,15 


(b) fo1010.12 same, but we suspect that the variability in concentration may 
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differ between the two companies. The standard deviation of 
concentration in a random sample of n, = 10 batches pro- 
duced by company 1 is s, — 4.7 grams per liter, while for 
company 2, a random sample of n) = 16 batches yields s, = 
5.8 grams per liter. Is there sufficient evidence to conclude 
that the two population variances differ? Use a = 0.05. 


10-48. Consider the etch rate data in Exercise 10-21. Test 
the hypothesis Hy: of = o$ against H,: of # o% using a = 
0.05, and draw conclusions. 

10-49. Consider the etch rate data in Exercise 10-21. 

Suppose that if one population variance is twice as large as the 

other, we want to detect this with probability at least 0.90 

(using a = 0.05). Are the sample sizes n, = n; = 10 adequate? 

10-50. Consider the diameter data in Exercise 10-17. Con- 

struct the following: 

(a) A 90% two-sided confidence interval on o | /0>. 

(b) A 95% two-sided confidence interval on o/o. Comment 
on the comparison of the width of this interval with the 
width of the interval in part (a). 

(c) A 90% lower-confidence bound on o; /o;. 


10-51. Consider the foam data in Exercise 10-18. Construct 

the following: 

(a) A 9096 two-sided confidence interval on 07/03. 

(b) A 95% two-sided confidence interval on 04/03. Comment 
on the comparison of the width of this interval with the 
width of the interval in part (a). 

(c) A 90% lower-confidence bound on o; /o;. 


10-52. Consider the film speed data in Exercise 10-24. Test 
Hy: 07 = oj versus Hi: 01 # 05 using a = 0.02. 

10-53. Consider the gear impact strength data in Exercise 
10-22. Is there sufficient evidence to conclude that the vari- 


ance of impact strength is different for the two suppliers? 
Use a = 0.05. 


10-54. Consider the melting point data in Exercise 10-25. 
Do the sample data support a claim that both alloys have the 


same variance of melting point? Use a = 0.05 in reaching 
your conclusion. 


10-55. Exercise 10-28 presented measurements of plastic 
coating thickness at two different application temperatures. 
Test Hy: o? = o? against H4: 01 # 04 using a = 0.01. 
10-56. A study was performed to determine whether men 
and women differ in their repeatability in assembling compo- 
nents on printed circuit boards. Random samples of 25 men 
and 21 women were selected, and each subject assembled the 
units. The two sample standard deviations of assembly time 
were Siren = 0.98 minutes and Swomen = 1.02 minutes. Is there 
evidence to support the claim that men and women differ in 
repeatability for this assembly task? Use a = 0.02 and state 
any necessary assumptions about the underlying distribution 
of the data. 


10-57.  Reconsider the assembly repeatability experiment 
described in Exercise 10-56. Find a 9896 confidence interval 
on the ratio of the two variances. Provide an interpretation of 
the interval. 


10-58. Reconsider the film speed experiment in Exercise 
10-24. Suppose that one population standard deviation is 50% 
larger than the other. Is the sample size n, = n, = 8 adequate 
to detect this difference with high probability? Use a = 0.01 
in answering this question. 


10-59. Reconsider the overall distance data for golf balls in 
Exercise 10-31. Is there evidence to support the claim that the 
standard deviation of overall distance is the same for both 
brands of balls (use a = 0.05)? Explain how this question can 
be answered with a 95% confidence interval on 0,/0o;. 
10-60.  Reconsider the coefficient of restitution data in 
Exercise 10-32. Do the data suggest that the standard devia- 
tion is the same for both brands of drivers (use a = 0.05)? 
Explain how to answer this question with a confidence inter- 
val on o,/0o5. 


10-6 INFERENCE ON TWO POPULATION PROPORTIONS 


We now consider the case where there are two binomial parameters of interest, say, p, and p>, 
and we wish to draw inferences about these proportions. We will present large-sample 
hypothesis testing and confidence interval procedures based on the normal approximation to 


the binomial. 


10-6.1 Large-Sample Test for Ho: p, = p; 


Suppose that two independent random samples of sizes n; and n, are taken from two pop- 
ulations, and let X, and X; represent the number of observations that belong to the class of in- 
terest in samples 1 and 2, respectively. Furthermore, suppose that the normal approximation 
to the binomial is applied to each population, so the estimators of the population proportions 
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P, — Xj/n, and P, = X ,/m have approximate normal distributions. We are interested in 
testing the hypotheses 

Hot pi po 

Al: p; * po 


The statistic 


= DES LIES 
py(1l E50) i pl — pə) 
ni n» 


(10-32) 





is distributed approximately as standard normal and is the basis of a test for Hp: p, = po. 
Specifically, if the null hypothesis Ho: p, = p> is true, using the fact that p, = p, = p, the 
random variable 


is distributed approximately N(0, 1). An estimator of the common parameter p is 


& Ay rao 
P = —— 
ny T n 
The test statistic for y: p, = p is then 
P, — P, 


Null hypothesis: Hg: Pj = p; 





Test statistic: 


Alternative Hypotheses 


(gle jon a= D> 
IDE P 
inp p> 





Rejection Criterion 
Zy = Za OF 2g s —Zum 
i m 


Ey e ccm 


Q 


EXAMPLE 10-14 
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Extracts of St. John's Wort are widely used to treat depression. An article in the April 18, 2001 
issue of the Journal of the American Medical Association ("Effectiveness of St. John's Wort 
on Major Depression: A Randomized Controlled Trial") compared the efficacy of a standard 
extract of St. John's Wort with a placebo in 200 outpatients diagnosed with major depression. 
Patients were randomly assigned to two groups; one group received the St. John's Wort, and 
the other received the placebo. After eight weeks, 19 of the placebo-treated patients showed 
improvement, whereas 27 of those treated with St. John's Wort improved. Is there any reason 
to believe that St. John's Wort is effective in treating major depression? Use a = 0.05. 

The eight-step hypothesis testing procedure leads to the following results: 


1. 


Oa cop 


The parameters of interest are p, and p,, the proportion of patients who improve 
following treatment with St. John's Wort (p,) or the placebo ( p;). 


Ho: pi = Pr 
Hy pi # p; 
a = 0.05 


The test statistic is 
Zn = — 
a oe 
PA= Pa t+ a 
where p, = 27/100 = 0.27, b; = 19/100 = 0.19, n, = m, = 100, and 


x tx,  19+27 





p — = ———— = 0.23 
Pin tm 100 + 100 
Reject Hg: pi — Pp» if zo b Z0.025 — 1.96 or if zo < — 20.025 — — 1.96. 
Computations: The value of the test statistic 1s 
0.27 — 0.19 
Zg = = 1.35 


1 l 
OO co aan 


Conclusions: Since zy = 1.35 does not exceed zo 95, we cannot reject the null hy- 
pothesis. Note that the P-value is P = 0.177. There is insufficient evidence to 
support the claim that St. John’s Wort is effective in treating major depression. 


The following box shows the Minitab two-sample hypothesis test and CI procedure for 
proportions. Notice that the 95% CI on p, — p; includes zero. The equation for constructing 
the CI will be given in Section 10-6.4. 


Test and CI for Two Proportions 


sample X N sample p 
l 20 100 0.270000 
2 19 100 0.190000 


Estimate for p(1) — p(2): 0.08 
95% CI for p(1) — p(2): (—0.0361186, 0.196119) 
Test for p(1) — p(2) = 0 (vs not = 0): Z = 1.35 P-Value = 0.177 
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10-6.2 Small-Sample Test for Ho: pı = p; (CD Only) 

10-6.3  Q-Error and Choice of Sample Size 
The computation of the B-error for the large-sample test of Hp: p, = p; is somewhat more 
involved than in the single-sample case. The problem is that the denominator of the test 


statistic Z, is an estimate of the standard deviation of P, = P, under the assumption that p, — 
D» = p. When Hy: p, = p; is false, the standard deviation of P, — P» is 


pil = JP pxX1 — pə) 
OB-Pp,- — 0n RS n — (10-34) 


If the alternative hypothesis is two sided, the Q-error is 


2 Zuj2 V pq(1/ni + 1/n;) — (pi --| 


OPP, 


oa See ee 


Op p, 





where 


nip, + mp ni(1 — pi) + nY(1 — po) 


p= and q= 
P ny + n, d Ny + n, 








and of — p, is given by Equation 10-34. 


If the alternative hypothesis is H,: p, > Po, 


- o [^ v pq(1l/n, + 1/nz) — (pi =P) (10-36) 


eae 


and if the alternative hypothesis is H,: p, < po, 


(10-37) 


gig] ees el 


OPER 





For a specified pair of values p, and p;, we can find the sample sizes n, = n, = n required to 
give the test of size a that has specified type II error p. 


10-6 INFERENCE ON TWO POPULATION PROPORTIONS 365 


For the two-sided alternative, the common sample size is 








[zuy V(pi + polar + 92)/2 + zg Vpigi + Pay 





i a ud 


where q; = 1 — p, and q, = 1 — p». 





For a one-sided alternative, replace z, 7 in Equation 10-38 by z,. 


10-6.4 Confidence Interval for p, — p; 


The confidence interval for p, — p, can be found directly, since we know that 


P, — Ê, (pi — po) 
py(1 — pi) a p2(1 — p3) 
ni n» 


is a standard normal random variable. Thus P(—z,j; = Z S Zy/2) = 1 — a, so we can substi- 
tute for Z in this last expression and use an approach similar to the one employed previously 
to find an approximate 100(1 — a)% two-sided confidence interval for p, — p>. 


Definition 
If p, and p, are the sample proportions of observation in two independent random 
samples of sizes 7, and n, that belong to a class of interest, an approximate two- 
sided 100(1 — a)™% confidence interval on the difference in the true proportions 


pi — piis 


Dies Po — 20/2 


i " hi1 — pj) pol — po) 
pu UE E ee EE EE (10-39) 


where z,/ is the upper a/2 percentage point of the standard normal distribution. 





EXAMPLE 10-15 | Consider the process manufacturing crankshaft bearings described in Example 8-6. 
Suppose that a modification is made in the surface finishing process and that, subse- 
quently, a second random sample of 85 axle shafts 1s obtained. The number of defective 
shafts in this second sample is 8. Therefore, since n, = 85, p, = 0.12, n; = 85, and 
Do = 8/85 = 0.09, we can obtain an approximate 95% confidence interval on the 
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difference in the proportion of defective bearings produced under the two processes from 


Equation 10-39 as follows: 








A Pill — pi) | Po — £2) 
Pı — P2 — 20.025 ni "s nj 
mE Pil — P) , PX1 = £5) 
= Pi pb ~ Pat Z0005\) m ' n 
or 
112 0.00 — roc [)-12(088) | 0.090.917 
l l l 85 85 


<p, — p € 0.12 — 0.09 + 1.96 


This simplifies to 


0.12(0.88) | 0.09(0.91) 
85 85 


This confidence interval includes zero, so, based on the sample data, 1t seems unlikely that the 
changes made in the surface finish process have reduced the proportion of defective crank- 


shaft bearings being produced. 


EXERCISES FOR SECTION 10-6 


10-61. Two different types of injection-molding machines 
are used to form plastic parts. A part is considered defective 
if it has excessive shrinkage or is discolored. Two random 
samples, each of size 300, are selected, and 15 defective 
parts are found in the sample from machine 1 while 8 defec- 
tive parts are found in the sample from machine 2. Is it rea- 
sonable to conclude that both machines produce the same 
fraction of defective parts, using a = 0.05? Find the P-value 
for this test. 


10-62. Two different types of polishing solution are being 
evaluated for possible use in a tumble-polish operation for 
manufacturing interocular lenses used in the human eye fol- 
lowing cataract surgery. Three hundred lenses were tumble- 
polished using the first polishing solution, and of this number 
253 had no polishing-induced defects. Another 300 lenses 
were tumble-polished using the second polishing solution, and 
196 lenses were satisfactory upon completion. Is there any 
reason to believe that the two polishing solutions differ? Use 
a = 0.01. Discuss how this question could be answered with a 
confidence interval on p, — p». 

10-63. Consider the situation described in Exercise 10-61. 
Suppose that p, = 0.05 and p, = 0.01. 

(a) With the sample sizes given here, what is the power of the 

test for this two-sided alternate? 


(b) Determine the sample size needed to detect this difference 
with a probability of at least 0.9. Use a = 0.05. 


10-64. Consider the situation described in Exercise 10-61. 

Suppose that p, = 0.05 and p, = 0.02. 

(a) With the sample sizes given here, what is the power of the 
test for this two-sided alternate? 

(b) Determine the sample size needed to detect this difference 
with a probability of at least 0.9. Use a = 0.05. 


10-65. A random sample of 500 adult residents of Maricopa 
County found that 385 were in favor of increasing the high- 
way speed limit to 75 mph, while another sample of 400 adult 
residents of Pima County found that 267 were in favor of the 
increased speed limit. Do these data indicate that there is a dif- 
ference in the support for increasing the speed limit between 
the residents of the two counties? Use a = 0.05. What is the 
P-value for this test? 

10-66. Construct a 95% confidence interval on the differ- 
ence in the two fractions defective for Exercise 10-61. 

10-67. Construct a 95% confidence interval on the differ- 
ence in the two proportions for Exercise 10-65. Provide a 
practical interpretation of this interval. 
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10-7 SUMMARY TABLE FOR INFERENCE PROCEDURES 


FOR TWO SAMPLES 


The table in the end papers of the book summarizes all of the two-sample inference procedures 
given in this chapter. The table contains the null hypothesis statements, the test statistics, the 
criteria for rejection of the various alternative hypotheses, and the formulas for constructing the 


100(1 — a)% confidence intervals. 


Supplemental Exercises 


10-68. A procurement specialist has purchased 25 resistors 
from vendor | and 35 resistors from vendor 2. Each resistor’s 
resistance is measured with the following results: 


Vendor 1 
96.8 100.0 100.3 98.5 98.3 98.2 
99.6 99.4 99.9 101.1 103.7 97.7 
99.7 101.1 DT 98.6 101.9 101.0 
99.4 99.8 99.1 99.6 101.2 98.2 
98.6 

Vendor 2 
106.8 106.8 104.7 104.7 108.0 102.2 
103.2 103.7 106.8 105.1 104.0 106.2 
102.6 100.3 104.0 107.0 104.3 105.8 
104.0 106.3 102.2 102.8 104.2 103.4 
104.6 103.5 106.3 109.2 107.2 105.4 
106.4 106.8 104.1 107.1 107.7 


(a) What distributional assumption is needed to test the claim 
that the variance of resistance of product from vendor 1 is 
not significantly different from the variance of resistance 
of product from vendor 2? Perform a graphical procedure 
to check this assumption. 

(b) Perform an appropriate statistical hypothesis-testing pro- 
cedure to determine whether the procurement specialist 
can claim that the variance of resistance of product from 
vendor 1 is significantly different from the variance of re- 
sistance of product from vendor 2. 


10-69. An article in the Journal of Materials Engineering 
(1989, Vol. 11, No. 4, pp. 275—282) reported the results of an 
experiment to determine failure mechanisms for plasma- 
sprayed thermal barrier coatings. The failure stress for one 
particular coating (NiCrAIZr) under two different test condi- 
tions is as follows: 


Failure stress (X 10° Pa) after nine 1-hour cycles: 19.8, 
18.5, 17.6, 16.7, 16.7, 14.8, 15.4, 14.1, 13.6 


Failure stress (X 10° Pa) after six 1-hour cycles: 14.9, 
12.7, 11.9, 11.4, 10.1, 7.9 


(a) What assumptions are needed to construct confidence in- 
tervals for the difference in mean failure stress under the 


two different test conditions? Use normal probability plots 
of the data to check these assumptions. 

(b) Find a 99% confidence interval on the difference in mean 
failure stress under the two different test conditions. 

(c) Using the confidence interval constructed in part (b), does 
the evidence support the claim that the first test conditions 
yield higher results, on the average, than the second? 
Explain your answer. 


10-70. Consider Supplemental Exercise 10-69. 

(a) Construct a 95% confidence interval on the ratio of the 
variances, 0/0», of failure stress under the two different 
test conditions. 

(b) Use your answer in part (b) to determine whether there is 
a significant difference in variances of the two different 
test conditions. Explain your answer. 


10-71. A liquid dietary product implies in its advertising 
that use of the product for one month results in an average 
weight loss of at least 3 pounds. Eight subjects use the product 
for one month, and the resulting weight loss data are reported 
below. Use hypothesis-testing procedures to answer the fol- 
lowing questions. 


Initial Final 
Subject Weight (Ib) Weight (Ib) 
l 165 161 
2 201 195 
3 195 192 
4 198 193 
5 155 150 
6 143 141 
7 150 146 
8 187 183 


(a) Do the data support the claim of the producer of the dietary 
product with the probability of a type I error set to 0.05? 

(b) Do the data support the claim of the producer of the dietary 
product with the probability of a type I error set to 0.01? 

(c) In an effort to improve sales, the producer is considering 
changing its claim from “at least 3 pounds” to “at least 5 
pounds.” Repeat parts (a) and (b) to test this new claim. 


10-72. The breaking strength of yarn supplied by two man- 
ufacturers is being investigated. We know from experience 
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with the manufacturers’ processes that o, = 5 psi and o; = 

4 psi. À random sample of 20 test specimens from each manu- 

facturer results in x, = 88 psi and x; = 91 psi, respectively. 

(a) Using a 90% confidence interval on the difference in 
mean breaking strength, comment on whether or not there 
is evidence to support the claim that manufacturer 2 pro- 
duces yarn with higher mean breaking strength. 

(b) Using a 98% confidence interval on the difference in mean 
breaking strength, comment on whether or not there is ev- 
idence to support the claim that manufacturer 2 produces 
yarn with higher mean breaking strength. 

(c) Comment on why the results from parts (a) and (b) are dif- 
ferent or the same. Which would you choose to make your 
decision and why? 

10-75. The Salk polio vaccine experiment in 1954 focused 
on the effectiveness of the vaccine in combatting paralytic 
polio. Because it was felt that without a control group of 
children there would be no sound basis for evaluating the 
efficacy of the Salk vaccine, the vaccine was administered to 
one group, and a placebo (visually identical to the vaccine 
but known to have no effect) was administered to a second 
group. For ethical reasons, and because it was suspected that 
knowledge of vaccine administration would affect subse- 
quent diagnoses, the experiment was conducted in a double- 
blind fashion. That is, neither the subjects nor the 
administrators knew who received the vaccine and who 
received the placebo. The actual data for this experiment are 
as follows: 


Placebo group: n = 201,299: 110 cases of polio observed 
Vaccine group: n = 200,745: 33 cases of polio observed 


(a) Use a hypothesis-testing procedure to determine if the 
proportion of children in the two groups who contracted 
paralytic polio is statistically different. Use a probability 
of a type I error equal to 0.05. 

(b) Repeat part (a) using a probability of a type I error equal 
to 0.01. 

(c) Compare your conclusions from parts (a) and (b) and ex- 
plain why they are the same or different. 


10-74. Consider Supplemental Exercise 10-72. Suppose that 

prior to collecting the data, you decide that you want the error in 

estimating u; — Ww by x, — x; to be less than 1.5 psi. Specify the 

sample size for the following percentage confidence: 

(a) 90% 

(b) 98% 

(c) Comment on the effect of increasing the percentage confi- 
dence on the sample size needed. 

(d) Repeat parts (a)-(c) with an error of less than 0.75 psi 
instead of 1.5 psi. 

(e) Comment on the effect of decreasing the error on the sam- 
ple size needed. 


10-75. A random sample of 1500 residential telephones in 
Phoenix in 1990 found that 387 of the numbers were unlisted. 
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A random sample in the same year of 1200 telephones in 

Scottsdale found that 310 were unlisted. 

(a) Find a 95% confidence interval on the difference in the 
two proportions and use this confidence interval to deter- 
mine if there is a statistically significant difference in 
proportions of unlisted numbers between the two cities. 

(b) Find a 90% confidence interval on the difference in the 
two proportions and use this confidence interval to deter- 
mine if there is a statistically significant difference in pro- 
portions of unlisted numbers between the two cities. 

(c) Suppose that all the numbers in the problem description 
were doubled. That is, 774 residents out of 3000 sampled 
in Phoenix and 620 residents out of 2400 in Scottsdale had 
unlisted phone numbers. Repeat parts (a) and (b) and 
comment on the effect of increasing the sample size with- 
out changing the proportions on your results. 


10-76. In a random sample of 200 Phoenix residents who 

drive a domestic car, 165 reported wearing their seat belt regu- 

larly, while another sample of 250 Phoenix residents who drive 

a foreign car revealed 198 who regularly wore their seat belt. 

(a) Perform a hypothesis-testing procedure to determine if 
there is a statistically significant difference in seat belt us- 
age between domestic and foreign car drivers. Set your 
probability of a type I error to 0.05. 

(b) Perform a hypothesis-testing procedure to determine if 
there is a statistically significant difference in seat belt 
usage between domestic and foreign car drivers. Set your 
probability of a type I error to 0.1. 

(c) Compare your answers for parts (a) and (b) and explain 
why they are the same or different. 

(d) Suppose that all the numbers in the problem description 
were doubled. That is, in a random sample of 400 
Phoenix residents who drive a domestic car, 330 re- 
ported wearing their seat belt regularly, while another 
sample of 500 Phoenix residents who drive a foreign car 
revealed 396 who regularly wore their seat belt. Repeat 
parts (a) and (b) and comment on the effect of increasing 
the sample size without changing the proportions on 
your results. 


10-77. Consider the previous exercise, which summarized 

data collected from drivers about their seat belt usage. 

(a) Do you think there is a reason not to believe these data? 
Explain your answer. 

(b) Is it reasonable to use the hypothesis-testing results from 
the previous problem to draw an inference about the dif- 
ference in proportion of seat belt usage 

(1) of the spouses of these drivers of domestic and foreign 
cars? Explain your answer. 
(11) ofthe children of these drivers of domestic and foreign 
cars? Explain your answer. 
(iii) of all drivers of domestic and foreign cars? Explain 
your answer. 
(iv) of all drivers of domestic and foreign trucks? Explain 
your answer. 
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10-78. Consider the situation described in Exercise 10-62. 

(a) Redefine the parameters of interest to be the proportion of 
lenses that are unsatisfactory following tumble polishing 
with polishing fluids 1 or 2. Test the hypothesis that the two 
polishing solutions give different results using a = 0.01. 

(b) Compare your answer in part (a) with that for Exercise 10- 
62. Explain why they are the same or different. 


10-79. Consider the situation of Exercise 10-62, and recall 
that the hypotheses of interest are Hp: p, = p; versus H4: p, # p;. 
We wish to use a = 0.01. Suppose that if p, = 0.9 and p, = 0.6, 
we wish to detect this with a high probability, say, at least 0.9. 
What sample sizes are required to meet this objective? 


10-80. A manufacturer of a new pain relief tablet would 
like to demonstrate that its product works twice as fast as the 
competitor's product. Specifically, the manufacturer would 
like to test 


Ho: pı = 2p 
Hi: ga > 2p 


where u; is the mean absorption time of the competitive prod- 
uct and p, is the mean absorption time of the new product. 
Assuming that the variances o^ and o2 are known, develop a 
procedure for testing this hypothesis. 


10-81. Suppose that we are testing Hp: jj, = p; versus H}: 
I4 # p», and we plan to use equal sample sizes from the two 
populations. Both populations are assumed to be normal with 
unknown but equal variances. If we use a = 0.05 and if the 
true mean p, = p» + c, what sample size must be used for the 
power of this test to be at least 0.90? 


10-82. Consider the fire-fighting foam expanding agents 

investigated in Exercise 10-18, in which five observations of 

each agent were recorded. Suppose that, if agent 1 produces a 

mean expansion that differs from the mean expansion of agent 

1 by 1.5, we would like to reject the null hypothesis with prob- 

ability at least 0.95. 

(a) What sample size is required? 

(b) Do you think that the original sample size in Exercise 
10-18 was appropriate to detect this difference? Explain 
your answer. 


10-83. A fuel-economy study was conducted for two German 
automobiles, Mercedes and Volkswagen. One vehicle of each 
brand was selected, and the mileage performance was observed 
for 10 tanks of fuel in each car. The data are as follows (in miles 
per gallon): 


Mercedes Volkswagen 
24.7 24.9 41.7 42.8 
24.8 24.6 42.3 42.4 
24.9 23.9 41.6 39.9 
24.7 24.9 21 40.8 
24.5 24.8 41.9 29.6 
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(a) Construct a normal probability plot of each of the data 
sets. Based on these plots, is it reasonable to assume that 
they are each drawn from a normal population? 

(b) Suppose that it was determined that the lowest observa- 
tion of the Mercedes data was erroneously recorded and 
should be 24.6. Furthermore, the lowest observation of the 
Volkswagen data was also mistaken and should be 39.6. 
Again construct normal probability plots of each of the 
data sets with the corrected values. Based on these new 
plots, is it reasonable to assume that they are each drawn 
from a normal population? 

(c) Compare your answers from parts (a) and (b) and com- 
ment on the effect of these mistaken observations on the 
normality assumption. 

(d) Using the corrected data from part (b) and a 95% confi- 
dence interval, is there evidence to support the claim that 
the variability in mileage performance is greater for a 
Volkswagen than for a Mercedes? 


10-84. Reconsider the fuel-economy study in Supplemental 
Exercise 10-83. Rework part (d) of this problem using an ap- 
propriate hypothesis-testing procedure. Did you get the same 
answer as you did originally? Why? 


10-85. An experiment was conducted to compare the filling 
capability of packaging equipment at two different wineries. 
Ten bottles of pinot noir from Ridgecrest Vineyards were ran- 
domly selected and measured, along with 10 bottles of pinot 
noir from Valley View Vineyards. The data are as follows (fill 
volume is in milliliters): 


Ridgecrest Valley View 
T33 TƏL 732 139 756 754 757 756 
753 753 753 754 755 756 756 755 
Tod FS 755 756 


(a) What assumptions are necessary to perform a hypothesis- 
testing procedure for equality of means of these data? 
Check these assumptions. 

(b) Perform the appropriate hypothesis-testing procedure to 
determine whether the data support the claim that both 
wineries will fill bottles to the same mean volume. 


10-86. Consider Supplemental Exercise 10-85. Suppose 
that the true difference in mean fill volume is as much as 2 
fluid ounces; did the sample sizes of 10 from each vineyard 
provide good detection capability when a = 0.05? Explain 
your answer. 


10-87. A Rockwell hardness-testing machine presses a tip 
into a test coupon and uses the depth of the resulting depres- 
sion to indicate hardness. Two different tips are being com- 
pared to determine whether they provide the same Rockwell 
C-scale hardness readings. Nine coupons are tested, with both 
tips being tested on each coupon. The data are shown in the 
accompanying table. 
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Coupon  Tipl  Tip2 Coupon Tip! Tip? 
l 47 46 6 41 41 
2 42 40 7 45 46 
3 43 45 8 45 46 
4 40 41 9 49 48 
5 42 43 


(a) State any assumptions necessary to test the claim that both 
tips produce the same Rockwell C-scale hardness readings. 
Check those assumptions for which you have the information. 

(b) Apply an appropriate statistical method to determine if the 
data support the claim that the difference in Rockwell 
C-scale hardness readings of the two tips is significantly 
different from zero 

(c) Suppose that if the two tips differ in mean hardness read- 
ings by as much as 1.0, we want the power of the test to be 
at least 0.9. For an a = 0.01, how many coupons should 
have been used in the test? 

10-88. Two different gauges can be used to measure the depth 

of bath material in a Hall cell used in smelting aluminum. Each 

gauge is used once in 15 cells by the same operator. 


Cell Gaugel  Gauge2 Cell Gaugel  Gauge2 

1 46 in. 47 in. 9 52 51 
2 50 53 10 47 45 
3 47 45 11 49 51 
4 53 50 12 45 45 
5 49 51 13 47 49 
6 48 48 14 46 43 
T 53 54 15 50 51 
8 56 53 
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(a) State any assumptions necessary to test the claim that 
both gauges produce the same mean bath depth read- 
ings. Check those assumptions for which you have the 
information. 

(b) Apply an appropriate statistical procedure to determine 1f 
the data support the claim that the two gauges produce dif- 
ferent mean bath depth readings. 

(c) Suppose that if the two gauges differ in mean bath depth 
readings by as much as 1.65 inch, we want the power of 
the test to be at least 0.8. For a = 0.01, how many cells 
should have been used? 


10-89. An article in the Journal of the Environmental 
Engineering Division (“Distribution of Toxic Substances in 
Rivers,” 1982, Vol. 108, pp. 639—649) investigates the con- 
centration of several hydrophobic organic substances in the 
Wolf River in Tennessee. Measurements on hexachloroben- 
zene (HCB) in nanograms per liter were taken at different 
depth downstream of an abandoned dump site. Data for two 
depths follow: 

Surface: 3.74, 4.61, 4.00, 4.67, 4.87, 5.12, 4.52, 5.29, 5.74, 5.48 
Bottom: 5.44, 6.88, 5.37, 5.44, 5.03, 6.48, 3.89, 5.85, 6.85, 7.16 


(a) What assumptions are required to test the claim that 
mean HCB concentration is the same at both depths? 
Check those assumptions for which you have the infor- 
mation. 

(b) Apply an appropriate procedure to determine if the data 
support the claim in part a. 

(c) Suppose that the true difference in mean concentrations is 
2.0 nanograms per liter. For a = 0.05, what is the power 
of a statistical test for Hg: jj, = p, versus Hi: w Æ po? 

(d) What sample size would be required to detect a difference 
of 1.0 nanograms per liter at a = 0.05 if the power must 
be at least 0.9? 


MIND-EXPANDING EXERCISES 


10-90. Three different pesticides can be used to control 
infestation of grapes. It is suspected that pesticide 3 is 
more effective than the other two. In a particular vineyard, 
three different plantings of pinot noir grapes are selected 
for study. The following results on yield are obtained: 


Xj ni 
(Bushels/ (Number of 
Pesticide Plant) Sj Plants) 
l 4.6 0.7 100 
D p 0.6 120 


3 6.1 0.8 130 


If p, is the true mean yield after treatment with the ith 
pesticide, we are interested in the quantity 


] 
B= 5 (Mi + po) = Bs 


which measures the difference in mean yields between 

pesticides 1 and 2 and pesticide 3. If the sample sizes n; 

are large, the estimator (say, Ñ) obtained by replacing 

each individual u; by X; is approximately normal. 

(a) Find an approximate 100(1 — a)% large-sample 
confidence interval for p. 
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MIND-EXPANDING EXERCISES 


(b) Do these data support the claim that pesticide 3 is 
more effective than the other two? Use a = 0.05 in 
determining your answer. 


10-91. Suppose that we wish to test Hp: u; = p2 
versus H,: u; Æ po, where o? and o$ are known. The 
total sample size N is to be determined, and the alloca- 
tion of observations to the two populations such that 
n, t n, = Nis to be made on the basis of cost. If the 
cost of sampling for populations 1 and 2 are C, and C, 
respectively, find the minimum cost sample sizes that 
provide a specified variance for the difference in sam- 
ple means. 


10-92. Suppose that we wish to test the hypothesis Hj: 
u; = w versus Hi: p; Æ p, where both variances o7 and 
02 are known. A total of n, + n; = N observations can be 
taken. How should these observations be allocated to the 
two populations to maximize the probability that H will 
be rejected if H, is true and u; — pj, = A # 0? 

10-93. Suppose that we wish to test Hp: |. = [Wo ver- 
sus H,: u # po, where the population is normal with 
known c. Let 0 < e < a, and define the critical region 
so that we will reject Hy iE zo~ z2 O ifa a 
where z, is the value of the usual test statistic for these 
hypotheses. 


IMPORTANT TERMS AND CONCEPTS 


Null and alternative 
hypotheses 

One-sided and two- 
sided alternative 


In the E-book, click on any 
term or concept below to 
go to that subject. 

Comparative experi- 


ments 

Critical region for a test 
statistic 

Identifying cause and 
effect 


hypotheses 
Operating characteristic 
curves 
Paired t-test 
Pooled t-test 


(a) Show that the probability of type I error for this test 
is Q. 

(b) Suppose that the true mean is p, = p + A. Derive 
an expression for B for the above test. 


10-94. Construct a data set for which the paired t-test 
statistic is very large, indicating that when this analysis 
is used the two population means are different, but tọ for 
the two-sample t-test is very small so that the incorrect 
analysis would indicate that there is no significant dif- 
ference between the means. 


10-95. In some situations involving proportions, we 

are interested in the ratio 0 = p,/p, rather than the differ- 

ence p, — p>. Let 6 = p,/ p>. We can show that In(Ó) has 

an approximate normal distribution with the mean (1/0) 

and variance [(n — xiV(nqxi) + (n5 — x2)/(nx2) |". 

(a) Use the information above to derive a large-sample 
confidence interval for In 0. 

(b) Show how to find a large-sample CI for 0. 

(c) Use the data from the St. John's Wort study in 
Example 10-14, and find a 95% CI on 0 = p,/p». 
Provide a practical interpretation for this CI. 

10-96. Derive an expression for B for the test of the 

equality of the variances of two normal distributions. 

Assume that the two-sided alternative is specified. 


P-value CD MATERIAL 
Reference distribution Fisher-Irwin test on two 
for a test statistic proportions 


Sample size determina- 
tion for hypothesis 
tests and confidence 
intervals 

Statistical hypotheses 

Test statistic 


10-3.2 More About the Equal Variance Assumption (CD Only) 


In practice, one often has to choose between case 1 and case 2 of the two-sample t-test. In case 
1, we assume that oj = 05 and use the pooled t-test. On the surface this test would seem to have 
some advantages. It is a likelihood ratio test, whereas the case 2 test with o? # 05 is not. 
Furthermore, it is an exact test (if the assumptions of normality, independence, and equal vari- 
ances are correct), whereas the case 2 test is an approximate procedure. However, the pooled 
t-test can be very sensitive to the assumption of equal variances, especially when the sample sizes 
are not equal. To help see this, consider the denominator of the test statistic for the pooled t-test: 


S b te (m -)Si*(5-U0S5mtm | S1 S5 
PNnm on. ny + Ny — 2 ni Nn, n 


Because the variances are divided (approximately) by the wrong sample sizes, use of the 
pooled t-test when the variances are unequal and when n, # m can lead to very frequent er- 
roneous conclusions. This is why using n; = m, is a good idea in general, and especially when 
we are in doubt about the validity of the equal variance assumption. 

It would, of course, be possible to perform a test of Hj: o? = o3 versus Hi: 01 # 0% and 
then use the pooled t-test if the null hypothesis is not rejected. This test is discussed in Section 10- 
5. However, the test on variances is much more sensitive to the normality assumption than are 
t-tests. A conservative approach would be to always use the case 2 procedure. Alternatively, one 
can use the normal probability plot both as a check of the normality assumption and as a check for 
equality of variance. If there is a noticable difference in the slopes of the two straight lines on the 
normal probability plot, the case 2 procedure would be preferred, especially when n, # np. 


10-5.2 Development of the F Distribution (CD Only) 


Theorem: The 
F-Distribution 





We now give a formal development of the F distribution. The development makes use of the 
material in Section 5-8 (CD Only). 


Let U, and U, be independent chi-square random variables with v, and v, degrees of 
freedom, respectively. Then the ratio 


E U;/vi 
U5f v) 
has the probability density function 


f(x) = ee ee 
OOO” 


Ol ay ace 


This is the F-distribution with v, degrees of freedom in the numerator and v, degrees 
of freedom in the denominator. 


10-1 


10-2 


Proof Since U, and U, are independent chi-square random variables, their joint probability 
distribution is 


vi/2-1 , v/2-1 
A ERTA 


f(u, u5) = c WT EX e ; 0 < uj, Uu < oo 
w/p [ | | 22T (2) 
( 2 2 


Using the method in Equation S5-4, define the new random variable M = U,. The inverse 
solutions of 


are 


u = y mx and u =m 


Therefore, the Jacobian is 


Vi Vi 
V5 V5 Vi 
J = RUE 
0 1 V2 


Thus, the joint probability density function of X and M is 


vi/2-1 
(= mx) m^ 1 e 4/2 [(v1/v2)mx +m] (=) m 
V2 
f(x, m) = 


DA a a 0 « x, m « o 
e 


The probability density function of F is 


vı vı/2—1 v] » 
v” v 


| gni e+] dy 
oit 2/27 (2) È (2) : 


- m [Vy Vi 25 
Substituting z = > (Fs F 1) and dm = 2 (2s T 1) dz, we obtain 
2 v5 


/2 
(y y x (vj v92-1 
L^ 


2) ex] 
joy2 (ne eae) 
Vi V2 =i rz 
ESTE 0 Ny" Fl 
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zn *tvj/2-16-z dz 


PL ne x"/271 [o 
2 
T > DU > v> + 1 0 
r(* + 2C yl 
2 V2 
— ——————M—————————s = x= 05 


r vi r v2 vi - (vi * v3)/2? 
2 2AÀv^ 


which is the probability density function in the theorem on page 10-1. 





10-6.2. Small-Sample Test for Hp: p, = p; (CD Only) 


Many problems involving the comparison of proportions p, and p; have relatively large sam- 
ple sizes, so the procedure based on the normal approximation to the binomial is widely used 
in practice. However, occasionally, a small-sample-size problem is encountered. In such 
cases, the Z-tests are inappropriate and an alternative procedure is required. In this section we 
describe a procedure based on the hypergeometric distribution. 

Suppose that X, and X, are the number of successes in two random samples of size n, and 
n», respectively. The test procedure requires that we view the total number of successes as 
fixed at the value X, + X; = Y. Now consider the hypotheses 


Ho: pi = p» 
Hy pi > p; 


Given that X, + X; = Y, large values of X, support H,, and small moderate values of X, sup- 
port Ho. Therefore, we will reject Hy whenever X, is sufficiently large. 

Since the combined sample of n, + n, observations contains X, + X, = Y total suc- 
cesses, if Hp: p, = p; the successes are no more likely to be concentrated in the first sample 
than in the second. That is, all the ways in which the n} + n, responses can be divided into one 
sample of n; responses and a second sample of n, responses are equally likely. The number of 
ways of selecting X, successes for the first sample leaving Y —_X, successes for the second 1s 


(C + Ny — ó 

A1 n — X 

Because outcomes are equally likely, the probability of exactly X, successes in sample 1 1s the 
ratio of the number of sample 1 outcomes having X, successes to the total number of outcomes, or 


Cy" zz Ny — Jj 
X1 ny ~ Xx 
P(X, = x| Y success in n, + m responses) = — —— — —Á—— — (S10-1) 
(^ T x 
n 
given that Hp: p, = p> is true. We recognize Equation S10-1 as a hypergeometric distribution. 


To use Equation S10-1 for hypothesis testing, we would compute the probability of find- 
ing a value of X, at least as extreme as the observed value of X,. Note that this probability is a 


10-4 


EXAMPLE $10-1 


P-value. If this P-value is sufficiently small, the null hypothesis is rejected. This approach 
could also be applied to lower-tailed and two-tailed alternatives. 


Insulating cloth used in printed circuit boards is manufactured in large rolls. The manufacturer 
is trying to improve the process yield, that 1s, the number of defect-free rolls produced. A sam- 
ple of 10 rolls contains exactly 4 defect-free rolls. From analysis of the defect types, process 
engineers suggest several changes in the process. Following implementation of these changes, 
another sample of 10 rolls yields 8 defect-free rolls. Do the data support the claim that the new 
process is better than the old one, using a = 0.10? 

To answer this question, we compute the P-value. In our example, n; ^ n; = 10, y = 
8 + 4 = 12, and the observed value of x, = 8. The values of x, that are more extreme than 8 
are 9 and 10. Therefore 


8 
P(X, = 8|12 successes) = —~——— = .0750 


9 
P(X, = 9|12 successes) = XO Dr .0095 
m 
Go) 
10/40 
P(X, = 10|12 successes) = = .0003 


The P-value is P = .0750 + .0095 + .0003 = .0848. Thus, at the level a = 0.10, the null 
hypothesis is rejected and we conclude that the engineering changes have improved the 
process yield. 


This test procedure is sometimes called the Fisher-Irwin test. Because the test depends 
on the assumption that X, + X; is fixed at some value, some statisticians argue against use of 
the test when X, + X; is not actually fixed. Clearly X, + X, is not fixed by the sampling pro- 
cedure in our example. However, because there are no other better competing procedures, the 
Fisher-Irwin test is often used whether or not X, + X, is actually fixed in advance. 
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11-6.2 Confidence Interval on the 
Mean Response 


PREDICTION OF NEW 
OBSERVATIONS 
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11-8.1 Residual Analysis 
11-8.2 Coefficient of Determination (R?) 
11-8.3 Lack-of-Fit Test (CD Only) 


TRANSFORMATIONS TO A 
SIRAIGHT LINE 


MORE ABOUT 
TRANSFORMATIONS (CD ONLY) 


CORRELATION 


After careful study of this chapter, you should be able to do the following: 


1. Use simple linear regression for building empirical models to engineering and scientific data 


2. Understand how the method of least squares is used to estimate the parameters in a linear 


regression model 


3. Analyze residuals to determine if the regression model is an adequate fit to the data or to see if 


any underlying assumptions are violated 


4. Test statistical hypotheses and construct confidence intervals on regression model parameters 


5. Use the regression model to make a prediction of a future observation and construct an 
appropriate prediction interval on the future observation 


6. Use simple transformations to achieve a linear regression model 


7. Apply the correlation model 
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CD MATERIAL 


8. Conduct a lack-of-fit test in a regression model where there are replicated observations. 


Answers for many odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e-Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e-Text immediately following the section they accompany. 


11-1 EMPIRICAL MODELS 


Many problems in engineering and science involve exploring the relationships between two 
or more variables. Regression analysis is a statistical technique that 1s very useful for these 
types of problems. For example, in a chemical process, suppose that the yield of the product 
is related to the process-operating temperature. Regression analysis can be used to build a 
model to predict yield at a given temperature level. This model can also be used for process 
optimization, such as finding the level of temperature that maximizes yield, or for process 
control purposes. 

As an illustration, consider the data in Table 11-1. In this table y is the purity of oxygen 
produced in a chemical distillation process, and x is the percentage of hydrocarbons that are 
present in the main condenser of the distillation unit. Figure 11-1 presents a scatter diagram 


Table 11-1 Oxygen and Hydrocarbon Levels 


Observation 
Number 


O © Nunca BW NY 


NO = = e= e ee ee e a a a 
O Omen Aa RA LUNE D 


Hydrocarbon Level Purity 
x (76) y (7o) 
0.99 90.01 
1.02 89.05 
1.15 91.43 
1.29 93.74 
1.46 96.73 : : 
1:36 94 45 "E : "E 3. o ee X uus 
0.87 87.59 e| 
1.23 91.77 98 
1.55 99.42 e * 
1.40 93.65 di 
1.19 93.54 BS 94 "E. . ec 
1.15 92.52 2 , «= 2 
0.98 9056 È 7% e ° t 
EU 89.54 90 C e " ecco 
1.11 89.85 "e . 
1.20 90.39 Se a " 
1.26 93.25 86 
1.32 93.4] 0.85 0.95 1.05 1.15 1.25 1.35 1.45 1.55 
1.43 94.98 Hydrocarbon level (x) 
0.95 87.33 Figure 11-1 Scatter diagram of oxygen purity versus hydrocarbon 


level from Table 11-1. 
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of the data in Table 11-1. This is just a graph on which each (x;, y;) pair is represented as a point 
plotted in a two-dimensional coordinate system. This scatter diagram was produced by 
Minitab, and we selected an option that shows dot diagrams of the x and y variables along the 
top and right margins of the graph, respectively, making it easy to see the distributions of the 
individual variables (box plots or histograms could also be selected). Inspection of this scatter 
diagram indicates that, although no simple curve will pass exactly through all the points, there 
is a strong indication that the points lie scattered randomly around a straight line. Therefore, it 
is probably reasonable to assume that the mean of the random variable Y is related to x by the 
following straight-line relationship: 


E(Y|x) = py\x = Bo + Bix 


where the slope and intercept of the line are called regression coefficients. While the mean 
of Y 1s a linear function of x, the actual observed value y does not fall exactly on a straight 
line. The appropriate way to generalize this to a probabilistic linear model is to assume 
that the expected value of Y is a linear function of x, but that for a fixed value of x the actual 
value of Y is determined by the mean value function (the linear model) plus a random error 
term, say, 


Y = Bo + Bix te (11-1) 


where e is the random error term. We will call this model the simple linear regression model, 
because it has only one independent variable or regressor. Sometimes a model like this will 
arise from a theoretical relationship. At other times, we will have no theoretical knowledge of 
the relationship between x and y, and the choice of the model is based on inspection of a scat- 
ter diagram, such as we did with the oxygen purity data. We then think of the regression model 
as an empirical model. 

To gain more insight into this model, suppose that we can fix the value of x and observe 
the value of the random variable Y. Now if x 1s fixed, the random component e on the right- 
hand side of the model in Equation 11-1 determines the properties of Y. Suppose that the mean 
and variance of e are 0 and o”, respectively. Then 


E(Y|x) = E(Bo + Bix + €) = Bo + Bix + Ele) = Bo + Bix 


Notice that this is the same relationship that we initially wrote down empirically from inspec- 
tion of the scatter diagram in Fig. 11-1. The variance of Y given x is 


V(Y|x) = V(Bo + Bix + €) = V(Bo + Bix) + Vie) = 0 + o° = o? 


Thus, the true regression model wyją = Bo + Bix is a line of mean values; that is, the height 
of the regression line at any value of x is just the expected value of Y for that x. The slope, B4, 
can be interpreted as the change in the mean of Y for a unit change in x. Furthermore, the vari- 
ability of Y at a particular value of x is determined by the error variance o°. This implies that 
there is a distribution of Y-values at each x and that the variance of this distribution is the same 
at each x. 

For example, suppose that the true regression model relating oxygen purity to hydrocarbon 
level is juy|, = 75 + 15x, and suppose that the variance is o? — 2. Figure 11-2 illustrates this 
situation. Notice that we have used a normal distribution to describe the random variation 
in e. Since Y is the sum of a constant Dg + ix (the mean) and a normally distributed ran- 
dom variable, Y is a normally distributed random variable. The variance o? determines the 
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y 
(Oxygen 
purity) 
Bo + By (1.25) Trùe regression line 
Uy|x =Bo+ Bix 
= 75 + 15x 
x = 100 x=1.25 «x (Hydrocarbon level) 


Figure 11-2 The distribution of Y for a given value of x for the 
oxygen purity-hydrocarbon data. 


variability in the observations Y on oxygen purity. Thus, when o? is small, the observed 
values of Y will fall close to the line, and when o? is large, the observed values of Y may de- 
viate considerably from the line. Because o? is constant, the variability in Y at any value of 
x is the same. 

The regression model describes the relationship between oxygen purity Y and hydrocar- 
bon level x. Thus, for any value of hydrocarbon level, oxygen purity has a normal distribution 
with mean 75 + 15x and variance 2. For example, if x = 1.25, Y has mean value py), = 75 + 
15(1.25) = 93.75 and variance 2. 

In most real-world problems, the values of the intercept and slope (Bo, 8,) and the error 
variance o? will not be known, and they must be estimated from sample data. Then this fitted 
regression equation or model is typically used in prediction of future observations of Y, or for 
estimating the mean response at a particular level of x. To illustrate, a chemical engineer might 
be interested in estimating the mean purity of oxygen produced when the hydrocarbon level is 
x = 1.25%. This chapter discusses such procedures and applications for the simple linear re- 
gression model. Chapter 12 will discuss multiple linear regression models that involve more 
than one regressor. 


11-2 SIMPLE LINEAR REGRESSION 


The case of simple linear regression considers a single regressor or predictor x and a de- 
pendent or response variable Y. Suppose that the true relationship between Y and x is a 
straight line and that the observation Y at each level of x is a random variable. As noted previ- 
ously, the expected value of Y for each value of x is 


E(Y|x) = Bo + Bix 


where the intercept Bo and the slope (9; are unknown regression coefficients. We assume that 
each observation, Y, can be described by the model 


Y = Bot Bix + e (11-2) 
where e is a random error with mean zero and (unknown) variance o°. The random errors 


corresponding to different observations are also assumed to be uncorrelated random 
variables. 
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Observed value 
Data (y) 


Estimated 
regression line 


Figure 11-3 Deviations of the data from the 
estimated regression model. 


Suppose that we have n pairs of observations (xi, y1), X2, y5), ... (Xp Yn). Figure 11-3 
shows a typical scatter plot of observed data and a candidate for the estimated regression line. 
The estimates of Bo and B; should result in a line that is (in some sense) a “best fit" to the data. 
The German scientist Karl Gauss (1777—1855) proposed estimating the parameters Bo and B, 
in Equation 11-2 to minimize the sum of the squares of the vertical deviations in Fig. 11-3. 

We call this criterion for estimating the regression coefficients the method of least 
squares. Using Equation 11-2, we may express the n observations in the sample as 


yi = Bo + Bix; + ej i= 1,2,...,n (11-3) 


and the sum of the squares of the deviations of the observations from the true regression line 
1S 


S 
S 


L = l e; — 3 (y; — Bo — RAN (11-4) 


The least squares estimators of By and B4, say, Bo and B. must satisfy 





oL is " " 

— = —2 pem — Bix) = 0 

9Bo là. 8. 2,0 Bo Di ) 

oL 2 ^ z 

ae = 2 ;— = B,x;)x; = 0 11-5 
AB, 8.8, 2,0 Bo By ) ( ) 





Simplifying these two equations yields 


| 
= 


nBo + Bi 2, X; = 
Bo 2, x; t Bi >, x; = > vx (11-6) 


Equations 11-6 are called the least squares normal equations. The solution to the normal 
equations results in the least squares estimators By and p. 


Definition 


EXAMPLE 11-1 
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The least squares estimates of the intercept and slope in the simple linear regression 
model are 


(RS) 


xicrc = (lV np = (ly) > e 


The fitted or estimated regression line is therefore 


^ ^ 


» = Bo + Bix (11:9) 
Note that each pair of observations satisfies the relationship 
y; = Bo + Bix; ^ e; [os bas 
where e; = y; — y; is called the residual. The residual describes the error in the fit of the 
model to the ith observation y;. Later in this chapter we will use the residuals to provide in- 
formation about the adequacy of the fitted model. 


Notationally, it is occasionally convenient to give special symbols to the numerator and 
denominator of Equation 11-8. Given data (xj, y), (x5, y5), ... , Xn Yn), let 


i = i=l 
Sx. = 2, T p Da = (11-10) 


and 


n n 
- =] 
Sey = >y B x) — >a I 
i-l i=l 


Ms 
= 
~|—= 
p X 
iMs= 
= 
pM 


(11-11) 


We will fit a simple linear regression model to the oxygen purity data in Table 11-1. The 
following quantities may be computed: 


20 20 
n=20 *x;—-2392 > y, = 1,843.21 x= 1.1960 y = 92.1605 
i=l i=l 
20 20 20 
X y$—170044.5321 X x? = 29.2892 V x;y; = 2,214.6566 
i=l i=l i=l 
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Figure 11-4 Scatter 
plot of oxygen 

purity y versus 
hydrocarbon level x 
and regression model 
p = 74.20 + 14.97x. 


C = 29.2892 — = 0.68088 
XX = 20 
and 
20 20 
SS $ ey 4 ) — 2,214.6566 E) LM M 10.17744 
xy 2, 20 = Ay . 20 = . 


Therefore, the least squares estimates of the slope and intercept are 


~ — Sxy 1017744 . ONUS 
Mp S. 0.68088 ` 








and 
Bo = y — Bux — 92.1605 — (14.94748)1.196 — 74.28331 


The fitted simple linear regression model (with the coefficients reported to three decimal places) is 


$ = 74283 + 14.947x 


This model is plotted in Fig. 11-4, along with the sample data. 

Computer software programs are widely used in regression modeling. These programs 
typically carry more decimal places in the calculations. Table 11-2 shows a portion of the out- 
put from Minitab for this problem. The estimates By and B, are highlighted. In subsequent sec- 
tions we will provide explanations for the information provided in this computer output. 


Using the regression model of Example 11-1, we would predict oxygen purity of ? = 
89.23% when the hydrocarbon level is x = 1.00%. The purity 89.23% may be interpreted as 


102 


99 


Oxygen purity y (%) 


90 





87 
0.87 1.07 1.27 1.47 1.67 


Hydrocarbon level (%) 
x 
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Table 11-2 Minitab Output for the Oxygen Purity Data in Example 11-1 


Regression Analysis 


The regression equation is 


Purity = 74.3 + 14.9 HC Level 


Predictor Coef SE Coef T P 
Constant 74.283 -«- B, 1.593 46.62 0.000 
HC Level 14.947 -«- B, 1.317 11.35 0.000 
S — 1.087 R-Sq = 87.7% R-Sq (adj) = 87.1% 


Analysis of Variance 


Source DF SS MS F P 
Regression l 152.13 152.13 128.86 0.000 
Residual Error 18 21.25 -«- SS; 1.18 467 

Total 19 173.38 


Predicted Values for New Observations 


New Obs Fit SE Fit 95.0% CI 95.0% PI 
l 59.251 0.354 (88.486, 89.975) (86.830, 91.632) 


Values of Predictors for New Observations 


New Obs HC Level 
1 1.00 


an estimate of the true population mean purity when x = 1.00%, or as an estimate of a new 
observation when x = 1.00%. These estimates are, of course, subject to error; that is, it is un- 
likely that a future observation on purity would be exactly 89.23% when the hydrocarbon 
level is 1.00%. In subsequent sections we will see how to use confidence intervals and pre- 
diction intervals to describe the error in estimation from a regression model. 


Estimating o° 

There is actually another unknown parameter in our regression model, o? (the variance of the 
error term e). The residuals e; = y; — f; are used to obtain an estimate of o°. The sum of 
squares of the residuals, often called the error sum of squares, is 


n 


SSe = Y e; = > (i = Hi) (11-12) 
i=] i=] 


We can show that the expected value of the error sum of squares is E(SS;) = (n — 2)o°. 
Therefore an unbiased estimator of o? is 


(11-13) 





Computing SS; using Equation 11-12 would be fairly tedious. A more convenient computing 
formula can be obtained by substituting y; = Bo + 4x; into Equation 11-12 and simplifying. 


380 CHAPTER 11 SIMPLE LINEAR REGRESSION AND CORRELATION 


The resulting computing formula is 


SSe = SSr — B4S,, 


(11-14) 





where SSp = Yj=1(9; — yy — 


Y ay; — ny” is the total sum of squares of the response 


variable y. The error sum of squares and the estimate of o? for the oxygen purity data, 
6^ — 1.18, are highlighted in the Minitab output in Table 11-2. 


EXERCISES FOR SECTION 11-2 


(a) Calculate the least squares estimates of the slope and in- 


11-1. An article in Concrete Research (“Near Surface 
Characteristics of Concrete: Intrinsic Permeability,’ Vol. 41, 
1989), presented data on compressive strength x and intrinsic 
permeability y of various concrete mixes and cures. Summary 
quantities are n = 14, Xy, = 572, Xy? = 23,530, Xx, = 43, 
Xx? = 157.42, and X x;y; = 1697.80. Assume that the two vari- 
ables are related according to the simple linear regression model. 
(a) Calculate the least squares estimates ofthe slope and intercept. 
(b) Use the equation of the fitted line to predict what perme- 
ability would be observed when the compressive strength 
is x — 4. 
(c) Give a point estimate of the mean permeability when 
compressive strength is x = 3.7. 
(d) Suppose that the observed value of permeability at x = 3.7 is 
y — 46.1. Calculate the value of the corresponding residual. 


11-2. Regression methods were used to analyze the data 
from a study investigating the relationship between roadway 
surface temperature (x) and pavement deflection ( y). Summary 
quantities were n = 20, Sy, = 12.75, Xy; = 8.86, Xx; = 
1478, Xx? = 143,215.8, and © x;y; = 1083.67. 


tercept. Graph the regression line. 

(b) Use the equation of the fitted line to predict what pave- 
ment deflection would be observed when the surface 
temperature is 85°F. 

(c) What is the mean pavement deflection when the surface 
temperature is 90°F? 

(d) What change in mean pavement deflection would be ex- 
pected for a 1°F change in surface temperature? 

11-3. 

11-2. 

(a) Suppose that temperature is measured in °C rather than °F. 
Write the new regression model that results. 

(b) What change in expected pavement deflection is associ- 
ated with a 1°C change in surface temperature? 

11-4. Montgomery, Peck, and Vining (2001) present data 

concerning the performance of the 28 National Football 


Consider the regression model developed in Exercise 


League teams in 1976. It is suspected that the number of games m 


won ( y) is related to the number of yards gained rushing by an 
opponent (x). The data are shown in the following table. 


Yards 


Yards 
Games Rushing by Games Rushing by 
Teams Won (y) Opponent (x) Teams Won (y) Opponent (x) 

Washington 10 2205 Detroit 6 1901 
Minnesota 11 2096 Green Bay 5 2288 
New England 11 1847 Houston 3 2072 
Oakland 13 1903 Kansas City 5 2861 
Pittsburgh 10 1457 Miami 6 2411 
Baltimore 11 1848 New Orleans 4 2289 
Los Angeles 10 1564 New York Giants 3 2203 
Dallas IB 1821 New York Jets 3 2592 
Atlanta 4 2517 Philadelphia 4 2053 
Buffalo 2476 St. Louis 10 1979 
Chicago 7 1984 San Diego 6 2048 
Cincinnati 10 1917 San Francisco 8 1786 
Cleveland 9 1761 Seattle 2 2876 
Denver 9 1709 Tampa Bay 0 2560 


SZ 


es 


= 


(a) Calculate the least squares estimates of the slope and inter- 
cept. What is the estimate of o^? Graph the regression model. 

(b) Find an estimate of the mean number of games won if the 
opponents can be limited to 1800 yards rushing. 

(c) What change in the expected number of games won is asso- 
ciated with a decrease of 100 yards rushing by an opponent? 

(d) To increase by 1 the mean number of games won, how much 
decrease in rushing yards must be generated by the defense? 

(e) Given that x = 1917 yards (Cincinnati), find the fitted 
value of y and the corresponding residual. 


11-5. An article in Technometrics by S. C. Narula and J. F. 

Wellington (“Prediction, Linear Regression, and a Minimum 

Sum of Relative Errors,’ Vol. 19, 1977) presents data on the 

selling price and annual taxes for 24 houses. The data are 

shown in the following table. 

(a) Assuming that a simple linear regression model is appro- 
priate, obtain the least squares fit relating selling price to 
taxes paid. What is the estimate of o°? 


Taxes 
Sale (Local, School), 
Price/1000 County)/1000 


Taxes 
Sale (Local, School), 
Price/1000 County)/1000 


25.9 4.9176 30.0 5.0500 
29.5 5.0208 36.9 8.2464 
27.9 4.5429 41.9 6.6969 
25.9 4.5573 40.5 7.7841 
29.9 5.0597 43.9 9.0384 
29.9 3.8910 37.5 5.9894 
30.9 5.8980 37.9 7.5422 
28.9 5.6039 44.5 8.7951 
35.9 5.8282 37.9 6.0831 
31.5 5.3003 38.9 8.3607 
31.0 6.2712 36.9 8.1400 
30.9 5.9592 45.8 9.1416 
MPG Displacement 
Make Model (highway) (in?) 
Acura Legend 30 9T 
BMW 7351 19 209 
Buick Regal 29 173 
Chevrolet Cavalier 32 121 
Chevrolet Celebrity 30 151 
Chrysler Conquest 24 156 
Dodge Aries 30 135 
Dodge Dynasty 28 181 
Ford Escort 31 114 
Ford 302 
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(b) Find the mean selling price given that the taxes paid are 
x = 7.50. 

(c) Calculate the fitted value of y corresponding to x = 
5.8980. Find the corresponding residual. 

(d) Calculate the fitted y; for each value of x; used to fit the 
model. Then construct a graph of y; versus the correspon- 
ding observed value y; and comment on what this plot 
would look like if the relationship between y and x was a 
deterministic (no random error) straight line. Does the 
plot actually obtained indicate that taxes paid is an effec- 
tive regressor variable in predicting selling price? 


11-6. The number of pounds of steam used per month by a 
chemical plant is thought to be related to the average ambient 
temperature (in^ F) for that month. The past year's usage and 
temperature are shown in the following table: 


Month Temp. Usage/1000 Month Temp. Usage/1000 


Jan. 21 185.79 July 68 621.55 
Feb. 24 214.47 Aug. 74 675.06 
Mar. 32 288.03 Sept. 62 562.03 
Apr. 47 424.84 Oct. 50 452.93 
May 50 454.58 Nov. 41 369.95 
June 59 539.03 Dec. 30 213.98 


(a) Assuming that a simple linear regression model is appro- 
priate, fit the regression model relating steam usage (y) to 
the average temperature (x). What is the estimate of o°? 

(b) What is the estimate of expected steam usage when the 
average temperature is 55?F? 

(c) What change in mean steam usage 1s expected when the 
monthly average temperature changes by 1°F? 

(d) Suppose the monthly average temperature is 47^F. Calculate 
the fitted value of y and the corresponding residual. 

11-7. The data shown in the following table are highway 

gasoline mileage performance and engine displacement for a 

sample of 20 cars. 


Engine 


Engine 
MPG Displacement 
Make Model (highway) (in?) 
Ford Taurus 2] 153 
Ford Tempo 33 90 
Honda Accord 30 119 
Mazda RX-7 23 80 
Mercedes 260E 24 159 
Mercury Tracer 29 97 
Nissan Maxima 26 181 
Oldsmobile Cutlass 29 173 
Plymouth Laser 37 122 
Pontiac Grand Prix 29 173 


Mustang 23 
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(a) Fit a simple linear model relating highway miles per gal- 
lon ( y) to engine displacement (x) using least squares. 

(b) Find an estimate of the mean highway gasoline mileage 
performance for a car with 150 cubic inches engine dis- 
placement. 

(c) Obtain the fitted value of y and the corresponding residual 
for a car, the Ford Escort, with engine displacement of 114 
cubic inches. 


11-8. An article in the Tappi Journal (March, 1986) pre- 
sented data on green liquor Na,S concentration (in grams per 
liter) and paper machine production (in tons per day). The data 
(read from a graph) are shown as follows: 


y 40 42 49 46 44 48 


830 890 895 890 910 


y | 46 43 29 22 54 57 58 


960 | 990 | 1010 | 1012 | 1030 | 1050 

(a) Fit a simple linear regression model with y = green liquor 
Na,S concentration and x = production. Find an estimate 
of o?. Draw a scatter diagram of the data and the resulting 
least squares fitted model. 

(b) Find the fitted value of y corresponding to x = 910 and 
the associated residual. 

(c) Find the mean green liquor Na,S concentration when the 


production rate 1s 950 tons per day. 


11-9. An article in the Journal of Sound and Vibration 
(Vol. 151, 1991, pp. 383—394) described a study investigating 
the relationship between noise exposure and hypertension. 
The following data are representative of those reported in the 
article. 


y| 1] OJ 1121]5]|1]|41]| 6 9; 3 


x1601631 651 70! 701 70: 801 90 | 80 | 80 


y1514161|181415|7|9]|7] 6 


x | 851 89| 90| 90! 901 90 | 94 | 100 | 100 | 100 

(a) Draw a scatter diagram of y (blood pressure rise in 
millimeters of mercury) versus x (sound pressure level in 
decibels). Does a simple linear regression model seem 
reasonable in this situation? 

(b) Fit the simple linear regression model using least squares. 


Find an estimate of o. 


(c) Find the predicted mean rise in blood pressure level asso- 
ciated with a sound pressure level of 85 decibals. 


11-10. An article in Wear (Vol. 152, 1992, pp. 171—181) 


m presents data on the fretting wear of mild steel and oil viscos- 


CY 


ity. Representative data follow, with x = oil viscosity and y = 
wear volume (10 ^ cubic millimeters). 
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X 35.5 | 43.0 | 40.5 33.0 


(a) Construct a scatter plot of the data. Does a simple linear 
regression model appear to be plausible? 

(b) Fit the simple linear regression model using least squares. 
Find an estimate of o?. 

(c) Predict fretting wear when viscosity x — 30. 

(d) Obtain the fitted value of y when x — 22.0 and calculate 
the corresponding residual. 


11-11. An article in the Journal of Environmental 
Engineering (Vol. 115, No. 3, 1989, pp. 608—619) reported the 
results of a study on the occurrence of sodium and chloride in 
surface streams in central Rhode Island. The following data 
are chloride concentration y (in milligrams per liter) and road- 
way area in the watershed x (in percentage). 


y 4.4 6.6 9.7 10.9 


10.6 


10.8 








X 0.19 0.15 0.57 0.70 0.67 0.63 
y 11.8 12.1 14.3 14.7 15.0 L5 
x 0.47 0.70 0.60 0.78 0.81 0.78 
y 19.2 23.1 27.4 21.1 31.8 39.5 





1.05 1.06 1.62 


1.74 


(a) Draw a scatter diagram of the data. Does a simple linear 
regression model seem appropriate here? 

(b) Fit the simple linear regression model using the method of 
least squares. Find an estimate of o°. 

(c) Estimate the mean chloride concentration for a watershed 
that has 1% roadway area. 

(d) Find the fitted value corresponding to x — 0.47 and the 
associated residual. 


11-12. A rocket motor is manufactured by bonding together 
two types of propellants, an igniter and a sustainer. The shear 
strength of the bond y is thought to be a linear function of the 
age of the propellant x when the motor is cast. Twenty obser- 
vations are shown in the table on the next page. 

(a) Draw a scatter diagram of the data. Does the straight-line 
regression model seem to be plausible? 

(b) Find the least squares estimates of the slope and inter- 
cept in the simple linear regression model. Find an 
estimate of o?. 

(c) Estimate the mean shear strength of a motor made from 
propellant that is 20 weeks old. 

(d) Obtain the fitted values y; that correspond to each ob- 
served value y, Plot $; versus y,, and comment on what 
this plot would look like if the linear relationship between 


11-3 PROPERTIES OF THE LEAST SQUARES ESTIMATORS 


Observation Strength y Age x 
Number (psi) (weeks) 

l 2158.70 15.50 

2 1678.15 29415 

3 2316.00 8.00 

4 2061.30 17.00 

5 2207.50 5.00 

6 1708.30 19.00 

7 1784.70 24.00 

8 2575.00 2.50 

9 2357,90 7.50 

10 2277.40 11.00 


shear strength and age were perfectly deterministic (no er- 
ror). Does this plot indicate that age 1s a reasonable choice 
of regressor variable in this model? 


11-13. Show that in a simple linear regression model 
the point (x, y) lies exactly on the least squares regression 
line. 


11-14. Consider the simple linear regression model Y = By + 

Dix + e. Suppose that the analyst wants to use z = x — x as the 

regressor variable. 

(a) Using the data in Exercise 11-12, construct one scatter 
plot of the (x; y;) points and then another of the 
(z; = x; — x, yj) points. Use the two plots to intuitively 
explain how the two models, Y = By + Bix + € and 
Y = B + Biz + e, are related. 

(b) Find the least squares estimates of 85 and B¥ in the model 
Y = p$ + Biz + e. How do they relate to the least 
squares estimates By and By? 
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Observation Strength y Age x 

Number (psi) (weeks) 

11 2165.20 13.00 

12 2399.55 5.75 

13 1779.80 25.00 

14 2336.75 9.75 

15 1765.30 22.00 

16 2053.50 18.00 

17 2414.40 6.00 

18 2200.50 12.50 

19 2654.20 2.00 

20 1753.70 21.50 
11-15. Suppose we wish to fit the model y? = Bý + 


Bi(x, — x) + e; where yt = y; — y (i = 1,2,..., n). Find 
the least squares estimates of B% and Bł. How do they relate 
to By and Bi? 

11-16. Suppose we wish to fit a regression model for which 
the true regression line passes through the point (0, 0). The ap- 
propriate model is Y = Bx + e. Assume that we have n pairs 
of data (x1, y), Q6, Y2), ... , Xn Yn). Find the least squares esti- 
mate of p. 

11-17. Using the results of Exercise 11-16, fit the model 
Y = Bx + e to the chloride concentration-roadway area 
data in Exercise 11-11. Plot the fitted model on a scatter 
diagram of the data and comment on the appropriateness of 
the model. 


11-3 PROPERTIES OF THE LEAST SQUARES ESTIMATORS 


The statistical properties of the least squares estimators By and B, may be easily described. 
Recall that we have assumed that the error term e in the model Y = B, + Bix + eisarandom 
variable with mean zero and variance o^. Since the values of x are fixed, Y is a random vari- 
able with mean my, = Bo + Bix and variance o°. Therefore, the values of By and B, depend 
on the observed y's; thus, the least squares estimators of the regression coefficients may be 
viewed as random variables. We will investigate the bias and variance properties of the least 
squares estimators Bo and B. 

Consider first B,. Because B, is a linear combination of the observations Y, we can use 
properties of expectation to show that expected value of B, is 


E(B1) = By (11-15) 


Thus, B, is an unbiased estimator of the true slope B}. 
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Definition 


Now consider the variance of B,. Since we have assumed that V(e;) = o°, it follows that 
V(Y,) = o?, and it can be shown that 


^ o 
WBi) = 5 (11-16) 
For the intercept, we can show that 
A : xlix 
E(Bo) = Bo and V(Bo) = o0^|; + rm (11-17) 


Thus, Bo i is an unbiased estimator of the intercept By. The covariance of the random variables 
Bo and B, is not zero. r can be shown (see Exercise 11-69) that cov(Bo, B) = = —0°X/S 

The estimate of o? could be used in Equations 11-16 and 11-17 to provide estimates of the 
variance of the slope and the intercept. We call the square roots of the resulting variance esti- 
mators the estimated standard errors of the slope and intercept, respectively. 


In simple linear regression the estimated standard error of the slope and the 
estimated standard error of the intercept are 


respectively, where G* is computed from Equation 11-13. 





The Minitab computer output in Table 11-2 reports the estimated standard errors of the slope 
and intercept under the column heading “SE coeff.” 


11-4 SOME COMMENTS ON USES OF REGRESSION (CD ONLY) 


11-5 HYPOTHESIS TESTS IN SIMPLE LINEAR REGRESSION 


An important part of assessing the adequacy of a linear regression model is testing statistical hy- 
potheses about the model parameters and constructing certain confidence intervals. Hypothesis 
testing in simple linear regression is discussed in this section, and Section 11-6 presents meth- 
ods for constructing confidence intervals. To test hypotheses about the slope and intercept of the 
regression model, we must make the additional assumption that the error component in the 
model, €, is normally distributed. Thus, the complete assumptions are that the errors are nor- 
mally and independently distributed with mean zero and variance o^, abbreviated NID(0, o^). 


11-5.1 Use of t-Tests 


Suppose we wish to test the hypothesis that the slope equals a constant, say, , 9. The appro- 
priate hypotheses are 


Ho: B1 = Bio 
Hy By # Bio (11-18) 
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where we have assumed a two-sided alternative. Since the errors e; are NID(0, o^), it follows 
directly that the observations Y; are NID(B, + Bix» o^). Now B, is a linear combination of 
independent normal random variables, and consequently, B, is N(B,, o0?/5,,), using the bias 
and variance properties of the slope discussed in Section 11-3. In addition, (n — 2)67/o^ has 
a chi-square distribution with n — 2 degrees of freedom, and B, is independent of 62. As a 
result of those properties, the statistic 


^ 


D; Bio 
V 6^/S,, 





Ty = (11-19) 


follows the ¢ distribution with n — 2 degrees of freedom under Ho: B, = B, 9. We would reject 
Ay: By = Bio if 


[to] > lo2,n —2 (11-20) 


where tọ is computed from Equation 11-19. The denominator of Equation 11-19 is the standard 
error of the slope, so we could write the test statistic as 


^ 


|. Bi 7 Bio 
^07 se(B) 


A similar procedure can be used to test hypotheses about the intercept. To test 


Ho: Bo = Boo 
Hi: Bo 7 Boo (11-21) 


we would use the statistic 


^ ^ 


Do — Boo Bo — Boo 
Ta = ~ = ———À——— 11-22 
i a) l X se(Bo) 
Oo A + S. 


and reject the null hypothesis if the computed value of this test statistic, £j, is such that 
Io] > tuy2,n—2- Note that the denominator of the test statistic in Equation 11-22 is just the stan- 
dard error of the intercept. 

A very important special case of the hypotheses of Equation 11-18 is 


Ho: By = 
Hi: Bı + 0 (11-23) 


These hypotheses relate to the significance of regression. Failure to reject Hy: Q, = 0 is 
equivalent to concluding that there is no linear relationship between x and Y. This situation is 
illustrated in Fig. 11-5. Note that this may imply either that x 1s of little value in explaining the 
variation in Y and that the best estimator of Y for any x is » = Y (Fig. 11-5a) or that the true 
relationship between x and Y is not linear (Fig. 11-55). Alternatively, if Ho: D, = 0 is rejected, 
this implies that x is of value in explaining the variability in Y (see Fig. 11-6). Rejecting Ho: 
8, = 0 could mean either that the straight-line model is adequate (Fig. 11-6a) or that, 
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Figure 11-5 The 
hypothesis Ho: B; = 0 
is not rejected. 


EXAMPLE 11-2 


Figure 11-6 The 


hypothesis Hp: D, = 0 


is rejected. 


(a) (b) 


although there is a linear effect of x, better results could be obtained with the addition of 
higher order polynomial terms in x (Fig. 11-65). 


We will test for significance of regression using the model for the oxygen purity data from 
Example 11-1. The hypotheses are 

Ho: Bı = 0 

H 1: Bı # 0 


and we will use a = 0.01. From Example 11-1 and Table 11-2 we have 
ĝi = 14.97 n=20, S,, = 0.68088, 6? = 1.18 


so the t-statistic in Equation 10-20 becomes 


^ 


Bi 14.947 


p a — = — 
: &S.. se(B)  V1.18/0.68088 





Since the reference value of f is 99951 = 2.88, the value of the test statistic is very far 
into the critical region, implying that Hp: Bı = 0 should be rejected. The P-value for this test 
is P = 1.23 X 10 ?. This was obtained manually with a calculator. 

Table 11-2 presents the Minitab output for this problem. Notice that the t-statistic value 
for the slope is computed as 11.35 and that the reported P-value is P — 0.000. Minitab also 
reports the t-statistic for testing the hypothesis Hp: By = 0. This statistic is computed from 
Equation 11-22, with Bo; = 0, as fj = 46.62. Clearly, then, the hypothesis that the intercept is 
Zero is rejected. 





(a) (b) 
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11-5.2 Analysis of Variance Approach to Test Significance of Regression 


A method called the analysis of variance can be used to test for significance of regression. 
The procedure partitions the total variability in the response variable into meaningful compo- 
nents as the basis for the test. The analysis of variance identity is as follows: 


(11-24) 





The two components on the right-hand-side of Equation 11-24 measure, respectively, the 
amount of variability in y; accounted for by the regression line and the residual variation left 
unexplained by the regression line. We usually call SS; = 3.4, (y; — fiy the error sum of 
squares and $$, = YX;4(j; — YY the regression sum of squares. Symbolically, Equation 
11-24 may be written as 


SS SS SS (11525) 


where SS; = >;_, (y; — yy. is the total corrected sum of squares of y. In Section 11-2 we 
noted that SS; = SS. — BS, (see Equation 11-14), so since SS, = BS. + SS,, we note that the 
regression sum of squares in Equation 10-26 is SS, = B,S,,. The total sum of squares SS; has 
n — 1 degrees of freedom, and SS, and SS; have 1 and n — 2 degrees of freedom, respectively. 

We may show that E[SSz/(n — 2)] = o°, E(SSp) = o^ + B1S,, and that SS;/o* and 
SSp/o* are independent chi-square random variables with n — 2 and 1 degrees of freedom, re- 
spectively. Thus, if the null hypothesis Hp: B, = 0 is true, the statistic 


SSy/1 — MS; res 


F, = —-——_"— = 
0  SSg/(n— 2)  MSg 





follows the F; „-2 distribution, and we would reject Ho if fo > foi n—2- The quantities MS, = 
SSp/1 and MS, = SS,/(n — 2) are called mean squares. In general, a mean square is always 
computed by dividing a sum of squares by its number of degrees of freedom. The test proce- 
dure is usually arranged in an analysis of variance table, such as Table 11-3. 


Table 11-3 Analysis of Variance for Testing Significance of Regression 


Source of Sum of Degrees of Mean 

Variation Squares Freedom Square Fo 
Regression SS, = BiS 1 MS, MS,/ MS; 
Error SSp = SSp — BS, n—2 MS; 

Total SS n= 


Note that MS; = 67. 
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EXAMPLE 11-3 
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We will use the analysis of variance approach to test for significance of regression using the 
oxygen purity data model from Example 11-1. Recall that SS; = 173.38, B; = 14.947, 
Sy = 10.17744, and n = 20. The regression sum of squares is 


SSp = B1S,, = (14.947)10.17744 = 152.13 
and the error sum of squares 1s 


SS; = SSp — SSp = 173.38 — 152.13 = 21.25 


The analysis of variance for testing Hy: B, = 0 1s summarized in the Minitab output in 
Table 11-2. The test statistic is fj = MSp/MS, = 152.13/1.18 = 128.86, for which we find 
that the P-value is P = 1.23 X 10 ?, so we conclude that B, is not zero. 

There are frequently minor differences in terminology among computer packages. For 
example, sometimes the regression sum of squares is called the *model" sum of squares, and 
the error sum of squares is called the “residual” sum of squares. 


Note that the analysis of variance procedure for testing for significance of regression is 
equivalent to the t-test in Section 11-5.1. That is, either procedure will lead to the same conclusions. 
This is easy to demonstrate by starting with the /-test statistic in Equation 11-19 with B; = 0, say 


^ 








Pi 
To) =a 11-27 
^^ Ves. uM 
Squaring both sides of Equation 11-27 and using the fact that 6^ = MS; results in 
As - 
os S, MS 
T2 — BiSex _ BiSsy _ MSR (11-28) 


MS; MS, MSk 


Note that 7? in Equation 11-28 is identical to Fy in Equation 11-26 It is true, in general, that 
the square of a t random variable with v degrees of freedom is an F random variable, with one 
and v degrees of freedom in the numerator and denominator, respectively. Thus, the test using 
To is equivalent to the test based on Fy. Note, however, that the t-test is somewhat more flexi- 
ble in that it would allow testing against a one-sided alternative hypothesis, while the F-test is 
restricted to a two-sided alternative. 


EXERCISES FOR SECTION 11-5 


11-18. Consider the data from Exercise 11-1 on x = com- 

pressive strength and y — intrinsic permeability of concrete. 

(a) Test for significance of regression using a = 0.05. Find 
the P-value for this test. Can you conclude that the model 
specifies a useful linear relationship between these two 
variables? 

(b) Estimate o? and the standard deviation of f. 

(c) What is the standard error of the intercept in this model? 


11-19. Consider the data from Exercise 11-2 on x = road- 
way surface temperature and y — pavement deflection. 


(a) Test for significance of regression using a = 0.05. Find 
the P-value for this test. What conclusions can you draw? 
(b) Estimate the standard errors of the slope and intercept. 


11-20. Consider the National Football League data in 

Exercise 11-4. 

(a) Test for significance of regression using a = 0.01. Find 
the P-value for this test. What conclusions can you draw? 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test (using a = 0.01) Hy: B, = —0.01 versus H;: B, # 
—0.01. Would you agree with the statement that this is a test 
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of the claim that if you can decrease the opponent’s rushing 
yardage by 100 yards the team will win one more game? 


11-21. Consider the data from Exercise 11-5 on y = sales 

price and x = taxes paid. 

(a) Test Hy: B, = O using the t-test; use a = 0.05. 

(b) Test Hy: Bı = O using the analysis of variance with a = 
0.05. Discuss the relationship of this test to the test from 
part (a). 

(c) Estimate the standard errors of the slope and intercept. 

(d) Test the hypothesis that By = 0. 


11-22. Consider the data from Exercise 11-6 on y = steam 

usage and x = average temperature. 

(a) Test for significance of regression using a = 0.01. What 
is the P-value for this test? State the conclusions that re- 
sult from this test. 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test the hypothesis Hy: B, = 10 versus H,: B, ~ 10 using 
a = 0.01. Find the P-value for this test. 

(d) Test Hy: Bg = 0 versus H,: Bo Æ 0 using a = 0.01. Find 
the P-value for this test and draw conclusions. 


11-23. Exercise 11-7 gave 20 observations on y — highway 

gasoline mileage and x — engine displacement. 

(a) Test for significance of regression using a = 0.01. Find 
the P-value for this test. What conclusions can you reach? 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test Hy: B, = —0.05 versus H,: B, < —0.05 using a = 
0.01 and draw conclusions. What is the P-value for this test? 

(d) Test the hypothesis Hp: Bo = 0 versus H,: By ~ 0 using 
a = 0.01. What is the P-value for this test? 


11-24. Exercise 11-8 gave 13 observations on y — green 

liquor Na,S concentration and x = production in a paper mill. 

(a) Test for significance of regression using a = 0.05. Find 
the P-value for this test. 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test Hy: Bo = 0 versus H,: By # 0 using a = 0.05. What 
is the P-value for this test? 


11-25. Exercise 11-9 presented data on y — blood pressure 

rise and x — sound pressure level. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Estimate the standard errors of the slope and intercept. 

(c) Test Hp: By = 0 versus H,: Bo # 0 using a = 0.05. Find 
the P-value for this test. 
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11-26. Exercise 11-11 presented data on y — chloride con- 

centration in surface streams and x — roadway area. 

(a) Test the hypothesis Hy: D, = 0 versus H,: B, ~ 0 using 
the analysis of variance procedure with a = 0.01. 

(b) Find the P-value for the test in part (a). 

(c) Estimate the standard errors of B, and Bo. 

(d) Test Hp: Bg = 0 versus H,: Bo Æ 0 using a = 0.01. What 
conclusions can you draw? Does it seem that the model 
might be a better fit to the data if the intercept were re- 
moved? 


11-27. Refer to Exercise 11-12, which gives 20 observations 
on y — shear strength of a propellant and x — propellant age. 


(a) Test for significance of regression with a = 0.01. Find the 
P-value for this test. 

(b) Estimate the standard errors of Bo and B. 

(c) Test Hy: Bı = —30 versus H;: B, ~ —30 using a = 0.01. 
What is the P-value for this test? 

(d) Test Hp: Bo = 0 versus H,: Bo # 0 using a = 0.01. What 
is the P-value for this test? 

(e) Test Hy: By = 2500 versus H,: By > 2500 using a = 
0.01. What is the P-value for this test? 


11-28. Suppose that each value of x; is multiplied by a pos- 
itive constant a, and each value of y; is multiplied by another 
positive constant b. Show that the t-statistic for testing Hj: 
D, = 0 versus H: B, # 0 is unchanged in value. 


11-29. Consider the no-intercept model Y= Bx + € 
with the es NID(0, o°). The estimate of o^ is s* = 
Dao Êx}? /(n — 1) and (B) -o?/X x. 

(a) Devise a test statistic for Họ: B = 0 versus Hj: B # 0. 
(b) Apply the test in (a) to the model from Exercise 11-17. 


11-30. The type II error probability for the t-test for Hp: 
D, = Bio can be computed in a similar manner to the t-tests 
of Chapter 9. If the true value of B, is Bj, the value 
d —|gio — Bil/(o V(n — 1)/S, is calculated and used as 
the horizontal scale factor on the operating characteristic 
curves for the t-test, (Appendix Charts VIe through VIA) and 
the type II error probability is read from the vertical scale us- 
ing the curve for n — 2 degrees of freedom. Apply this proce- 
dure to the football data of Exercise 11-4, using o = 2.4 and 
D; = —0.005, where the hypotheses are Hp: B, = —0.01 ver- 
sus H,: B, # —0.01. 





11-6.1 Confidence Intervals on the Slope and Intercept 


In addition to point estimates of the slope and intercept, it is possible to obtain confidence in- 


terval estimates of these parameters. The width of these confidence intervals is a measure of 


= 
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the overall quality of the regression line. If the error terms, e;, in the regression model are nor- 
mally and independently distributed, 


z 
(Bi — 8)/V/82/S,, and (Bo — Bo)/ E 4 = 


n Sy X 


are both distributed as t random variables with n — 2 degrees of freedom. This leads to the fol- 
lowing definition of 100(1 — a)% confidence intervals on the slope and intercept. 


Definition 
Under the assumption that the observations are normally and independently distributed, 
a 100(1 — a)% confidence interval on the slope B, in simple linear regression is 
ô? a a2 


^ ô 
Bia. Ss. m B EE Bir nu d S (0101529) 


Similarly, a 100(1 — a)% confidence interval on the intercept p, is 


d 
s B, =f) +t exti (11-30) 
a/2,n 2 n Se 





EXAMPLE 11-4 We will find a 95% confidence interval on the slope of the regression line using the data in 


11-6.2 


Example 11-1. Recall that 8, = 14.947, S., = 0.68088, and 6? = 1.18 (see Table 11-2). 
Then, from Equation 10-31 we find 


` ô? : ô? 
Di — fo.05.18 S. = B; = Bi t f0.025,184/ <7 
XX XX 


L18 n 
iere diu QU URS c PES SAP Sd CES 


Or 





IA 





This simplifies to 


[2:197 6, 2 17-697 


Confidence Interval on the Mean Response 


A confidence interval may be constructed on the mean response at a specified value of x, say, 
xo. This is a confidence interval about E(Y |x ) = I. |, and is often called a confidence interval 
about the regression line. Since E(Y |x;) = Wyl, = Bo + Bixo, we may obtain a point estimate 
of the mean of Y at x = xo(py),,) from the fitted model as 


Ariz = Bo + Bixo 


Definition 


EXAMPLE 11-5 
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Now fiy)x, is an unbiased point estimator of py),., since Bo and B, are unbiased estimators of 
Bo and B,. The variance of [Ly|,. is 


Vlr) = o| n um 


This last result follows from the fact that cov (Y, 81) = 0 (Refer to Exercise 11-71). Also, [LY x, 
is normally distributed, because B, and By are normally distributed, and if we 6^ use as an 
estimate of o^, it is easy to show that 


Py x n P Y x, 





1 2 
^2 
? E M | 


has a ¢ distribution with n — 2 degrees of freedom. This leads to the following confidence in- 
terval definition. 


A 100(1 — a)% confidence interval about the mean response at the value of 
X = Xo, SAY uy |x,, 18 given by 


A el 
I yi, — fa/2,n-24] 9 | B+ 





(1015315 


"$ | 


^ A 
S Myj = BY|x, T by /2,n-24} 0 | 


where Arin, = Bo + Bixo is computed from the fitted regression model. 


Note that the width of the confidence interval for jy ,, is a function of the value specified for 
xo. The interval width is a minimum for xy = x and widens as |xy — x| increases. 


We will construct a 95% confidence interval about the mean response for the data 1 
11-1. The fitted model is fxy|,, = 74.283 + 14.947xo, and the 95% confidence interval on 
[Ly|x, 18 found from Equation 11-31 as 





yu, + 2.1014 [1 i| D „œ el 
PYps = £ Mb 0.68088 


Suppose that we are interested in predicting mean oxygen purity when xy — 1.0096. Then 
[Ly |x, 99 = 74.283 + 14.947(1.00) = 89.23 


and the 95% confidence interval is 


8923 + 2.101. /1.18 + ee D ciu g 
es 71 90 0.68088 
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Figure 11-7  Scatter 
diagram of oxygen 
purity data from 
Example 11-1 with 
fitted regression line 
and 95 percent 
confidence limits on 


P yx, 
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Therefore, the 95% confidence interval on Wwyj1.00 is 
88.48 < Wy) 1.00 = 89.98 


Minitab will also perform these calculations. Refer to Table 11-2. The predicted value of y at 
x — 1.00 is shown along with the 9596 CI on the mean of y at this level of x. 

By repeating these calculations for several different values for xy we can obtain confi- 
dence limits for each corresponding value of wy),,. Figure 11-7 displays the scatter diagram 
with the fitted model and the corresponding 9596 confidence limits plotted as the upper and 
lower lines. The 95% confidence level applies only to the interval obtained at one value of x 
and not to the entire set of x-levels. Notice that the width of the confidence interval on jy, 
increases as |xọ — x| increases. 


11-7 PREDICTION OF NEW OBSERVATIONS 


An important application of a regression model is predicting new or future observations Y 
corresponding to a specified level of the regressor variable x. If x is the value of the regressor 
variable of interest, 


Yo = Bo + Bux (11-32) 


is the point estimator of the new or future value of the response Yọ. 

Now consider obtaining an interval estimate for this future observation Yọ. This new 
observation is independent of the observations used to develop the regression model. 
Therefore, the confidence interval for wy),, in Equation 11-31 is inappropriate, since it is based 
only on the data used to fit the regression model. The confidence interval about py,,, refers to 
the true mean response at x = x, (that is, a population parameter), not to future observations. 


Definition 


EXAMPLE 11-6 
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Let Y, be the future observation at x = Xp, and let Y, given by Equation 11-32 be the es- 
timator of Yọ. Note that the error in prediction 


C= fg di 


is a normally distributed random variable with mean zero and variance 


EA 


x 1 
Ve) = Yo - 3) e rege 5 


because Y, is independent of Y. If we use 6? to estimate o?, we can show that 
Yo — Yo 


l 
ZEE 


has a ¢ distribution with n — 2 degrees of freedom. From this we can develop the following 
prediction interval definition. 


A 100(1 — a) % prediction interval on a future observation Y, at the value x, is 
given by 


l 
5 AU 
JA T sup OR a dec: 


1 Ma 
sheit ipa hi L| s 


The value yp is computed from the regression model jp = Bo aE Bixo: 





Notice that the prediction interval is of minimum width at xy = x and widens as |xọ — x | 
increases. By comparing Equation 11-33 with Equation 11-31, we observe that the prediction 
interval at the point x, 1s always wider than the confidence interval at xy. This results because 
the prediction interval depends on both the error from the fitted model and the error associated 
with future observations. 


To illustrate the construction of a prediction interval, suppose we use the data in Example 11-1 
and find a 95% prediction interval on the next observation of oxygen purity at xy = 1.00%. 
Using Equation 11-33 and recalling from Example 11-5 that yy = 89.23, we find that the 
prediction interval 1s 


N (1.00 — | 
20 0.68088 








b, (1.00 — 1.1960) 
20 0.68088 





< Y) = 89.23 + 2.101 i181 4 


SJ 
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which simplifies to 


86.83 < y) < 91.63 


Minitab will also calculate prediction intervals. Refer to the output in Table 11-2. The 95% PI 
on the future observation at xy = 1.00 is shown in the display. 

By repeating the foregoing calculations at different levels of xọ, we may obtain the 95% 
prediction intervals shown graphically as the lower and upper lines about the fitted regression 
model in Fig. 11-8. Notice that this graph also shows the 95% confidence limits on Myx, 
calculated in Example 11-5. It illustrates that the prediction limits are always wider than the 


confidence limits. 


EXERCISES FOR SECTIONS 11-6 AND 11-7 


11-31. Refer to the data in Exercise 11-1 on y = intrinsic 

permeability of concrete and x — compressive strength. Find 

a 95% confidence interval on each of the following: 

(a) Slope (b) Intercept 

(c) Mean permeability when x — 2.5 

(d) Find a 9596 prediction interval on permeability when 
x = 2.5. Explain why this interval is wider than the 
interval in part (c). 

11-32. Exercise 11-2 presented data on roadway surface 

temperature x and pavement deflection y. Find a 9996 confi- 

dence interval on each of the following: 

(a) Slope (b) Intercept 

(c) Mean deflection when temperature x = 85°F 

(d) Find a 99% prediction interval on pavement deflection 
when the temperature is 90°F. 


11-33. Exercise 11-4 presented data on the number of 


games won by NFL teams in 1976. Find a 95% confidence in- 


terval on each of the following: 


(a) Slope (b) Intercept 

(c) Mean number of games won when opponents rushing 
yardage is limited to x = 1800 

(d) Find a 95% prediction interval on the number of games 
won when opponents rushing yards is 1800. 


11-34. Refer to the data on y — house selling price and 

x — taxes paid in Exercise 11-5. Find a 9596 confidence inter- 

val on each of the following: 

(a) Bı (b) Bo 

(c) Mean selling price when the taxes paid are x = 7.50 

(d) Compute the 95% prediction interval for selling price 
when the taxes paid are x — 7.50. 


11-35. Exercise 11-6 presented data on y — steam usage 

and x = monthly average temperature. 

(a) Find a 99% confidence interval for B4. 

(b) Find a 99% confidence interval for Bo. 

(c) Find a 95% confidence interval on mean steam usage 
when the average temperature is 55°F. 


(d) Find a 95% prediction interval on steam usage when tem- 
perature is 55°F. Explain why this interval is wider than 
the interval in part (c). 


11-36. Exercise 11-7 presented gasoline mileage perform- 

ance for 20 cars, along with information about the engine 

displacement. Find a 9596 confidence interval on each of the 

following: 

(a) Slope (b) Intercept 

(c) Mean highway gasoline mileage when the engine dis- 
placement is x = 150 in? 

(d) Construct a 9596 prediction interval on highway gasoline 
mileage when the engine displacement is x = 150 in’. 


11-37. Consider the data in Exercise 11-8 on y — green 

liquor Na,S concentration and x = production in a paper mill. 

Find a 99% confidence interval on each of the following: 

(a) Bı (b) Bo 

(c) Mean Na,S concentration when production x = 910 
tons/day 

(d) Find a 99% prediction interval on Na,S concentration 
when x — 910 tons/day. 

11-38. Exercise 11-9 presented data on y — blood pressure 

rise and x = sound pressure level. Find a 95% confidence 

interval on each of the following: 


(a) Bi (b) Bo 
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(c) Mean blood pressure rise when the sound pressure level is 
85 decibals 

(d) Find a 95% prediction interval on blood pressure rise 
when the sound pressure level is 85 decibals. 


11-39. Refer to the data in Exercise 11-10 on y = wear 

volume of mild steel and x — oil viscosity. Find a 9596 confi- 

dence interval on each of the following: 

(a) Intercept (b) Slope 

(c) Mean wear when oil viscosity x — 30 

11-40. Exercise 11-11 presented data on chloride concentra- 

tion y and roadway area x on watersheds in central Rhode Island. 

Find a 99% confidence interval on each of the following: 

(a) Bi (b) Bo 

(c) Mean chloride concentration when roadway area x = 1.0% 

(d) Find a 99% prediction interval on chloride concentration 
when roadway area x = 1.0%. 


11-41. Refer to the data in Exercise 11-12 on rocket motor 

shear strength y and propellant age x. Find a 95% confidence 

interval on each of the following: 

(a) Slope B; (b) Intercept Bo 

(c) Mean shear strength when age x = 20 weeks 

(d) Find a 95% prediction interval on shear strength when age 
x — 20 weeks. 


11-8 ADEQUACY OF THE REGRESSION MODEL 


Fitting a regression model requires several assumptions. Estimation of the model parameters 
requires the assumption that the errors are uncorrelated random variables with mean zero and 
constant variance. Tests of hypotheses and interval estimation require that the errors be nor- 
mally distributed. In addition, we assume that the order of the model is correct; that is, if we 
fit a simple linear regression model, we are assuming that the phenomenon actually behaves in 
a linear or first-order manner. 

The analyst should always consider the validity of these assumptions to be doubtful and 
conduct analyses to examine the adequacy of the model that has been tentatively entertained. 
In this section we discuss methods useful in this respect. 


11-8.1 Residual Analysis 


The residuals from a regression model are e; = y; — 5, i = 1,2,...,n , where y; is an actual 
observation and y; is the corresponding fitted value from the regression model. Analysis of the 
residuals is frequently helpful in checking the assumption that the errors are approximately 
normally distributed with constant variance, and in determining whether additional terms in 
the model would be useful. 

As an approximate check of normality, the experimenter can construct a frequency his- 
togram of the residuals or a normal probability plot of residuals. Many computer programs 
will produce a normal probability plot of residuals, and since the sample sizes in regression 
are often too small for a histogram to be meaningful, the normal probability plotting method 
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Figure 11-9 Patterns 
for residual plots. 

(a) satisfactory, 

(b) funnel, (c) double 
bow, (d) nonlinear. 
[Adapted from 
Montgomery, Peck, 
and Vining (2001).] 


EXAMPLE 11-7 
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(c) (d) 


is preferred. It requires judgment to assess the abnormality of such plots. (Refer to the discus- 
sion of the “fat pencil" method in Section 6-7). 

We may also standardize the residuals by computing d; = e;/ V6, i=1,2...,n. If 
the errors are normally distributed, approximately 95% of the standardized residuals should 
fall in the interval (—2, +2). Residuals that are far outside this interval may indicate the 
presence of an outlier, that is, an observation that is not typical of the rest of the data. Various 
rules have been proposed for discarding outliers. However, outliers sometimes provide 1m- 
portant information about unusual circumstances of interest to experimenters and should not 
be automatically discarded. For further discussion of outliers, see Montgomery, Peck and 
Vining (2001). 

It is frequently helpful to plot the residuals (1) in time sequence (if known), (2), against the 
y,, and (3) against the independent variable x. These graphs will usually look like one of the four 
general patterns shown in Fig. 11-9. Pattern (a) in Fig. 11-9 represents the ideal situation, while 
patterns (b), (c), and (d ) represent anomalies. If the residuals appear as in (b), the variance of the 
observations may be increasing with time or with the magnitude of y; or x;. Data transformation 
on the response y is often used to eliminate this problem. Widely used variance-stabilizing trans- 
formations include the use of Vy, In y, or 1/y as the response. See Montgomery, Peck, and 
Vining (2001) for more details regarding methods for selecting an appropriate transformation. If 
a plot of the residuals against time has the appearance of (5), the variance of the observations is 
increasing with time. Plots of residuals against y; and x; that look like (c) also indicate inequal- 
ity of variance. Residual plots that look like (d) indicate model inadequacy; that is, higher order 
terms should be added to the model, a transformation on the x-variable or the y-variable (or both) 
should be considered, or other regressors should be considered. 


The regression model for the oxygen purity data in Example 11-1 is y = 74.283 + 14.947x. 
Table 11-4 presents the observed and predicted values of y at each value of x from this data set, 
along with the corresponding residual. These values were computed using Minitab and show 
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Table 11-4 Oxygen Purity Data from Example 11-1, Predicted Values, and Residuals 


Hydrocarbon 
evel 
0.99 
1.02 
1.15 
1.29 
1.46 
1.36 
0.87 
1.23 
1.55 
1.40 


Oo co-10]9 NA BW NY 


— 
e 


Oxygen 
Purity, y 
90.01 
89.05 
91.43 
93.74 
96.73 
94.45 
87.59 
E 
99.42 
93.65 


Predicted 
Value, y 
89.069009 
89.518136 
91.464353 
93.560279 
96.105332 
94.608242 
87.272501 
92.662025 
97.452713 
95.207078 


Residual 
dE nd 
0.940991 
— 0.468136 
— 0.034353 
0.179721 
0.624668 
— 0.158242 
0.317499 
— 0.892025 
1.967287 
— 1.557078 


11 
12 
13 
14 
15 
16 
17 
18 
19 
20 


Hydrocarbon Oxygen 
Level, x Purity, y 
1.19 93.54 
1.15 92.52 
0.98 90.56 
1.01 89.54 
1.11 89.85 
1.20 90.39 
1.26 93.25 
1:32 93.41 
1.43 94.98 
0.95 87.33 


Predicted 
Value, v 
92.063189 
91.614062 
88.919300 
89.368427 
90.865517 
92.212898 
93.111152 
94.009406 
95.656205 
88.470173 


Residual 
o 
1.476811 
0.905938 
1.640700 
0.171573 
=1.015517 
— 1.822898 
0.138848 
—0.599406 
—0.676205 
— 1.140173 
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the number of decimal places typical of computer output. A normal probability plot of the 
residuals is shown in Fig. 11-10. Since the residuals fall approximately along a straight line in 
the figure, we conclude that there is no severe departure from normality. The residuals are also 
plotted against the predicted value y; in Fig. 11-11 and against the hydrocarbon levels x; in 
Fig. 11-12. These plots do not indicate any serious model inadequacies. 


11-8.2 Coefficient of Determination (R°) 
The quantity 


Sh 
© SS, 


io 11-34 
=l ss (11-34) 


Rp 
is called the coefficient of determination and is often used to judge the adequacy of a 
regression model. Subsequently, we will see that in the case where X and Y are jointly distrib- 
uted random variables, R? is the square of the correlation coefficient between X and Y. From 
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Figure 11-10 
Example 11-7. 


Normal probability plot of residuals, Figure 11-11 Plot of residuals versus predicted oxygen 


purity y, Example 11-7. 
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Figure 11-12 Plot of 
residuals versus hydro- 
carbon level x, 
Example 11-8. 
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the analysis of variance identity in Equations 11-24 and 11-25, 0 <= R? x 1. We often refer 
loosely to R as the amount of variability in the data explained or accounted for by the regres- 
sion model. For the oxygen purity regression model, we have R? = $S,/S$, = 152.13/ 
173.38 — 0.877; that 1s, the model accounts for 87.796 of the variability in the data. 

The statistic R” should be used with caution, because it is always possible to make R? 
unity by simply adding enough terms to the model. For example, we can obtain a “perfect” fit 
to n data points with a polynomial of degree n — 1. In addition, R° will always increase if we 
add a variable to the model, but this does not necessarily imply that the new model is superior 
to the old one. Unless the error sum of squares in the new model is reduced by an amount 
equal to the original error mean square, the new model will have a larger error mean square 
than the old one, because of the loss of one error degree of freedom. Thus, the new model will 
actually be worse than the old one. 

There are several misconceptions about R^. In general, R? does not measure the magni- 
tude of the slope of the regression line. A large value of R? does not imply a steep slope. 
Furthermore, R? does not measure the appropriateness of the model, since it can be artificially 
inflated by adding higher order polynomial terms in x to the model. Even if y and x are related 
in a nonlinear fashion, R? will often be large. For example, R? for the regression equation in 
Fig. 11-6(5) will be relatively large, even though the linear approximation is poor. Finally, 
even though R? is large, this does not necessarily imply that the regression model will provide 
accurate predictions of future observations. 


11-8.3 Lack-of-Fit Test (CD Only) 


EXERCISES FOR SECTION 11-8 


11-42. 
Exercise 11-4. 


Refer to the NFL team performance data in 


(a) Find the residuals for the least squares model. 
(b) Prepare a normal probability plot of the residuals and in- 


(a) Calculate R? for this model and provide a practical inter- 
pretation of this quantity. 

(b) Prepare a normal probability plot of the residuals from the 
least squares model. Does the normality assumption seem 
to be satisfied? 

(c) Plot the residuals versus y and against x. Interpret these 
graphs. 

11-45. Refer to the data in Exercise 11-5 on house selling 

price y and taxes paid x. 


terpret this display. 

(c) Plot the residuals versus y and versus x. Does the assump- 
tion of constant variance seem to be satisfied? 

(d) What proportion of total variability is explained by the 
regression model? 


11-44. Exercise 11-6 presents data on y — steam usage and 

x — average monthly temperature. 

(a) What proportion of total variability is accounted for by the 
simple linear regression model? 


r eem, * 


SZ 


| EE 


<7 


. 


SJ 


SZ 


(b) Prepare a normal probability plot of the residuals and 
interpret this graph. 

(c) Plot residuals versus y and x. Do the regression assump- 
tions appear to be satisfied? 


11-45. Refer to the gasoline mileage data in Exercise 11-7. 

(a) What proportion of total variability in highway gaso- 
line mileage performance is accounted for by engine 
displacement? 

(b) Plot the residuals versus y and x, and comment on the graphs. 

(c) Prepare a normal probability plot of the residuals. Does 
the normality assumption appear to be satisfied? 


11-46. Consider the data in Exercise 11-8 on y = green 
liquor Na,S concentration and x = paper machine production. 
Suppose that a 14th sample point is added to the original data, 
where y,4 = 59 and x,, = 855. 

(a) Prepare a scatter diagram of y versus x. Fit the simple lin- 
ear regression model to all 14 observations. 

(b) Test for significance of regression with a = 0.05. 

(c) Estimate o? for this model. 

(d) Compare the estimate of o? obtained in part (c) above with 
the estimate of o? obtained from the original 13 points. 
Which estimate is larger and why? 

(e) Compute the residuals for this model. Does the value of 
€14, appear unusual? 

(f) Prepare and interpret a normal probability plot of the 
residuals. 

(g) Plot the residuals versus y and versus x. Comment on 
these graphs. 


11-47. Refer to Exercise 11-9, which presented data on 

blood pressure rise y and sound pressure level x. 

(a) What proportion of total variability in blood pressure rise 
is accounted for by sound pressure level? 

(b) Prepare a normal probability plot of the residuals from 
this least squares model. Interpret this plot. 

(c) Plot residuals versus ? and versus x. Comment on these plots. 


11-48. Exercise 11-10 presents data on wear volume y and 

oil viscosity x. 

(a) Calculate R? for this model. Provide an interpretation of 
this quantity. 

(b) Plot the residuals from this model versus y and versus x. 
Interpret these plots. 

(c) Prepare a normal probability plot of the residuals. Does 
the normality assumption appear to be satisfied? 


11-49. Refer to Exercise 11-11, which presented data on 

chloride concentration y and roadway area x. 

(a) What proportion of the total variability in chloride con- 
centration is accounted for by the regression model? 

(b) Plot the residuals versus and versus x. Interpret these plots. 

(c) Prepare a normal probability plot of the residuals. Does 
the normality assumption appear to be satisfied? 


11-50. Consider the rocket propellant data in Exercise 11-12. 
(a) Calculate R? for this model. Provide an interpretation of 
this quantity. 


11-8 ADEQUACY OF THE REGRESSION MODEL 399 


(b) Plot the residuals on a normal probability scale. Do any 
points seem unusual on this plot? 

(c) Delete the two points identified in part (b) from the 
sample and fit the simple linear regression model to the 
remaining 18 points. Calculate the value of R° for the new 
model. Is it larger or smaller than the value of R? com- 
puted in part (a)? Why? 

(d) Did the value of 6? change dramatically when the two 
points identified above were deleted and the model fit to 
the remaining points? Why? 


11-51. Show that an equivalent way to define the test for 
significance of regression in simple linear regression is to base 
the test on R? as follows: to test Hy: B, = 0 versus H;: B, Æ 0, 
calculate 


" R*(n — 2) 
AE. 


and to reject Hy: B, = 0 if the computed value fy > f, ». 


11-52. Suppose that a simple linear regression model has 

been fit to n = 25 observations and R° = 0.90. 

(a) Test for significance of regression at a = 0.05. Use the 
results of Exercise 11-51. 

(b) What is the smallest value of R* that would lead to the 
conclusion of a significant regression if a = 0.05? 


11-53. Consider the rocket propellant data in Exercise 11- 
12. Calculate the standardized residuals for these data. Does 
this provide any helpful information about the magnitude of 
the residuals? 


11-54. Studentized Residuals. Show that the variance 
of the ith residual 1s 


Lu 
c E B 
Hint: 
^ 1 (xu-xy 
— Wo — 
cov, Y) =o | + S. | 





The ith studentized residual is defined as 





eG) 


(a) Explain why r; has unit standard deviation. 

(b) Do the standardized residuals have unit standard deviation? 

(c) Discuss the behavior of the studentized residual when the 
sample value x; is very close to the middle of the range of x. 

(d) Discuss the behavior of the studentized residual when the 
sample value x; is very near one end of the range of x. 
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11-9 TRANSFORMATIONS TO A STRAIGHT LINE 


We occasionally find that the straight-line regression model Y = By + Bix + € is inappropri- 
ate because the true regression function is nonlinear. Sometimes nonlinearity is visually de- 
termined from the scatter diagram, and sometimes, because of prior experience or underlying 
theory, we know in advance that the model is nonlinear. Occasionally, a scatter diagram will 
exhibit an apparent nonlinear relationship between Y and x. In some of these situations, a non- 
linear function can be expressed as a straight line by using a suitable transformation. Such 
nonlinear models are called intrinsically linear. 

As an example of a nonlinear model that is intrinsically linear, consider the exponential 
function 


Y = Bye e 


This function is intrinsically linear, since it can be transformed to a straight line by a logarith- 
mic transformation 


In Y = In Bo + Bix + Ine 


This transformation requires that the transformed error terms In € are normally and independ- 
ently distributed with mean 0 and variance o. 
Another intrinsically linear function 1s 


l 
Y= By +8,(+)+¢ 
By using the reciprocal transformation z = 1/x, the model is linearized to 
Y = Do + Bız + € 


Sometimes several transformations can be employed jointly to linearize a function. For ex- 
ample, consider the function 


l 
exp(Bo + B,x + €) 


letting Y* = 1/Y, we have the linearized form 
In Y* = Bo + Bix + € 


For examples of fitting these models, refer to Montgomery, Peck, and Vining (2001) or 
Myers (1990). 


11-10 MORE ABOUT TRANSFORMATIONS (CD ONLY) 
11-11 CORRELATION 
Our development of regression analysis has assumed that x is a mathematical variable, meas- 


ured with negligible error, and that Y is a random variable. Many applications of regression 
analysis involve situations in which both X and Y are random variables. In these situations, it 
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is usually assumed that the observations (X; Y), i = 1, 2,..., are jointly distributed random 
variables obtained from the distribution f(x, y). 

For example, suppose we wish to develop a regression model relating the shear strength 
of spot welds to the weld diameter. In this example, weld diameter cannot be controlled. We 
would randomly select n spot welds and observe a diameter (X;) and a shear strength (Y;) for 
each. Therefore (X;, Y;) are jointly distributed random variables. 

We assume that the joint distribution of X; and Y; is the bivariate normal distribution pre- 
sented in Chapter 5, and uy and g% are the mean and variance of Y, wy and o are the mean 
and variance of X, and p is the correlation coefficient between Y and X. Recall that the corre- 
lation coefficient is defined as 





CIF 
P = y (11-35) 
where o yy is the covariance between Y and X. 
The conditional distribution of Y for a given value of X = x is 
l 1/y— Bo- Bey 
= ———— =A s 11-36 
where 
= ui 
Bo = My — HXP Gy (11-37) 
Bi = GP (11-38) 
and the variance of the conditional distribution of Y given X = x is 
oy, = oy(1 — p^) (11-39) 
That is, the conditional distribution of Y given X = x is normal with mean 
E(Y|x) = Bo + Bix (11-40) 


and variance OF |x: Thus, the mean of the conditional distribution of Y given X = x is a 
simple linear regression model. Furthermore, there is a relationship between the correlation 
coefficient p and the slope B,. From Equation 11-38 we see that if p = 0, then B, = 0, which 
implies that there is no regression of Y on X. That is, knowledge of X does not assist us in 
predicting Y. 

The method of maximum likelihood may be used to estimate the parameters Bo and B4. It 
can be shown that the maximum likelihood estimators of those parameters are 


o= Y- ÂX (11-41) 


T> 


and 


i = E = > (11-42) 
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We note that the estimators of the intercept and slope in Equations 11-41 and 11-42 are 
identical to those given by the method of least squares in the case where X was assumed to be 
a mathematical variable. That 1s, the regression model with Y and X jointly normally distrib- 
uted is equivalent to the model with X considered as a mathematical variable. This follows 
because the random variables Y given X — x are independently and normally distributed with 
mean B, + Bx and constant variance OF |x. These results will also hold for any joint distribu- 
tion of Y and X such that the conditional distribution of Y given X is normal. 

It is possible to draw inferences about the correlation coefficient p in this model. The 
estimator of p is the sample correlation coefficient 


< S 
R "TEC X — sn (11-43) 
x-X?$0o-Y 
2 ) 2, ) 
Note that 
. SS MP 
& = (=) R (11-44) 


so the slope B, is just the sample correlation coefficient R multiplied by a scale factor that is 
the square root of the “spread” of the Y values divided by the "spread" of the X values.Thus, 
B, and R are closely related, although they provide somewhat different information. The 
sample correlation coefficient R measures the linear association between Y and X, while B, 
measures the predicted change in the mean of Y for a unit change in X. In the case of a math- 
ematical variable x, R has no meaning because the magnitude of R depends on the choice of 
spacing of x. We may also write, from Equation 11-44, 


; ao Sxx  BiSxy SSR 
kel ec eu 
Sve SS. 5p 


which is just the coefficient of determination. That is, the coefficient of determination R? is 
just the square of the correlation coefficient between Y and X. 
It is often useful to test the hypotheses 


Hp: p = 0 
Hyp #0 (11-45) 


The appropriate test statistic for these hypotheses is 





which has the ¢ distribution with n — 2 degrees of freedom if Hp: p = 0 is true. Therefore, we 
would reject the null hypothesis if [f| > t./2,-2. This test is equivalent to the test of the 


EXAMPLE 11-8 
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hypothesis H,: B, = 0 given in Section 11-6.1. This equivalence follows directly from 
Equation 10-51. 
The test procedure for the hypothesis is 


Ho: p = po 
Hi: p + Po (11-47) 


where pọ # 0 is somewhat more complicated. For moderately large samples (say, n = 25) the 
statistic 


T] PAR 
L&R 





l 
Z = arctanh R = 2 In (11-48) 


is approximately normally distributed with mean and variance 








uz = arctanh p = 5,11 a3 
— p = 


respectively. Therefore, to test the hypothesis Hp: p = po, we may use the test statistic 


Zo = (arctanh R — arctanh pg)(n — 3)!” (11-49) 


and reject Hy: p = po if the value of the test statistic in Equation 11-49 is such that |zọ| > Z,/2. 
It is also possible to construct an approximate 100(1 — «)% confidence interval for p, using 
the transformation in Equation 10-55. The approximate 100(1 — a)% confidence interval is 


Ze Z 
tanh (arcanh = e SS ey) (11-50) 
n des 





where tanh u = (e" — e ")/(e" + e "). 


In Chapter 1 (Section 1-3) an application of regression analysis 1s described in which an engineer 
at a semiconductor assembly plant is investigating the relationship between pull strength of a wire 
bond and two factors: wire length and die height. In this example, we will consider only one of 
the factors, the wire length. A random sample of 25 units is selected and tested, and the wire bond 
pull strength and wire length are observed for each unit. The data are shown in Table 1-2. We as- 
sume that pull strength and wire length are jointly normally distributed. 

Figure 11-13 shows a scatter diagram of wire bond strength versus wire length. We have 
used the Minitab option of displaying box plots of each individual variable on the scatter 
diagram. There is evidence of a linear relationship between the two variables. 

The Minitab output for fitting a simple linear regression model to the data is shown on the 
following page. 
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Strength 
e 


O 5 10 15 20 
Wire length 


Figure 11-13 Scatter plot of wire bond strength versus wire length, 
Example 11-8. 


Regression Analysis: Strength versus Length 


The regression equation is 
Strength = 5.11 + 2.90 Length 


Predictor Coef SE Coef T P 
Constant 5.115 1.146 4.46 0.000 
Length 2.9027 0.1170 24.80 0.000 


S = 3.093 R-Sq = 96.4% R-Sq(adj) = 96.2% 
PRESS = 272.144 R-Sq(pred) = 95.54% 


Analysis of Variance 


Source DF SS MS F P 
Regression 1 5885.9 615.08 0.000 
Residual Error 23 2O 9.6 

Total 24 6105.9 





Now S,,, = 698.56 and S, = 2027.7132, and the sample correlation coefficient is 


Sty 2027.7132 ogis 
—— ee 
[S..S$7]? — [(698.560)(6105.9)]' 


Note that r* = (0.9818)? = 0.9640 (which is reported in the Minitab output), or that approx- 
imately 96.40% of the variability in pull strength is explained by the linear relationship to wire 
length. 

Now suppose that we wish to test the hypothesis 


Ao: p = 0 
H;:p # 0 
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with a = 0.05. We can compute the t-statistic of Equation 11-46 as 


" rVn — 2 E 
Vw Apu VI = 0.9640 


0.9818 V23 
= 24.8 


This statistic is also reported in the Minitab output as a test of Hy: B, = 0. Because f 995.93 = 
2.069, we reject Ho and conclude that the correlation coefficient p # 0. 

Finally, we may construct an approximate 95% confidence interval on p from Equation 
10-57. Since arctanh r — arctanh 0.9818 — 2.3452, Equation 11-50 becomes 


1.96 1.96 
tanh| 2.3452 — —— |= p = tanh | 2.3452 + —— 


which reduces to 


EXERCISES FOR SECTION 11-10 


11-55. The final test and exam averages for 20 randomly 

selected students taking a course in engineering statistics and a 

course in operations research follow. Assume that the final av- 

erages are jointly normally distributed. 

(a) Find the regression line relating the statistics final average 
to the OR final average. 

(b) Test for significance of regression using a = 0.05. 


Statistics 86 13 69 75 90 
OR 80 81 75 81 92 
Statistics 94 83 86 71 | 65 
OR 95 80 81 76 12 
Statistics 84 ql 62 90 | 83 
OR 85 12 65 93 81 
Statistics 75 71 76 84 97 
OR 70 73 12 80 98 


(c) Estimate the correlation coefficient. 
(d) Test the hypothesis that p = 0, using a = 0.05. 
(e) Test the hypothesis that p = 0.5, using a = 0.05. 


(f) Construct a 95% confidence interval for the correlation 
coefficient. 


11-56. The weight and systolic blood pressure of 26 ran- 

domly selected males in the age group 25 to 30 are shown in 

the following table. Assume that weight and blood pressure 

are jointly normally distributed. 

(a) Find a regression line relating systolic blood pressure to 
weight. 

(b) Test for significance of regression using a = 0.05. 


3/22 


V22 


0.9585 = p = 0.9921 


Systolic Systolic 

Subject Weight BP Subject Weight BP 
1 165 130 14 172 153 
2 167 133 15 159 128 
3 180 150 16 168 132 
4 155 128 17 174 149 
5 212 151 18 183 158 
6 173 146 19 215 150 
7 190 150 20 195 163 
8 210 140 21 180 156 
9 200 148 22 143 124 
10 149 125 23 240 170 
11 158 133 24 235 165 
12 169 135 25 192 160 
13 170 150 26 187 159 


(c) Estimate the correlation coefficient. 

(d) Test the hypothesis that p = 0, using a = 0.05. 

(e) Test the hypothesis that p = 0.6, using a = 0.05. 

(f) Construct a 95% confidence interval for the correlation 
coefficient. 


11-57. Consider the NFL data introduced in Exercise 11-4. 

(a) Estimate the correlation coefficient between the number of 
games won and the yards rushing by the opponents. 

(b) Test the hypothesis Hp: p = 0 versus H,: p # 0 using 
a = 0.05. What is the P-value for this test? 

(c) Construct a 95% confidence interval for p. 

(d) Test the hypothesis Hj: p = —0.7 versus Hj: p # —0.7 
using a = 0.05. Find the P-value for this test. 
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11-58. Show that the t-statistic in Equation 11-46 for testing 

Hy: p = O is identical to the t-statistic for testing Hp: Bı = 0. 

11-59. A random sample of 50 observations was made on 

the diameter of spot welds and the corresponding weld shear 

strength. 

(a) Given that r — 0.62, test the hypothesis that p — 0, using 
a = 0.01. What is the P-value for this test? 

(b) Find a 99% confidence interval for p. 

(c) Based on the confidence interval in part (b), can you con- 
clude that p — 0.5 at the 0.01 level of significance? 


11-60. Suppose that a random sample of 10,000 (X, Y) 

pairs yielded a sample correlation coefficient of r — 0.02. 

(a) What is the conclusion that you would reach if you tested 
Ay: p = Ousing a = 0.05? What is the P-value for this test? 

(b) Comment on the practical significance versus the statisti- 
cal significance of your answer. 


11-61. The following data gave X = the water content of 
snow on April 1 and Y = the yield from April to July 
(in inches) on the Snake River watershed in Wyoming for 
1919 to 1935. (The data were taken from an article in 
Research Notes, Vol. 61, 1950, Pacific Northwest Forest 
Range Experiment Station, Oregon) 


x p x 2; 


23.1 10.5 37,9 22.8 
32.8 16.7 30.5 14.1 
31.8 18.2 25l 12.9 
32.0 17.0 12.4 8.8 
30.4 16.3 35.1 17.4 
24.0 10.5 31.5 14.9 
395 23d 21.1 10.5 
24.2 12.4 27.6 16.1 
32,5 24.9 


(a) Estimate the correlation between Y and X. 

(b) Test the hypothesis that p = 0, using a = 0.05. 

(c) Fit a simple linear regression model and test for signifi- 
cance of regression using a = 0.05. What conclusions 
can you draw? How is the test for significance of regres- 
sion related to the test on p in part (b)? 

(d) Test the hypothesis Hy: Bo = 0 versus H,: By # 0 and 
draw conclusions. Use a = 0.05. 

(e) Analyze the residuals and comment on model adequacy. 


11-62. A random sample of n = 25 observations was made 

on the time to failure of an electronic component and the tem- 

perature in the application environment in which the compo- 

nent was used. 

(a) Given that r — 0.83, test the hypothesis that p — 0, using 
a = 0.05. What is the P-value for this test? 

(b) Find a 95% confidence interval on p. 
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(c) Test the hypothesis Ho: p = 0.8 versus H,: p € 0.8, using 
a = 0.05. Find the P-value for this test. 


Supplemental Exercises 


11-63. Show that, for the simple linear regression model, 
the following statements are true: 


n 


> (y; — $)x; = 0 


i=1 


(a) 2, (y; — $) = 0 (b) 


© x hay 

11-64. An article in the ZEEE Transactions on 
Instrumentation and Measurement (“Direct, Fast, and 
Accurate Measurement of Vy and K of MOS Transistor Using =. 
V_-Sift Circuit,’ Vol. 40, 1991, pp. 951—955) described the 
use of a simple linear regression model to express drain cur- 
rent y (in milliamperes) as a function of ground-to-source 
voltage x (in volts). The data are as follows: 


y x 1A x 


0.734 1.1 1.50 1.6 
0.886 1.2 1.66 1.7 
1.04 1.3 1.81 1.8 
1.19 1.4 1.97 1.9 
1.35 1.5 2:12 2.0 


(a) Draw a scatter diagram of these data. Does a straight-line 
relationship seem plausible? 

(b) Fit a simple linear regression model to these data. 

(c) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(d) Find a 95% confidence interval estimate on the slope. 

(e) Test the hypothesis Ho: By = 0 versus H,: By ~ 0 using 
a = 0.05. What conclusions can you draw? 


11-65. The strength of paper used in the manufacture of 
cardboard boxes ( y) is related to the percentage of hardwood 
concentration in the original pulp (x). Under controlled condi- gu 
tions, a pilot plant manufactures 16 samples, each from a dif- © 
ferent batch of pulp, and measures the tensile strength. The 

data are shown in the table that follows: 


(a) Fit a simple linear regression model to the data. 

(b) Test for significance of regression using a = 0.05. 

(c) Construct a 90% confidence interval on the slope pj. 

(d) Construct a 90% confidence interval on the intercept Bo. 

(e) Construct a 95% confidence interval on the mean strength 
atx — 2.5. 

(f) Analyze the residuals and comment on model adequacy. 


y 101.4 117.4 117.1 106.2 
x 1.0 1.5 1.5 1.5 
y 131.9 146.9 146.8 133.9 
x 2.0 2.0 2.2 2.4 
y 111.0 123.0 125.1 145.2 
X 2.5 2:9 2.8 2.8 
y 134.3 144.5 143.7 146.9 
xX 3.0 3.0 3.2 3.3 


11-66. The vapor pressure of water at various temperatures 
follows: 


Observation Vapor pressure 
Number, i Temperature (K) (mm Hg) 
l 273 4.6 
2 283 9.2 
3 295 17.5 
4 303 31.8 
5 313 55.3 
6 323 92:9 
7 333 149.4 
8 343 233.7 
9 353 355.1 
10 363 525.8 
11 373 760.0 


(a) Draw a scatter diagram of these data. What type of rela- 
tionship seems appropriate in relating y to x? 

(b) Fit a simple linear regression model to these data. 

(c) Test for significance of regression using a = 0.05. What 
conclusions can you draw? 

(d) Plot the residuals from the simple linear regression model 
versus y;. What do you conclude about model adequacy? 


(e) The Clausis-Clapeyron equation states that In(P,)«— Ł, 
where P, is the vapor pressure of water. Repeat parts 
(a)-(d). using an appropriate transformation. 


11-67. An electric utility is interested in developing a model 
relating peak hour demand ( y in kilowatts) to total monthly en- 
ergy usage during the month (x, in kilowatt hours). Data for 50 
residential customers are shown in the following table. 

(a) Draw a scatter diagram of y versus x. 

(b) Fit the simple linear regression model. 

(c) Test for significance of regression using a = 0.05. 

(d) Plot the residuals versus y; and comment on the underly- 
ing regression assumptions. Specifically, does it seem that 
the equality of variance assumption is satisfied? 

(e) Find a simple linear regression model using Vy as the 
response. Does this transformation on y stabilize the in- 
equality of variance problem noted in part (d) above? 
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Customer x y Customer x y 

1 679 0.79 26 1434 0.31 
2 292 0.44 27 837 4.20 
3 1012 0.56 28 1748 4.88 
4 493 0.79 29 1381 3.48 
5 582 2.70 30 1428 17.58 
6 1156 3.64 31 1255 2.63 
7 997 4.73 32 1771 4.99 
8 2189 9.50 33 370 0.59 
9 1097 5.34 34 2316 8.19 
10 2078 6.85 35 1130 4.79 
11 1818 5.84 36 463 0.51 
[2 1700 3.2] 37 770 1.74 
13 747 3.25 38 724 4.10 
14 2030 4.43 39 808 3.94 
15 1643 3.16 40 790 0.96 
16 414 0.50 41 783 3.29 
17 354 0.17 42 406 0.44 
18 1276 1.88 43 1242 3.24 
19 745 0.77 44 658 2.14 
20 795 3.70 45 1746 ari 
21 540 0.56 46 895 4.12 
27 874 1.56 47 1114 1.90 
23 1543 5.28 48 413 0.51 
24 1029 0.64 49 1787 8.33 
25 710 4.00 50 3560 14.94 
11-68. Consider the following data. Suppose that the rela- 


tionship between Y and x is hypothesized to be Y = (Bo + 
Bix + e) !. Fit an appropriate model to the data. Does the as- 
sumed model form seem reasonable? 


xX 10 15 18 12 
y 0.1 0.13 0.09 0.15 
x 9 8 11 6 
y 0.20 0.21 0.18 0.24 
11-69. Consider the weight and blood pressure data in 


Exercise 11-56. Fit a no-intercept model to the data, and com- 
pare it to the model obtained in Exercise 11-56. Which model 
is superior? 

11-70. The following data, adapted from Montgomery, 
Peck, and Vining (2001), present the number of certified men- 
tal defectives per 10,000 of estimated population in the United 
Kingdom ( y) and the number of radio receiver licenses issued 
(x) by the BBC (in millions) for the years 1924 through 1937. 
Fit a regression model relating y and x. Comment on the 
model. Specifically, does the existence of a strong correlation 
imply a cause-and-effect relationship? 


@ 


@ 
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Year y x Year y X 
1924 8 1.350 1931 16 4.620 
1925 8 1.960 1932 18 5.497 
1926 9 2.270 1933 19 6.260 
1927 10 2.483 1934 20 7.012 
1928 11 2.730 1935 21 7.618 
1929 11 3.091 1936 22 8.131 
1930 12 3.674 1937 23 8.593 
11-71. An article in Air and Waste ("Update on Ozone 


Trends in California's South Coast Air Basin," Vol. 43, 1993) 
studied the ozone levels on the South Coast air basin of 
California for the years 1976—1991. The author believes that the 
number of days that the ozone level exceeds 0.20 parts per mil- 
lion depends on the seasonal meteorological index (the seasonal 
average 850 millibar temperature). The data follow: 


Year Days Index Year Days Index 
1976 9] 16.7 1984 81 18.0 
1977 105 17.1 1985 65 17.2 
1978 106 18.2 1986 61 16.9 
1979 108 18.1 1987 48 17.1 
1980 88 17.2 1988 61 18.2 
1981 91 18.2 1989 43 17.3 
1982 58 16.0 1990 33 17.5 
1983 82 17.2 1991 36 16.6 


(a) Construct a scatter diagram of the data. 

(b) Fit a simple linear regression model to the data. Test for 
significance of regression. 

(c) Find a 95% CI on the slope B4. 

(d) Analyze the residuals and comment on model adequacy. 

11-72.  Anarticle in the Journal of Applied Polymer Science 

(Vol. 56, pp. 471—476, 1995) studied the effect of the mole 

ratio of sebacic acid on the intrinsic viscosity of copolyesters. 

The data follow: 


Mole ratio 
xX 1.0 | 0.9 | 0.8 | 0.7 | 0.6 | 0.5 


0.4 | 0.3 





Viscosity 
y 0.45 | 0.20 | 0.34 | 0.58 | 0.70 | 0.57 | 0.55 | 0.44 


(a) Construct a scatter diagram of the data. 

(b) Fit a simple linear repression module. 

(c) Test for significance of regression. Calculate R? for the 
model. 

(d) Analyze the residuals and comment on model adequacy. 
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11-73. Suppose that we have n pairs of observations (x; y;) 
such that the sample correlation coefficient r is unity (approx- 
imately). Now let z; = y? and consider the sample correlation 
coefficient for the n-pairs of data (x;, z;). Will this sample cor- 
relation coefficient be approximately unity? Explain why or 
why not. 

11-74. The grams of solids removed from a material ( y) is 
thought to be related to the drying time. Ten observations 
obtained from an experimental study follow: 





x12.13.013.514.014.5/5.015.516.016.5 | 7.0 


(a) Construct a scatter diagram for these data. 

(b) Fit a simple linear regression model. 

(c) Test for significance of regression. 

(d) Based on these data, what is your estimate of the mean 
grams of solids removed at 4.25 hours? Find a 9596 confi- 
dence interval on the mean. 

(e) Analyze the residuals and comment on model adequacy. 


11-75. Two different methods can be used for measuring 
the temperature of the solution in a Hall cell used in aluminum 
smelting, a thermocouples implanted in the cell and an indi- 
rect measurement produced from an IR device. The indirect 
method is preferable became the thermocouples are eventually 
destroyed by the solution. Consider the following 10 measure- 
ments: 


Thermocouple 921 935 916 920 940 
IR 918 934 924 921 945 
Thermocouple 936 929 940 933 92] 
IR 930 919 943 932 935 


(a) Construct a scatter diagram for these data, letting x = 
thermocouple measurement and y = IR measurement. 

(b) Fit a simple linear regression model. 

(c) Test for significance a regression and calculate R?. What 
conclusions can you draw? 

(d) Is there evidence to support a claim that both 
devices produce equivalent temperature measurements? 
Formulate and test an appropriate hypothesis to support 
this claim. 

(e) Analyze the residuals and comment on model adequacy. 
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MIND-EXPANDING EXERCISES 


11-76. Consider the simple linear regression model 
Y = Bo + Bix + e, with E(e) = 0, V(e) = o^, and the 
errors e uncorrelated. 

(a) Show that cov(Bo, B1) = —xo?/S,,. 

(b) Show that cov(Y, 84) = 0. 

11-77. Consider the simple linear regression model 
Y = Bo + Bix + e, with E(e) = 0, V(e) = o°, and the 
errors e uncorrelated. 

(a) Show that E(627) = E(MS;) = o°. 

(b) Show that E(MS,) = o? + B7S,,. 

11-78. Suppose that we have assumed the straight-line 
regression model 


Y= Bo + Bix, + € 


but the response is affected by a second variable x, such 
that the true regression function is 


IOC) = [ey a= [Euge 525 


Is the estimator of the slope in the simple linear regres- 
sion model unbiased? 


11-79. Suppose that we are fitting a line and we wish 
to make the variance of the regression coefficient B, as 
small as possible. Where should the observations x; 
i = 1,2,...,n, be taken so as to minimize V(,)? Discuss 
the practical implications of this allocation of the x,. 
11-80. Weighted Least Squares. Suppose that we 
are fitting the line Y = By + x + e, but the variance 
of Y depends on the level of x; that is, 


cg? 


DLL ILE 


IMPORTANT TERMS AND CONCEPTS 
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model parameters 
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Analysis of variance of regression model 


test in regression parameters 
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mean response checking 


Prediction interval on a 
future observation 


Correlation coefficient 
Empirical models 


where the w; are constants, often called weights. Show 
that for an objective function in whole each squared 
residual is multiplied by the reciprocal of the variance of 
the corresponding observation, the resulting weighted 
least squares normal equations are 


n n n 
Bo Nw ae Bin X wx; = D Wi Yi 
=| =] ei 
n P n n 
Bo wx; + By > wjx; = > WiXi yj 
=I Ei i=1 


Find the solution to these normal equations. The solutions 
are weighted least squares estimators of By and p,. 


11-81. Consider a situation where both Y and X are 
random variables. Let s, and s, be the sample standard 
deviations of the observed x's and y's, respectively. 
Show that an alternative expression for the fitted simple 
linear regression model ? = By + jx is 


Sy 


YS Doe lee a) 


11-82. Suppose that we are interested in fitting a 
simple linear regression model Y = By + Bix + e, 
where the intercept, Bo, is known. 


(a) Find the least squares estimator of pj. 

(b) What is the variance of the estimator of the slope in 
part (a)? 

(c) Find an expression for a 100(1 — a)% confidence 
interval for the slope B,. Is this interval longer than 
the corresponding interval for the case where both 
the intercept and slope are unknown? Justify your 


answer. 
Residual plots CD MATERIAL 
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11-4 SOME COMMENTS ON USES OF REGRESSION (CD ONLY) 


Historical Note 

Sir Francis Galton first used the term regression analysis in a study of the heights of fathers (x) 
and sons ( y). Galton fit a least squares line and used it to predict the son's height from the 
fathers height. He found that if a father's height was above average, the son's height would also 
be above average, but not by as much as the father's height was. A similar effect was observed 
for short heights. That is, the son's height “regressed” toward the average. Consequently, 
Galton referred to the least squares line as a regression line. 


Abuses of Regression. 

Regression is widely used and frequently misused; several common abuses of regression are 
briefly mentioned here. Care should be taken in selecting variables with which to construct re- 
gression equations and in determining the form of the model. It is possible to develop statisti- 
cally significant relationships among variables that are completely unrelated in a causal sense. 
For example, we might attempt to relate the shear strength of spot welds with the number of 
empty parking spaces in the visitor parking lot. A straight line may even appear to provide a 
good fit to the data, but the relationship 1s an unreasonable one on which to rely. You can't 
increase the weld strength by blocking off parking spaces. A strong observed association be- 
tween variables does not necessarily imply that a causal relationship exists between those 
variables. This type of effect is encountered fairly often in retrospective data analysis, and 
even in observational studies. Designed experiments are the only way to determine cause- 
and-effect relationships. 

Regression relationships are valid only for values of the regressor variable within the 
range of the original data. The linear relationship that we have tentatively assumed may be 
valid over the original range of x, but it may be unlikely to remain so as we extrapolate—that 
is, 1f we use values of x beyond that range. In other words, as we move beyond the range of 
values of x for which data were collected, we become less certain about the validity of the 
assumed model. Regression models are not necessarily valid for extrapolation purposes. 

Now this does not mean don t ever extrapolate. There are many problem situations in 
science and engineering where extrapolation of a regression model is the only way to even 
approach the problem. However, there is a strong warning to be careful. A modest extrapola- 
tion may be perfectly all right in many cases, but a large extrapolation will almost never 
produce acceptable results. 


11-8.3 Lack-of-Fit Test (CD Only) 


Regression models are often fit to data to provide an empirical model when the true relation- 
ship between the variables Y and x is unknown. Naturally, we would like to know whether the 
order of the model tentatively assumed is correct. This section describes a test for the validity 
of this assumption. 

The danger of using a regression model that is a poor approximation of the true functional 
relationship is illustrated in Fig. S11-1. Obviously, a polynomial of degree two or greater in x 
should have been used in this situation. 

We present a test for the “goodness of fit" of the regression model. Specifically, the hy- 
potheses we wish to test are 


Ho: The simple linear regression model is correct. 


Hj: The simple linear regression model is not correct. 


11-1 


11-2 


Figure S11-1 A 
regression model 
displaying lack of fit. 


The test involves partitioning the error or residual sum of squares into the following 
components: 


S SE = SS pg e SSror (SI1-1) 


where SS; is the sum of squares attributable to pure error, and SS, 5; is the sum of squares at- 
tributable to the lack of fit of the model. To compute SS», we must have repeated observations 
on the response Y for at least one level of x. Suppose we have n total observations such that 


Vite Vi20+++>Vin, repeated observations at x, 
Ya V22» +--> Y2n, repeated observations at x» 


Yml» m2» <--> Ymn, Tepeated observations at xm 


Note that there are m distinct levels of x. The contribution to the pure-error sum of squares at 
x, (say) would be 


nı 


(Yiu — XY (S11-2) 


u=1 


where y, represents the average of all n; repeat observations on the response y at x,. The total sum 
of squares for pure error would be obtained by summing Equation S11-2 over all levels of x as 
nj 


SSPE = 2, M (Viu = yj)" (S11-3) 


u=1 


There are ny. = Min (n; — 1) = n — m degrees of freedom associated with the pure-error 
sum of squares. The sum of squares for lack of fit is simply 


SS ror E SS; m SSpr (S11-4) 
with n — 2 — n,, = m — 2 degrees of freedom. The test statistic for lack of fit would then be 


S SSror/(m — 2) _ M3ror 
: SSpy/(n um m) MSpg 





(S11-5) 


and we would reject the hypothesis that the model adequately fits the data if fo > f, 5, 


EXAMPLE S11-1 


11-3 


This test procedure may be easily introduced into the analysis of variance conducted for 
the significance of regression. If the null hypothesis of model adequacy is rejected, the model 
must be abandoned and attempts must be made to find a more appropriate model. If Ho is not 
rejected, there is no apparent reason to doubt the adequacy of the model, and MS», and MS, or 
are often combined to estimate o°. 


Consider the data on two variables y and x shown below. Fit a simple linear regression model 
and test for lack of fit, using a = 0.05. 


x i X y 

1.0 2.3, 1.8 5.6 3.9572:9, 2.1 
2.0 25 6.0 3.4, 3.2 

3.9 1.8, 3.7 6.5 3.4 

4.0 20,2505 2.2 6:9 5.0 

5.0 2:0 





The regression model is y = 1.697 + 0.259x, and the regression sum of squares is SSp = 3.4930. 
The pure-error sum of squares is computed as follows: 


Level of x ee we Degrees of Freedom 
u=] 
1.0 0.1250 l 
3.3 1.8050 1 
4.0 0.0166 2 
5.6 0.9800 2 
6.0 0.0200 1 
Total 3.0366 7 


The analysis of variance is summarized in Table S11-1. Since the lack-of-fit F-statistic is 
fo = 1.42, which has a P-value of P = 0.3276, we cannot reject the hypothesis that the tentative 
model adequately describes the data. We will pool lack-of-fit and pure-error mean squares to form 
the residual mean square that is the denominator mean square in the test for significance of 
regression. In addition, since the P-value for the statistic f, = 6.66 with 1 and 14 degrees of free- 
dom associated with significance of regression is P = 0.0216, we conclude that Q, # 0. 


In fitting a regression model to experimental data, a good practice is to use the lowest 
degree model that adequately describes the data. The lack-of-fit test may be useful in this 


Table S11-1 Analysis of Variance for Example S11-1 


Source of Sum of Degrees of Mean 

Variation Squares Freedom Square fo P-value 
Regression 3.4930 l 3.4930 6.66 0.0218 
Residual 7.3372 14 0.5241 

(Lack of fit) 4.3005 d 0.6144 1.42 0.3276 

(Pure error) 3.0366 7 0.4338 
Total 10.8300 15 
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windmill data. Wind velocity, x 


respect. However, it is always possible to fit a polynomial of degree n — 1 to n data points, 
and the experimenter should not consider using a model that is “saturated’’—that is, that has 
very nearly as many independent variables as observations on y. 


11-10 MORE ABOUT TRANSFORMATIONS (CD ONLY) 


An Example 
As noted earlier in Section 11-9, transformations can be very useful in many situations where 


the true relationship between the response Y and the regressor x is not well approximated by a 
straight line. The utility of a transformation is illustrated in the following example. 


EXAMPLE S11-2 | A research engineer is investigating the use of a windmill to generate electricity and has col- 
lected data on the DC output from this windmill and the corresponding wind velocity. The 
data are plotted in Figure S11-2 and listed in Table 811-2. 
Inspection of the scatter diagram indicates that the relationship between DC output Y and 
wind velocity (x) may be nonlinear. However, we initially fit a straight-line model to the data. 
The regression model is 


$ = 0.1309 + 0.2411x 


0.4 


0.0 à " 


Figure S11-3 Plot of 
residuals e; versus fit- 

ted values y; for the 04 08 12 16 20 24 
windmill data. 3 
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The summary statistics for this model are R? = 0.8745, MS, = 6? = 0.0557 and Fy = 
160.26 (the P value is «0.0001 ). 

A plot of the residuals versus y; is shown in Figure S11-3. This residual plot indicates 
model inadequacy and implies that the linear relationship has not captured all of the infor- 
mation in the wind speed variable. Note that the curvature that was apparent in the scatter 
diagram of Figure S11-2 is greatly amplified in the residual plots. Clearly some other model 
form must be considered. 

We might initially consider using a quadratic model such as 


y = Bo + Bix + Box? + € 


to account for the apparent curvature. However, the scatter diagram Figure S11-2 suggests that 
as wind speed increases, DC output approaches an upper limit of approximately 2.5. This is 
also consistent with the theory of windmill operation. Since the quadratic model will eventu- 
ally bend downward as wind speed increases, it would not be appropriate for these data. A more 
reasonable model for the windmill data that incorporates an upper asymptote would be 


l 
»-&«B(1)«- 


Table S11-2 Observed Values y; and Regressor Variable x; for 
Example S11-2 


Observation Wind Velocity DC Output, 
Number, i (mph), x; yj 
l! 5.00 1.582 
2 6.00 1.822 
3 3.40 1:057 
4 2.70 0.500 
5 10.00 2.236 
6 9.70 2.386 
d 9.55 2.294 
8 3.05 0.558 
9 8.15 2.166 
10 6.20 1.866 
11 2.90 0.653 
12 6.35 1.930 
13 4.60 1.562 
14 5.80 1.737 
15 7.40 2.088 
16 3.60 1.137 
17 7.85 2.179 
18 8.80 2.112 
19 7.00 1.800 
20 5.45 1.501 
21l 9.10 2.303 
22 10.20 2.310 
23 4.10 1.194 
24 3.95 1.144 


NO 
GN 


2.45 0.123 


11-6 


Figure S11-4 Plot of 
DC output versus 
x’ = 1/x for the wind- 


mill data. 
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Figure S11-4 is a scatter diagram with the transformed variable x' = 1/x. This plot appears lin- 
ear, indicating that the reciprocal transformation is appropriate. The fitted regression model is 


y = 2.9789 — 6.9345x' 


The summary statistics for this model are R? = 0.9800, MS, = 6? = 0.0089, and Fy = 1128.43 
(the P value is <0.0001). 

A plot of the residuals from the transformed model versus y is shown in Figure S11-5. 
This plot does not reveal any serious problem with inequality of variance. The normal proba- 
bility plot, shown in Figure 811-6, gives a mild indication that the errors come from a distri- 
bution with heavier tails than the normal (notice the slight upward and downward curve at the 
extremes). This normal probability plot has the z-score value plotted on the horizontal axis. 
Since there is no strong signal of model inadequacy, we conclude that the transformed model 
is satisfactory. 


Logistic Regression 

Linear regression often works very well when the response variable is quantitative. We now 
consider the situation where the response variable takes on only two possible values, 0 and 1. 
These could be arbitrary assignments resulting from observing a qualitative response. For 
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Figure S11-5 Plot of residuals versus Figure S11-6 Normal probability plot of 
fitted values y; for the transformed model the residuals for the transformed model for 


for the windmill data. the windmill data. 
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example, the response could be the outcome of a functional electrical test on a semiconductor 

device for which the results are either a "success," which means the device works properly, or 

a “failure,” which could be due to a short, an open, or some other functional problem. 
Suppose that the model has the form 


= Po © Dix; € (S11-6) 


and the response variable Y; takes on the values either 0 or 1. We will assume that the response 
variable Y; is a Bernoulli random variable with probability distribution as follows: 


Probability 
l n (yi = 1) = T; 


Now since E(e;) = 0, the expected value of the response variable is 
EY) = 1(uj) + 0(1 — m) 
This implies that 
EU) = By? Bx ms 
This means that the expected response given by the response function E(Y;) = By + Dix; 1s 
just the probability that the response variable takes on the value 1. 


There are some substantive problems with the regression model in Equation S11-6. First, 
note that if the response is binary, the error terms e; can only take on two values, namely, 


e —-l1—(BotBix) when ¥; = 1 
e; = —(Bo + Bix;) when Y; = 0 


Consequently, the errors in this model cannot possibly be normal. Second, the error variance 
is not constant, since 
2 _ _ 2 
0y, m ELY; E( Y;)} 
(1 — m,)’ar, + (0 — m) (1 — mj) 
m(l-— mj) 


Notice that this last expression is just 


o? = E(Y)[1 — E(Y,)] 


since E(Y;) = Bo + Bix; = 7;. This indicates that the variance of the observations (which is 
the same as the variance of the errors because e; = Y; — m, and m, is a constant) is a function 
of the mean. Finally, there is a constraint on the response function, because 


0x E(Y)-m;-1 
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This restriction can cause serious problems with the choice of a linear response function, as 
we have initially assumed in Equation $11-6. It would be possible to fit a model to the data for 
which the predicted values of the response lie outside the 0, 1 interval. 

Generally, when the response variable is binary, there is considerable empirical evidence 
indicating that the shape of the response function should be nonlinear. A monotonically 
increasing (or decreasing) S-shaped (or reverse S-shaped) function, such as shown in 
Figure S11-7, is usually employed. This function is called the logit response function, and has 
the form 

E(Y) Em | exp(Bo + Bix) _ (S11-7) 
1 + exp(Bo + Bix) 


or equivalently, 


1 


EW) = T Y exp[- (Bo + Bi] 


(S11-8) 


In logistic regression we assume that E(Y) is related to x by the logit function. It is easy to 
show that 


= exp% +P (S11-9) 


The quantity exp( By + 4x) on the right-hand side of Equation S11-9 is called the odds ra- 
tio. It has a straightforward interpretation: If the odds ratio is 2 for a particular value of x, it 
means that a success 1s twice as likely as a failure at that value of the regressor x. Notice that 
the natural logarithm of the odds ratio is a linear function of the regressor variable. Therefore 
the slope B, is the change in the log odds that results from a one-unit increase in x. This means 
that the odds ratio changes by eP' when x increases by one unit. 

The parameters in this logistic regression model are usually estimated by the method of 
maximum likelihood. For details of the procedure, see Montgomery, Peck, and Vining 
(2001). Minitab will fit logistic regression models and provide useful information on the 
quality of the fit. 


1.2 


Figure S11-7 Examples of the logistic response function. (a) E(Y) = 1/(1 + e $9 195) (b) E(Y) = 1/(1 + e OOF), 
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We will illustrate logistic regression using the data on launch temperature and O-ring fail- 
ure for the 24 space shuttle launches prior to the Challenger disaster of January 1986. There 
are six O-rings used on the rocket motor assembly to seal field joints. The table below presents 
the launch temperatures. A 1 in the “O-Ring Failure" column indicates that at least one O-ring 
failure had occurred on that launch. 


O-Ring O-Ring O-Ring 
Temperature Failure Temperature Failure Temperature Failure 
53 1 68 0 75 0 
56 1 69 0 75 1 
57 1 70 0 76 0 
63 0 70 1 76 0 
66 0 70 1 78 0 
67 0 70 1 79 0 
67 0 T2 0 80 0 
67 0 73 0 81 0 


Figure S11-8 is a scatter plot of the data. Note that failures tend to occur at lower temperatures. 
The logistic regression model fit to this data from Minitab is shown in the following boxed 
display. 


Binary Logistic Regression: O-Ring Failure versus Temperature 


Link Function: Logit 
Response Information 


Variable Value Count 
O-Ring F I 7 
0 1j 
Total 24 


Logistic Regression Table 


Odds 95% 
Predictor Coef SE Coef Z P Ratio Lower 
Constant 10.875 5.703 1.91 0.057 
Temperat —0.17132 0.08344 —2.05 0.040 0.84 0.72 


Log-Likelihood = —11.515 
Test that all slopes are zero: G = 5.944, DF = 1, P-Value = 0.015 





The fitted logistic regression model is 


. 1 
P 1+ exp[ -(10.875 — 0.17132x)] 


The standard error of the slope B, is se(By) — 0.08344. For large samples, Bi has an approximate 
normal distribution, and so ,/se(,) can be compared to the standard normal distribution to 
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Figure S11-8 Scatter plot of O-ring failures 
versus launch temperature for 24 space shuttle 


flights. 
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Figure S11-9 Probability of O-ring failure 
versus launch temperature (based on a 
logistic regression model). 


test Hy: Bı = 0. Minitab performs this test. The P-value is 0.04, indicating that temperature 
has a significant effect on the probability of O-ring failure. The odds ratio 1s 0.84, so every one 
degree increase in temperature reduces the odds of failure by 0.84. Figure S11-9 shows the 
fitted logistic regression model. The sharp increase in the probability of O-ring failure is very 
evident in this graph. The actual temperature at the Challenger launch was 31°F. This is well 
outside the range of other launch temperatures, so our logistic regression model is not likely 
to provide highly accurate predictions at that temperature, but it is clear that a launch at 31°F 
is almost certainly going to result in O-ring failure. 

It is interesting to note that all of these data were available prior to launch. However, en- 
gineers were unable to effectively analyze the data and use them to provide a convincing ar- 
gument against launching Challenger to NASA managers. Yet a simple regression analysis of 
the data would have provided a strong quantitative basis for this argument. This 1s one of the 
more dramatic instances that points out why engineers and scientists need a strong back- 
ground in basic statistical techniques. 
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After careful study of this chapter, you should be able to do the following: 


1. Use multiple regression techniques to build empirical models to engineering and scientific data 


2. Understand how the method of least squares extends to fitting multiple regression models 
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3. Assess regression model adequacy 
4. Test hypotheses and construct confidence intervals on the regression coefficients 


5. Use the regression model to estimate the mean response and to make predictions and to construct 
confidence intervals and prediction intervals 


. Build regression models with polynomial terms 


NO 


. Use indicator variables to model categorical regressors 


8. Use stepwise regression and other model building techniques to select the appropriate set of vari- 
ables for a regression model 


CD MATERIAL 
9. Understand how ridge regression provides an effective way to estimate model parameters where 
there is multicollinearity. 


10. Understand the basic concepts of fitting a nonlinear regression model. 


Answers for many odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e-Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e-Text immediately following the section they accompany. 


12-1 MULTIPLE LINEAR REGRESSION MODEL 
12-1.1 Introduction 


Many applications of regression analysis involve situations in which there are more than one 
regressor variable. A regression model that contains more than one regressor variable is called 
a multiple regression model. 

As an example, suppose that the effective life of a cutting tool depends on the cutting speed 
and the tool angle. A multiple regression model that might describe this relationship is 


Y= Do + Dix, + Dx» +e (12-1) 


where Y represents the tool life, x, represents the cutting speed, x, represents the tool angle, 
and e is a random error term. This is a multiple linear regression model with two regressors. 
The term /inear is used because Equation 12-1 is a linear function of the unknown parameters 
Bo, By, and p». 

The regression model in Equation 12-1 describes a plane in the three-dimensional space 
of Y, xı, and x,. Figure 12-1(a) shows this plane for the regression model 


E(Y) = 50 + 10x; + 7x; 


where we have assumed that the expected value of the error term is zero; that is E(e) = 0. 
The parameter D, is the intercept of the plane. We sometimes call 8, and B, partial regres- 
sion coefficients, because D, measures the expected change in Y per unit change in x, when 
x, is held constant, and B, measures the expected change in Y per unit change in x, when x, 
is held constant. Figure 12-1(b) shows a contour plot of the regression model— that is, lines 
of constant E(Y) as a function of x, and x,. Notice that the contour lines in this plot are 
straight lines. 
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(a) 


(b) 


Figure 12-1 (a) The regression plane for the model E(Y) = 50 + 10x, + 7x. (b) The contour plot. 


In general, the dependent variable or response Y may be related to k independent or 
regressor variables. The model 


Y = Bo + Dixi + Borg toc + Bix + € (12-2) 


is called a multiple linear regression model with k regressor variables. The parameters B, 7 = 0, 
l, ..., k, are called the regression coefficients. This model describes a hyperplane in the k- 
dimensional space of the regressor variables {x;}. The parameter B; represents the expected change 
in response Y per unit change in x; when all the remaining regressors x; (i # j) are held constant. 

Multiple linear regression models are often used as approximating functions. That is, the 
true functional relationship between Y and x,, x», ..., x, is unknown, but over certain ranges of 
the independent variables the linear regression model is an adequate approximation. 

Models that are more complex in structure than Equation 12-2 may often still be analyzed 
by multiple linear regression techniques. For example, consider the cubic polynomial model 
in one regressor variable. 


Y = Bo + Bix + Box? + B3x? + € (12-3) 
If we let x, = x, x = x?, x; = x^, Equation 12-3 can be written as 
Y = Bo + Bix, + Box; + B3x3 + € (12-4) 


which is a multiple linear regression model with three regressor variables. 

Models that include interaction effects may also be analyzed by multiple linear regres- 
sion methods. An interaction between two variables can be represented by a cross-product 
term in the model, such as 


Y = Bo + Bix, + Box. + Bizxixz + € (12-5) 
If we let x; = x,x, and B4 = 9,, Equation 12-5 can be written as 
Y = Bo + Bix, + Box; + B3x3 + € 


which is a linear regression model. 
Figure 12-2(a) and (b) shows the three-dimensional plot of the regression model 


Y = 50 + 10x, + 7x; + 5x x 





12-1 MULTIPLE LINEAR REGRESSION MODEL 413 


1000 
800 





(b) (b) 


Figure 12-2 (a) Three-dimensional plot of the regression Figure 12-3 (a) Three-dimensional plot of the regression 
model E(Y) = 50 + 10x, + 7x; + 5xjx;. (b) The contour model E(Y) = 800 + 10x, + 7x, — 8.5x? — 5x5 + 4xix». 


plot. 


(b) The contour plot. 


and the corresponding two-dimensional contour plot. Notice that, although this model is a lin- 
ear regression model, the shape of the surface that is generated by the model is not linear. In 
general, any regression model that is linear in parameters (the (9s) is a linear regression 
model, regardless of the shape of the surface that it generates. 

Figure 12-2 provides a nice graphical interpretation of an interaction. Generally, interac- 
tion implies that the effect produced by changing one variable (x,, say) depends on the level 
of the other variable (x,). For example, Fig. 12-2 shows that changing x, from 2 to 8 produces 
a much smaller change in E(Y) when x, = 2 than when x, = 10. Interaction effects occur fre- 
quently in the study and analysis of real-world systems, and regression methods are one of the 
techniques that we can use to describe them. 

As a final example, consider the second-order model with interaction 


Y = Bo + Bix, + Box, + Bixi + Boox5  Bizxixo + € (12-6) 


If we let X3 = Xin X4 — x Xs — X1X5, B3 E Di B4 = 355, and B; = Di», Equation 12-6 can be 
written as a multiple linear regression model as follows: 


Y = Bo + Bux, + Box, + B3x3 + B4x4 + Bsxs + € 


Figure 12-3(a) and (b) how the three-dimensional plot and the corresponding contour plot for 
E(Y) = 800 + 10x, + 7x; — 8.5x? — 5x3 + 4Axqx; 
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These plots indicate that the expected change in Y when x, is changed by one unit (say) is a 
function of both x, and x;. The quadratic and interaction terms in this model produce a mound- 
shaped function. Depending on the values of the regression coefficients, the second-order 
model with interaction is capable of assuming a wide variety of shapes; thus, it is a very flex- 
ible regression model. 


12-1.2 Least Squares Estimation of the Parameters 
The method of least squares may be used to estimate the regression coefficients in the mul- 
tiple regression model, Equation 12-2. Suppose that n > k observations are available, and let 
x; denote the ith observation or level of variable x, The observations are 


(xj, Xiz <--> Xiks Yih i = 1,2,...,n and nk 


It is customary to present the data for multiple regression in a table such as Table 12-1. 
Each observation (x5, x;?, ... , Xj, y;), satisfies the model in Equation 12-2, or 


yi ^ Bo + Bixa + Boxio t c + BEX + €i 


k 
- Bot Y Dc € i = 1,2,...,n (227) 
jal 


The least squares function is 


k 2 
L= 25 e EP (y =" po 2, xy) (12-8) 
j= 


S 
S 


We want to minimize L with respect to Bo, Bi, ... , By} The least squares estimates of Bo, 
Bi, ..., By must satisfy 





oL n . ko 
Balaa 4 x (y A > By) = (12-9a) 
Bo Bibs Piscean DE i=l j=l 
and 
AL n . k . 
aa, . 7 22 Bo 2 Boy Jap =O j=1,2,...,k (12-90) 
b; Duo Biot Bz i=l j=l 





Simplifying Equation 12-9, we obtain the least squares normal Equations 
n 


n n n 
nBo + Bi Š xa + Bo Eu +o + BS = dy; 
i= i= i= 


i=] 


A n " n n n n 
r P " 
Po 2, Xa t Pi X xA + By > xax fe te al Dy Maxaxg = xi 
i= i=l =i i=] T 


^ n X n a n p n n 
Bo 2, T B, MN xaxa Ex B» X xxn a He B; bx = 2 d (12-10) 
i= i-l i=l i-l i=l 


Note that there are p = k + 1 normal Equations, one for each of the unknown regression 
coefficients. The solution to the normal Equations will be the least squares estimators of the 


EXAMPLE 12-1 
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Table 12-1 Data for Multiple Linear Regression 


y X| X5 Cs Xk 
Yı X11 X12 e. Nik 
y2 X21 X22 e. Xk 
Yn Xnl Xn2 LE Xpnk 


regression coefficients, Bo, B4, ... , Bj. The normal Equations can be solved by any method 
appropriate for solving a system of linear Equations. 


In Chapter 1, we used data on pull strength of a wire bond in a semiconductor manufacturing 

process, wire length, and die height to illustrate building an empirical model. We will use the 

same data, repeated for convenience in Table 12-2, and show the details of estimating the 

model parameters. A three-dimensional scatter plot of the data is presented in Fig. 1-13. Fig- 

ure 12-4 shows a matrix of two-dimensional scatter plots of the data. These displays can be 

helpful in visualizing the relationships among variables in a multivariable data set. 
Specifically, we will fit the multiple linear regression model 


Y = Bo + Bix + Box + € 
where Y = pull strength, x, = wire length, and x, = die height. From the data in Table 12-2 
we calculate 
25 


n= 25, Wy; = 725.82 
i=l 


25 25 
X xı = 206, 9 x; = 8,294 
i-l i-l 


25 25 
Sd x7 2,96, X xh = 3,531,848 
i=l i-l 


25 25 25 
X, xaxn = 77,177, > xay; = 8,008.37, X, xny; = 274,811.31 
i=] p i-l 


Table 12-2 Wire Bond Data for Example 11-1 


Observation 
Number 


— = 
Pe COO ON WN Nn A L N e 


— — 
WW N 


Pull Strength Wire Length Die Height Observation Pull Strength Wire Length Die Height 

y EO X5 Number y A 207 

9.95 2 50 14 11.66 2 360 
24.45 8 110 15 21.65 4 205 
31.75 11 120 16 17.89 4 400 
35.00 10 550 17 69.00 20 600 
25.02 8 295 18 10.30 l 585 
16.86 4 200 19 34.93 10 540 
14.38 2 375 20 46.59 15 250 

9.60 2 32 21 44.88 15 290 
24.35 9 100 22 54.12 16 510 
27.50 8 300 23 56.63 I7 590 
17.08 4 412 24 22.13 6 100 
37.00 11 400 25 21:153 2 400 
41.95 12 500 
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Figure 12-4 Matrix of scatter plots (from Minitab) for the wire bond pull 
strength data in Table 12-2. 


For the model Y = Bo + B,x, + Box; + e, the normal Equations 12-10 are 
z a n " n n 
nBo + Bi xi ct Bo xn = Xvi 
i=] i-l j=l 
n n n n 
A A 2 A = 
Bo Xn + Bi xa + BD xax = X xay: 
i-l ied i-l T 
n n n n 
A A A es 
Bo > xi; + Bi S xax + Bo Dx = X xyi 
i=l i=l i-l je 
Inserting the computed summations into the normal equations, we obtain 


258, + — 2068, + 82940, = 725.82 
2068, + 23968, 77,1778, = 8,008.37 
82948, + 77,1778, + 3,531,8488, = 274,811.31 


The solution to this set of equations is 
By = 2.26379, 8, = 2.74427, By = 0.01253 
Therefore, the fitted regression equation is 
y = 2.26379 + 2.74427x, + 0.01253x, 
This equation can be used to predict pull strength for pairs of values of the regressor variables 
wire length (x,) and die height (x,). This is essentially the same regression model given in 


Equation 1-7, Section 1-3. Figure 1-14 shows a three-dimentional plot of the plane of predicted 
values y generated from this equation. 


12-1 MULTIPLE LINEAR REGRESSION MODEL 417 


12-1.3 Matrix Approach to Multiple Linear Regression 
In fitting a multiple regression model, it is much more convenient to express the mathe- 
matical operations using matrix notation. Suppose that there are k regressor variables and 
n observations, (xi, X5, ..., Xj, Vj), 1 = 1, 2,..., n and that the model relating the regres- 
sors to the response is 


y Pot Dp Dix; cro BEBE i = 1,2,...,A 


This model is a system of n equations that can be expressed in matrix notation as 


y=XB +e (12-11) 
where 
^1 | xu xX Xi Bo €i 
12 l x x x EJ 
y= x= ic ls d p = Pi and €= 
Yn L Aq Xx dur m €, 


In general, y is an (n X 1) vector of the observations, X is an (n X p) matrix of the levels 
of the independent variables, B is a (p X 1) vector of the regression coefficients, and e is a 
(n X 1) vector of random errors. : 

We wish to find the vector of least squares estimators, D, that minimizes 


L= X a = e'e = (y - XB)'(y - Xp) 


n 
=] 


l 


The least squares estimator B is the solution for D in the equations 


Le 
op 


We will not give the details of taking the derivatives above; however, the resulting equations 
that must be solved are 


X'XB = X'y (12-12) 


Equations 12-12 are the least squares normal equations in matrix form. They are identical to 
the scalar form of the normal equations given earlier in Equations 12-10. To solve the normal 
equations, multiply both sides of Equations 12-12 by the inverse of X'X. Therefore, the least 
squares estimate of p is 


B = (X'X)'X'y (12-13) 
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EXAMPLE 12-2 


Note that there are p = k + 1 normal equations in p = k + 1 unknowns (the values of 
Bo. Br eon, Bj. Furthermore, the matrix X'X is always nonsingular, as was assumed above, 
so the methods described in textbooks on determinants and matrices for inverting these matri- 
ces can be used to find (X'X) !. In practice, multiple regression calculations are almost al- 
ways performed using a computer. 

It is easy to see that the matrix form of the normal equations is identical to the scalar form. 
Writing out Equation 12-12 in detail, we obtain 


n n n 
n X xa X xp UC 25k 
i=] i=] i=] 
n n n n " n 
2, Xil >, xf 2, Xii U 2, XüX;i | | Bi >, XiJi 
i= i= i= i= — | i= 


S 
iM: 
c 


n n n n m 
2, Nik 2, Ni >, XikXi2 77 2, Xj By 2, XikYi 
i= i= i= i= i= 


If the indicated matrix multiplication is performed, the scalar form of the normal equations 
(that is, Equation 12-10) will result. In this form it is easy to see that X’X is a(p X p) sym- 
metric matrix and X’y is a(p X 1) column vector. Note the special structure of the X’X ma- 
trix. The diagonal elements of X’X are the sums of squares of the elements in the columns of 
X, and the off-diagonal elements are the sums of cross-products of the elements in the 
columns of X. Furthermore, note that the elements of X'y are the sums of cross-products of 
the columns of X and the observations {y;}. 
The fitted regression model is 


$; 7 bo + M Bx; i=1,2,...,n (12-14) 
=] 


In matrix notation, the fitted model is 
y = XB 


The difference between the observation y; and the fitted value y; is a residual, say, 
e; = y; — y;. The (n X 1) vector of residuals is denoted by 


e-y-j5 (12-15) 


In Example 12-1, we illustrated fitting the multiple regression model 


y = Bo + Bix, + Box. + € 


where y is the observed pull strength for a wire bond, x, is the wire length, and x, is the 
die height. The 25 observations are in Table 12-2. We will now use the matrix approach 
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to fit the regression model above to these data. The X matrix and y vector for this 
model are 





l|] 2 50 9.95 
l 8 110 24.45 
| dT 120 3175 
1 10 550 35.00 
1 8 295 25.02 
1 4 200 16.86 
i 2 375 14.38 
|] 2 32 9.60 
1 9 100 24.35 
l 8 300 27.50 
1 4 412 17.08 
1 11 400 37.00 
X=/1 12 500| y= |41.95 
|] 2 360 11.66 
1 4 205 21.65 
1 4 400 17.89 
1 20 600 69.00 
1 ] 585 10.30 
1 10 540 34.93 
] 15 250 46.59 
] 15 290 44.88 
1 16 $510 54.12 
1 17 590 56.63 
1 6 100 22.13 
1 5 400 21.15 
The X'X matrix is 
1 L s 1 di 25 206 8,294 
X'X^5|2 8 e 5 d. n = | 206 2,396 PEAT? 
50 110 --- 400 l l 8,294 77,177 3,531,848 
1 5 400 
and the X'y vector is 
9.95 
1 l ii 1 725.82 
24.45 
X'y =| 2 8 ; = | 8,008.37 
50 110 -- 400 l 274,811.31 
21:15 


The least squares estimates are found from Equation 12-13 as 


Ê = (X'X)''x'y 
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or 
Bo 25 206 8294 |~![ 725.82 
ĝl =| 206 2396 77,177 8,008.37 
B. 8294 77,177 3,531,848 274,811.31 
0.214653  —0.007491 —0.000340 725.82 2.26379143 
= | —0.007491 0.001671 —0.000019 8,008.47 | = | 2.74426964 
—0.000340  —0.000019 +0.0000015 | | 274,811.31 0.01252781 


Therefore, the fitted regression model with the regression coefficients rounded to five decimal 
places is 


y = 2.26379 + 2.74427x, + 0.01253x; 


This 1s identical to the results obtained in Example 12-1. 

This regression model can be used to predict values of pull strength for various values of 
wire length (x,) and die height (x;). We can also obtain the fitted values y; by substituting each 
observation (x4, X), i = 1,2,...,n, into the equation. For example, the first observation has 
xı; = 2 and x, = 50, and the fitted value is 


Py = 2.26379 + 2.74427x1, + 0.01253x; 
= 2.26379 + 2.74427(2) + 0.01253(50) 
= 8.38 


The corresponding observed value is y, = 9.95. The residual corresponding to the first obser- 
vation is 


e = Vi 7 5i 
9.95 — 8.38 
= 1.57 


Table 12-3 displays all 25 fitted values y; and the corresponding residuals. The fitted values 
and residuals are calculated to the same accuracy as the original data. 


Table 12-3 Observations, Fitted Values, and Residuals for Example 12-2 


Observation 
Number 


= = 
FP CoO WON NWN FB LUN e 


Ó -— 
Ww N 


Observation 

Ja ji ey oc Number Vi ji qu om), 

9.95 8.38 1.57 14 11.66 12.26 —0.60 
24.45 25.60 =115 15 21.65 15.81 5.84 
31.75 33.95 =2.20 16 17.89 18.25 —0.36 
35.00 36.60 — 1.60 17 69.00 64.67 4.33 
25.02 249] — 2.89 18 10.30 12.34 — 2.04 
16.86 15:75 1.11 19 34.93 36.47 —1.54 
14.38 12.45 1.93 20 46.59 46.56 — 0.03 

9.60 8.40 1.20 2l 44.88 47.06 L2. 05 
24.35 28.21 —3.86 22 54.12 52.56 1.56 
27.50 27.98 — 0.48 23 56.63 56.31 0.32 
17.08 18.40 =j 24 22.13 19.98 2.15 
37.00 37.46 — 0.46 25 21.15 21.00 0.15 
41.95 41.46 0.49 
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Computers are almost always used in fitting multiple regression models. Table 12-4 pre- 
sents some annotated output from Minitab for the least squares regression model for wire bond 
pull strength data. The upper part of the table contains the numerical estimates of the regres- 
sion coefficients. The computer also calculates several other quantities that reflect important 
information about the regression model. In subsequent sections, we will define and explain the 
quantities in this output. 


Estimating o^ 

Just as in simple linear regression, it is important to estimate o7, the variance of the error term 
e, in a multiple regression model. Recall that in simple linear regression the estimate of o? was 
obtained by dividing the sum of the squared residuals by n — 2. Now there are two parame- 
ters in the simple linear regression model, so in multiple linear regression with p parameters a 
logical estimator for o? is 


(12-16) 





This is an unbiased estimator of g. Just as in simple linear regression, the estimate of o? is 
usually obtained from the analysis of variance for the regression model. The numerator of 
Equation 12-16 is called the error or residual sum of squares, and the denominator n — p is 
called the error or residual degrees of freedom. Table 12-4 shows that the estimate of o? for 
the wire bond pull strength regression model is 6? = 115.2/22 = 5.2364. The Minitab output 
rounds the estimate to 6^ — 5.2. 


12-1.4 Properties of the Least Squares Estimators 


The statistical properties of the least squares estimators Bo, Bi, ... , B, may be easily found, 
under certain assumptions on the error terms €}, €», ..., €,,, in the regression model. Paralleling 
the assumptions made in Chapter 11, we assume that the errors e; are statistically independent 
with mean zero and variance o°. Under these assumptions, the least squares estimators 
Bo, B1, ..., Bz are unbiased estimators of the regression coefficients Bo, B1, ... , Bz This 
property may be shown as follows: 


E(B) = E((X'X) x Y] 

E((X'X) 'X'(XB + €)] 

= E[((X'X) !X'Xp + (X'X) !X'e] 
b 


since E(e) = 0 and (X'X) !X'X = I, the identity matrix. Thus, B is an unbiased estimator of [. 

The variances of the (3's are expressed in terms of the elements of the inverse of the X'X 
matrix. The inverse of X'X times the constant o? represents the covariance matrix of the 
regression coefficients B. The diagonal elements of o? (X'X) ' are the variances of Bo, 
Bi, ..., Bi, and the off-diagonal elements of this matrix are the covariances. For example, if 
we have k — 2 regressors, such as in the pull-strength problem, 


Coo Coi Co 


C -(XX)'- Cio Cru Cy 
Co C3 C» 
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Table 12-4 Minitab Multiple Regression Output for the Wire Bond Pull Strength Data 


Regression Analysis: Strength versus Length, Height 


The regression equation 1s 
Strength = 2.26 + 2.74 Length + 0.0125 Height 


Predictor Coef SE Coef T P VIF 
Constant Bo» 2.264 1.060 2.14 0.044 
Length B »2.74427 0.09352 | 2934 0.000 1.2 


^ 


Height D,» 0.012528 0.002798 4.48 0.000 1.2 


S — 2.288 R-Sq — 98.196 R-Sq (adj) = 97.9% 
PRESS = 156.163 R-Sq (pred) = 97.44% 


Analysis of Variance 


Source DF SS MS F P 
Regression Z 5990.8 2995.4 512217 0.000 
Residual Error 22 1152 52 <6? 

Total 24 6105.9 

Source DF Seq SS 

Length l 5885.9 

Height ] 104.9 


Predicted Values for New Observations 


New Obs Fit SE Fit 95.0% CI 95.0% PI 
l 27.663 0.482 (26.663, 28.663) (22.814, 32.512) 


Values of Predictors for New Observations 


New Obs Length Height 
| 8.00 275 


which is symmetric (Cio = Cy}, Coo = Cy, and Cs; = C2) because (X'X) ! is symmetric, 
and we have 

S REN 
cov(B;, B) = a^ Cj, l Fj 


j = 9, 1, 2 


In general, the covariance matrix of B isa (p X p) symmetric matrix whose jjth element is the 
variance of B; and whose i, jth element is the covariance between p; and B, that is, 


cov(B) = o*X'X) ! = o?C 


The estimates of the variances of these regression coefficients are obtained by replacing 
o? with an estimate. When o? is replaced by it's estimate 6^, the square root of the estimated 
variance of the jth regression coefficient is called the estimated standard error of B; Or 
se(B,) = V 6^Cj. These standard errors are a useful measure of the precision of estimation 
for the regression coefficients; small standard errors imply good precision. 

Multiple regression computer programs usually display these standard errors. For 
example, the Minitab output in Table 12-4 reports se(Bo) = 1.060, se(B,) = 0.09352, and 


12-1 MULTIPLE LINEAR REGRESSION MODEL 423 


se(B,) — 0.002798. The slope estimate is about twice the magnitude of its standard error, and 


B, and B, are considerably larger than se(B,) and se(ĝ2). This implies reasonable precision 
of estimation, although the parameters D, and B, are much more precisely estimated than the 
intercept (this is not unusual in multiple regression). 


EXERCISES FOR SECTION 12-1 


12-1. A study was performed to investigate the shear 

strength of soil ( y) as it related to depth in feet (x,) and mois- 

ture content (x;). Ten observations were collected, and the fol- 

lowing summary quantities obtained: n = 10, Xx; = 223, 

235553. dy, = 1,916, Dx = 5,200.9, Xx} = 31,729, 

XX = 12,352, Xx y; = 43,550.8, Xxpy; = 104,736.8, 

and Sy? = 371,595.6. 

(a) Set up the least squares normal equations for the model 
Y= Bo + Bixi + Box, + e. 

(b) Estimate the parameters in the model in part (a). 

(c) What is the predicted strength when x, = 18 feet and 
x, = 43%? 

12-2. A regression model is to be developed for predicting 

the ability of soil to absorb chemical contaminants. Ten obser- 

vations have been taken on a soil absorption index ( y) and two 

regressors: x, — amount of extractable iron ore and x, — 

amount of bauxite. We wish to fit the model Y = By + B,x, + 

Bx, + e. Some necessary quantities are: 


1.17991 — 7.30982 E-3 
(X'X)!2 | —7.30982 E-3 7.9799 E-5 
7.3006 E-4 1.23713 E-4 
(a) Estimate the regression coefficients in the model specified 


above. 
(b) What is the predicted value of the absorption index y 
when x, = 200 and x, = 50? 


12-3. A chemical engineer is investigating how the amount 
of conversion of a product from a raw material ( y) depends on 
reaction temperature (x,) and the reaction time (x5). He has 
developed the following regression models: 

1. $ = 100 + 2x, + 4x; 

2. $ = 95 + 1.5x, + 3x; + 2xix; 

Both models have been built over the range 0.5 = x, - 10. 

(a) What is the predicted value of conversion when x, = 2? 
Repeat this calculation for x, = 8. Draw a graph of the 
predicted values for both conversion models. Comment 
on the effect of the interaction term in model 2. 

(b) Find the expected change in the mean conversion for a 
unit change in temperature x, for model 1 when x, = 5. 
Does this quantity depend on the specific value of reac- 
tion time selected? Why? 

(c) Find the expected change in the mean conversion for a 
unit change in temperature x, for model 2 when x, = 5. 
Repeat this calculation for x, = 2 and x, = 8. Does the 
result depend on the value selected for x,? Why? 


12-4. The data in Table 12-5 are the 1976 team performance 
statistics for the teams in the National Football League 
(Source: The Sporting News). 

(a) Fit a multiple regression model relating the number 
of games won to the teams’ passing yardage (x,), the percent 
rushing plays (x;), and the opponents’ yards rushing (x). 

(b) Estimate oO”. 

(c) What are the standard errors of the regression coefficients? 

(d) Use the model to predict the number of games won when 
x, = 2000 yards, x, = 60%, and x, = 1800. 

12-5. Table 12-6 presents gasoline mileage performance for 

25 automobiles (Source: Motor Trend, 1975). 

(a) Fit a multiple regression model relating gasoline mileage 
to engine displacement (x,) and number of carburetor 
barrels (xg). 

(b) Estimate o°. 

(c) Use the model developed in part (a) to predict mileage per- 
formance for a car with displacement x, = 300 and x, = 2. 


7.3006 E-4 220 
—1.23713 E-4 |, X'y = | 36,768 
4.6576 E-4 9,965 


12-6. The electric power consumed each month by a chem- 
ical plant is thought to be related to the average ambient 
temperature (x,), the number of days in the month (x,), the 
average product purity (x4), and the tons of product produced 
(x4). The past year's historical data are available and are pre- 
sented in the following table: 


24 Xi X2 X3 X4 
240 25 24 91 100 
236 31 PA 90 95 
270 45 24 88 110 
274 60 29 87 88 
301 65 29 9] 94 
316 T2 26 94 99 
300 80 25 87 97 
296 84 25 86 96 
267 75 24 88 110 
276 60 25 91 105 
288 50 25 90 100 
261 38 23 89 98 


(a) Fit a multiple linear regression model to these data. 
(b) Estimate oO”. 
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Team 
Washington 
Minnesota 
New England 
Oakland 
Pittsburgh 
Baltimore 
Los Angeles 
Dallas 
Atlanta 
Buffalo 
Chicago 
Cincinnati 
Cleveland 
Denver 
Detroit 
Green Bay 
Houston 
Kansas City 
Miami 
New Orleans 
New York Giants 
New York Jets 
Philadelphia 
St. Louis 
San Diego 
San Francisco 
Seattle 
Tampa Bay 


2 
10 
11 


mÓ 
O rf A 
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X 
2113 
2003 
2957 
2285 
297] 
2309 
2528 
2147 
1689 
2566 
2363 
2109 
2295 
1932 
2215 
1722 
1498 
1873 
2118 
1775 
1904 
1929 
2080 
2301 
2040 
2447 
1416 
1503 


y: Games won (per 14 game season) 
x,: Rushing yards (season) 


MR 


2: Passing yards (season) 
3: Punting yards (yds/punt) 
x4: Field goal percentage (Field goals made/Field goals attempted— season) 


X2 
1985 
2855 
1737 
2905 
1666 
2927 
2341 
2737 
1414 
1838 
1480 
2191 
2229 
2204 
2140 
1730 
2072 
2929 
2268 
1983 
1792 
1606 
1492 
2835 
2416 
1638 
2649 
1503 


X3 
38.0 
38.8 
40.1 
41.6 
39.7 
39.7 
38.1 
37.0 
42.1 
42.3 
IRI 
39.5 
37.4 
351 
38.8 
36.6 
35.3 
41.1 
38.6 
39.3 
39.7 
39.7 
39 
35.3 
38.7 
39.9 
37.4 
39.3 


X4 
64.7 
61.3 
60.0 
45.3 
53.8 
74.1 
65.4 
78.3 
47.6 
54.2 
48.0 
519 
53.6 
71.4 
58.3 
52.6 
29:3 
55.3 
69.6 
78.3 
38.1 
68.8 
68.8 
74.1 
50.0 
37.1 
56.3 
47.0 


x5: Turnover differential (turnovers acquired—turnovers lost) 


= 


ę: Penalty yards (season) 


x: Percent rushing (rushing plays/total plays) 


Table 12-5 National Football League 1976 Team Performance 


X6 
868 
615 
914 
957 
836 
786 
754 
797 
714 
797 
984 
819 
1037 
986 
819 
791 
716 
789 
582 
901 
734 
627 
122 
683 
576 
848 
684 
875 


X7 
59.7 
55.0 
65.6 
61.4 
66.1 
61.0 
66.1 
58.0 
57.0 
58.0 
68.5 
239.2 
58.8 
58.6 
292 
54.4 
49.6 
54.3 
58.7 
51.7 
61.9 
92.1 
57.8 
59.7 
54.0 
65.3 
43.8 
53.5 


Xg 
2205 
2096 
1847 
1903 
1457 
1848 
1564 
2476 
2571 
2476 
1984 
1901 
1761 
1709 
1901 
2288 
2072 
2861 
2411 
2289 
2203 
2592 
2053 
1979 
2048 
1786 
2876 
2560 


X9 
1917 
1575 
2115 
2476 
1866 
2539 
2092 
2254 
2001 
2254 
2217 
1686 
2032 
2025 
1686 
1835 
1914 
2496 
2670 
2202 
1988 
2324 
2550 
2110 
2628 
1776 
2524 
2241 


Table 12-6 Gasoline Mileage Performance for 25 Automobiles 


Automobile y Xj 
Apollo 18.90 350 
Nova 20.00 250 
Monarch 18.25 351 
Duster 20.07 225 
Jenson Conv. 11.2 440 
Skyhawk 22.12 231 
Scirocco 34.70 897 
Corolla SR-5 30.40 96.9 
Camaro 16.50 350 
Datsun B210 36.50 853 
Capri II 21:350 -171 
Pacer 19.70 258 
Granada 17.80 302 
Eldorado 14.39 500 
Imperial 14.89 440 
Nova LN 17.80 350 
Starfire 23.54 231 
Cordoba 21.47 360 


Trans Am 16.59 400 
Corolla E-5 31.90 96.9 
Mark IV 13.27 460 
Celica GT 23.900 133.6 
Charger SE 19.73 318 


Cougar 13.90 351 
Corvette 16.50 350 


y: Miles/gallon 

x1: Displacement (cubic inches) 
: Horsepower (foot-pounds) 
X3: Torque (foot-pounds) 

x4: Compression ratio 

: Rear axle ratio 

: Carburetor (barrels) 

: No. of transmission speeds 
: Overall length (inches) 

: Width (inches) 

X19: Weight (pounds) 


= 
N 
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X3 
260 
185 
255 
170 
330 
L75 

81 
83 
250 
83 
146 


195 
220 
360 
330 
250 
175 
290 
NA 

83 
366 
120 
255 


243 
253 


X4 
8.0:1 
8.25:1 
8.0:1 
8.4:1 
8.2:1 
8.0:1 
8.2:1 
9.0:1 
8.5:1 
8.5:1 
8.2:1 


8.0:1 
8.0:1 
8.5:1 
8.2:1 
8.5:1 
8.0:1 
8.4:1 
7.6:1 
9.0:1 
8.0:1 
8.4:1 
8.5:1 


8.0:1 
8.5:1 


x1: Type of transmission (A—automatic, M—manual) 


Xs 
2.56:1 
2.19351 
3.00:1 
2.76:1 
2.88:1 
2.56:1 
3.90:1 
4.30:1 
3.08:1 
3.89:1 
3.22:1 


3.08:1 

3.0:1 
2.73: 
2. Hl 
3.08:1 
2.56:1 
2.45:1 
3.08:1 
4.30:1 
3.00:1 
3,911 
2.71:1 


321 
2.1321 
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Xg 
200.3 
196.7 
199.9 
194.1 
184.5 
179.3 
155.7 
165.2 
195.4 
160.6 
170.4 


171.5 
199.9 
224.1 
231.0 
196.7 
179.3 
214.2 

196 
165.2 

228 
171.5 
2153 


215.5 
185.2 


X9 
69.9 
12.2 
74.0 
71.8 
69 
65.4 
64 
65 
74.4 
62.2 
66.9 


77 

74 

79.8 
19.7 
12.2 
65.4 
76.3 
43 

61.8 
79.8 
63.4 
46.3 


78.5 
69 


X10 
3910 
3510 
3890 
3365 
4215 
3020 
1905 
2320 
3885 
2009 
2655 


3375 
3890 
5290 
5185 
3910 
3050 
4250 
3850 
2273 
5430 
2535 
4370 


4540 
3660 
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(c) Compute the standard errors of the regression coefficients. 
(d) Predict power consumption for a month in which 
x, = 75°F, x, = 24 days, x; = 90%, and x, = 98 tons. 
12-7. A study was performed on wear of a bearing y and its 
relationship to x, = oil viscosity and x, = load. The follow- 

ing data were obtained. 


2i X X2 
293 1.6 851 
230 15.5 816 
172 22.0 1058 
91 43.0 1201 
113 33.0 1357 
125 40.0 1115 


(a) Fit a multiple linear regression model to these data. 

(b) Estimate o* and the standard errors of the regression 
coefficients. 

(c) Use the model to predict wear when x, = 25 and x; = 1000. 

(d) Fit a multiple linear regression model with an interaction 
term to these data. 

(e) Estimate c? and se(B;) for this new model. How did these 
quantities change. Does this tell you anything about the 
value of adding the interaction term to the model? 

(f) Use the model in (d) to predict when x, = 25 and x, = 
1000. Compare this prediction with the predicted value 
from part (b) above. 


12-8. The pull strength of a wire bond is an important charac- 
teristic. The following table gives information on pull strength 


- (y), die height (x), post height (x,), loop height (x5), wire length 
= (x4), bond width on the die (x5), and bond width on the post (x¢). 


2 X1 X2 X3 X4 Xs X6 
8.0 5.2 19.6 29.6 94.9 2.1 2.3 
8.3 2.2 19.8 32.4 89.7 21 1.8 
8.5 5.8 19.6 31.0 96.2 2.0 2.0 
8.8 6.4 19.4 32.4 95.6 2.2 2.1 
9.0 5.8 18.6 28.6 86.5 2.0 1.8 
9.3 3.2 18.8 30.6 84.5 2.1 Zl 
9.3 5.6 20.4 32.4 88.8 2.2 1.9 
9.5 6.0 19.0 32.6 85.7 2.1 1.9 
9.8 3:2 20.8 322 93.6 23 2.1 
10.0 5.8 19.9 31.8 86.0 2.1 1.8 
10.3 6.4 18.0 32:6 87.1 2.0 1.6 
10.5 6.0 20.6 33.4 93.1 ZA 2.1 
10.8 6.2 20.2 31.8 83.4 22 2.1 
11.0 6.2 20.2 32.4 94.5 2.4 1.9 
11.3 6.2 192 31.4 83.4 1.9 1.8 
ILo 5.6 17.0 35.2 85.2 2.4 zl 
11.8 6.0 19.8 35.4 84.1 2.0 1.8 
12.3 5.8 18.8 34.0 86.9 2.1 1.8 
12.9 5.6 18.6 34.2 83.0 1.9 2.0 


(a) Fit a multiple linear regression model using x», x3, x4, and 
xs as the regressors. 

(b) Estimate o?. 

(c) Find the se(B)). How precisely are the regression coeffi- 
cients estimated, in your opinion? 

(d) Use the model from part (a) to predict pull strength when 
x 20, x4 = 30, x, = 90, and x; = 2.0. 

12-9. An engineer at a semiconductor company wants to 

model the relationship between the device HFE ( y) and three 

parameters: Emitter-RS (x,), Base-RS (x2), and Emitter-to-Base 

RS (x3). The data are shown in the following table. 


X X2 X3 y 
Emitter-RS Base-RS E-B-RS HFE-1M-5V 
14.620 226.00 7.000 128.40 
15.630 220.00 3.375 52.62 
14.620 217.40 6.375 113.90 
15.000 220.00 6.000 98.01 
14.500 226.50 7.625 139.90 
15.250 224.10 6.000 102.60 
16.120 220.50 3.375 48.14 
15.130 223.50 6.125 109.60 
15.500 217.60 5.000 82.68 
15.130 228.50 6.625 112.60 
15.500 230.20 5.750 97.32 
16.120 226.50 3.750 59.06 
15.130 226.60 6.125 111.80 
15.630 225.60 5.375 89.09 
15.380 229.70 5.875 101.00 
14.380 234.00 8.875 171.90 
15.500 230.00 4.000 66.80 
14.250 224.30 8.000 157.10 
14.500 240.50 10.870 208.40 
14.620 223.10 1,315 133.40 


(a) Fit a multiple linear regression model to the data. 

(b) Estimate o?. 

(c) Find the standard errors se(B,). 

(d) Predict HFE when x, = 14.5, x; = 220, and x, = 5.0. 


12-10. Heat treating 1s often used to carburize metal parts, 
such as gears. The thickness of the carburized layer is consid- 
ered a crucial feature of the gear and contributes to the overall 
reliability of the part. Because of the critical nature of this fea- 
ture, two different lab tests are performed on each furnace 
load. One test is run on a sample pin that accompanies each 
load. The other test is a destructive test, where an actual part is 
cross-sectioned. This test involves running a carbon analysis 
on the surface of both the gear pitch (top of the gear tooth) and 
the gear root (between the gear teeth). Table 12-7 shows the 
results of the pitch carbon analysis test for 32 parts. 

The regressors are furnace temperature (TEMP), carbon 
concentration and duration of the carburizing cycle 


Table 12-7 
TEMP SOAKTIME 
1650 0.58 
1650 0.66 
1650 0.66 
1650 0.66 
1600 0.66 
1600 0.66 
1650 1.00 
1650 1.17 
1650 1.17 
1650 1.17 
1650 1.17 
1650 1.17 
1650 1.17 
1650 1.20 
1650 2.00 
1650 2.00 
1650 2.20 
1650 2.20 
1650 2.20 
1650 2.20 
1650 2.20 
1650 2.20 
1650 3.00 
1650 3.00 
1650 3.00 
1650 3.00 
1650 3.33 
1700 4.00 
1650 4.00 
1650 4.00 
1700 12.50 
1700 18.50 


(SOAKPCT, SOAKTIME), and carbon concentration and 

duration of the diffuse cycle (DIFFPCT, DIFFTIME). 

(a) Fit a linear regression model relating the results of the pitch 
carbon analysis test (PITCH) to the five regressor variables. 

(b) Estimate o°. 

(c) Find the standard errors se(,). 

(d) Use the model in part (a) to predict PITCH when 
TEMP = 1650, SOAKTIME = 1.00, SOAKPCT = 1.10, 
DIFFTIME = 1.00, and DIFFPCT = 0.80. 


12-11. Statistics for 21 National Hockey League teams 


were obtained from the Hockey Encyclopedia and are shown 
in Table 12-8. 
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SOAKPCT 
1.10 
1.10 
1.10 
1.10 
1.15 
1.15 
1.10 
1.10 
1.10 
1.10 
1.10 
1.10 
1.15 
1.15 
1.15 
1.10 
1.10 
1.10 
1.15 
1.10 
1.10 
1.10 
1.15 
1.10 
1.10 
1.15 
1.10 
1.10 
1.10 
1.15 
1.00 
1.00 


DIFFTIME DIFFPCT PITCH 
0.25 0.90 0.013 
0.33 0.90 0.016 
0.33 0.90 0.015 
0.33 0.95 0.016 
0.33 1.00 0.015 
0.33 1.00 0.016 
0.50 0.80 0.014 
0.58 0.80 0.021 
0.58 0.80 0.018 
0.58 0.80 0.019 
0.58 0.90 0.021 
0.58 0.90 0.019 
0.58 0.90 0.021 
1.10 0.80 0.025 
1.00 0.80 0.025 
1.10 0.80 0.026 
1.10 0.80 0.024 
1.10 0.80 0.025 
1.10 0.80 0.024 
1.10 0.90 0.025 
1.10 0.90 0.027 
1.50 0.90 0.026 
1.50 0.80 0.029 
1.50 0.70 0.030 
1.50 0.75 0.028 
1.66 0.85 0.032 
1.50 0.80 0.033 
1.50 0.70 0.039 
1.50 0.70 0.040 
1.50 0.85 0.035 
1.50 0.70 0.056 
1.50 0.70 0.068 


The variables and definitions are as follows: 


Wins 
Pts 


GF 


GA 


PPG 


PPcT 


Number of games won in a season. 

Points awarded in a season. Two points for win- 
ning a game, one point for losing in overtime, 
zero points for losing in regular time. 

Goals for. Total goals scored during the season. 
Goals against. Goals scored against the team dur- 
ing the season. 

Power play goals. Points scored while on power 
play. 

Power play percentage. The number of power 
play goals divided by the number of power play 
opportunities. 
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PRCT SHG PPGA PKPcT SHGA 
27.2 4 71 76.6 6 
26.4 17 67 80.7 20 
22.9 13 83 75 9 
21:2 9 63 81.3 12 
19.3 7 80 72.6 9 
29.9 22 89 71.5 6 
27 7 39 77.1 6 
23.8 5 56 80.8 13 
23.6 10 67 72.8 7 
23.8 10 94 68.2 14 
24.6 15 61 82 7 
25.8 10 55 83.4 3 
20.9 3 53 81.6 11 
22.4 12 75 76 8 
21.9 74 78 13:9 10 
22.6 3 110 12.2 15 
29. 8 29 80.7 6 
222 8 68 73.8 8 
2125 12 48 82.5 9 
20.7 6 92 T30 6 
19.3 6 70 76.1 9 


Table 12-8 
Team Wins Pts GF GA PPG 
Chicago 47 104 338 268 86 
Minnesota 40 96 321 290 91 
Toronto 28 68 23 330 79 
St. Louis 25 65 285 316 67 
Detroit 21 57 263 344 37 
Edmonton 47 106 424 315 86 
Calgary 22 78 321 217 90 
Vancouver 30 75 303 309 65 
Winnipeg 33 74 311 2333 78 
Los Angeles 2d 66 308 365 81 
Philadelphia 49 106 326 240 60 
NY Islanders 42 96 302 226 69 
Washington 39 94 306 283 75 
NY Rangers 35 80 306 287 71 
New Jersey 17 48 230 338 66 
Pittsburgh 18 45 257 394 81 
Boston 50 110 327 228 67 
Montreal 42 98 350 286 64 
Buffalo 38 89 318 283 67 
Quebec 34 80 343 336 61 
Hartford 19 45 261 403 51 

SHG Short-handed goals scored during the season. 

PPGA Power play goals against. 

PKPcT Penalty killing percentage. Measures a team’s 
ability to prevent goals while its opponent is on a 
power play. Opponent power play goals divided 
by opponent’s opportunities. 

SHGA Short-handed goals against. Fit a multiple linear 


regression model relating wins to the other vari- 
ables. Estimate o? and find the standard errors of 
the regression coefficients. 


12-12. Consider the linear regression model 


Y; = Bo + BiG — xi) + Bora — x) + €i 

where x, = > x,,/n and x, = Xxp/n. 

(a) Write out the least squares normal equations for this model. 

(b) Verify that the least squares estimate of the intercept in 
this model is Bh = Xyj/n = y. 

(c) Suppose that we use y; — y as the response variable in the 
model above. What effect will this have on the least 
squares estimate of the intercept? 


12-2 HYPOTHESIS TESTS IN MULTIPLE LINEAR REGRESSION 


In multiple linear regression problems, certain tests of hypotheses about the model parameters 
are useful in measuring model adequacy. In this section, we describe several important 
hypothesis-testing procedures. As in the simple linear regression case, hypothesis testing 
requires that the error terms e; in the regression model are normally and independently dis- 
tributed with mean zero and variance o°. 


12-2.1 Test for Significance of Regression 


The test for significance of regression is a test to determine whether a linear relationship exists 
between the response variable y and a subset of the regressor variables x,, x5, ... , x. The 
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appropriate hypotheses are 


HPE EDI 


Hip; #0 for at least one j (12-17) 





Rejection of Hp: Dj = B2 = + = B, = 0 implies that at least one of the regressor variables 
X, X2, ... , X, contributes significantly to the model. 

The test for significance of regression is a generalization of the procedure used in simple 
linear regression. The total sum of squares SS; is partitioned into a sum of squares due to re- 
gression and a sum of squares due to error, say, 


SS; = SS, F SS; 


Now if Hy: B1 = Bo = °°: = B, = 0 is true, SSp/o7 is a chi-square random variable with k 
degrees of freedom. Note that the number of degrees of freedom for this chi-square random 
variable is equal to the number of regressor variables in the model. We can also show the 
SS;,/07 is a chi-square random variable with n — p degrees of freedom, and that SS, and SS; 
are independent. The test statistic for Hp: B} = Bo = = = B = 01s 


SSp/k Msgr rm 


° SSg/(n— p)  MSg 





We should reject Hp if the computed value of the test statistic in Equation 12-18, fọ, 1s greater than 
Jakn-p The procedure is usually summarized in an analysis of variance table such as Table 12-9. 
We can find a computing formula for SS, as follows: 


n n 
SSE = > a bY = 2.6 — e'e 
i-l E 


Substituting e = y — $ = y — Xf into the above, we obtain 
SS; = yy — B'X'y (12-19) 


Table 12-9 Analysis of Variance for Testing Significance of Regression in Multiple Regression 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square Fo 
Regression SS, k MS, MS,/ MS, 
Error or residual SS, Hp MS; 


Total SS n=] 


430 CHAPTER 12 MULTIPLE LINEAR REGRESSION 


EXAMPLE 12-3 


A computational formula for SS, may be found easily. Now since $$; = Xy; — 
(E iy)! /n = y'y — (Ei yi) /n, we may rewrite Equation 12-19 as 


e) [LU 


SSg—YY-- > 


Or 
SS; = S; mm SSR 


Therefore, the regression sum of squares is 


(12-20) 


We will test for significance of regression (with a = 0.05) using the wire bond pull strength 
data from Example 12-1. The total sum of squares is 


n 
(725.82) 


— 27,177.9510 — je T 6105.9447 


SSp = yy- 


The regression sum of squares is computed from Equation 12-20 as follows: 


n 


(725.82) 


= 27,062.7775 — 955. 5990.7712 


SS, = B'X'y — 


and by subtraction 


SSE = SST — SSR 
= y'y — Î'X'y = 115.1735 


The analysis of variance is shown in Table 12-10. To test Hp: Q4 = B2 = 0, we calculate the 
statistic 


T- MSr 2995.3856 
0 MS; 5.2352 
Since fo > fo.05,.2.22 = 3.44 (or since the P-value is considerably smaller than a = 0.05), 


we reject the null hypothesis and conclude that pull strength is linearly related to either wire 
length or die height, or both. However, we note that this does not necessarily imply that the 


= 572.17 
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Table 12-10 Test for Significance of Regression for Example 12-3 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square fo P-value 
Regression 5990.7712 2 2995.3856 572.17 1.08E-19 
Error or residual 115.1735 22 3.2352 
Total 6105.9447 24 


relationship found is an appropriate model for predicting pull strength as a function of wire 
length and die height. Further tests of model adequacy are required before we can be 
comfortable using this model in practice. 


Most multiple regression computer programs provide the test for significance of regression 
in their output display. The middle portion of Table 12-4 1s the Minitab output for this example. 
Compare Tables 12-4 and 12-10 and note their equivalence apart from rounding. The P-value is 
rounded to zero in the computer output. 


R? and Adjusted R? 
We may also use the coefficient of multiple determination R? as a global statistic to assess 
the fit of the model. Computationally, 


Sr y S% 
E SS; 


: (12:21) 


For the wire bond pull strength data, we find that R? = SS,/SS; = 5990.7712/6105.9447 = 
0.9811. Thus the model accounts for about 98% of the variability in the pull strength response 
(refer to the Minitab output in Table 12-4). The R? statistic is somewhat problematic as a 
measure of the quality of the fit for a multiple regression model because it always increases 
when a variable is added to a model. 

To illustrate, consider the model fit to wire bond pull strength data in Example 11-8. This 
was a simple linear regression model with x, = wire length as the regressor. The value of R? 
for this model is R? = 0.9640. Therefore, adding x, — die height to the model increases R^ by 
0.9811 — 0.9640 — 0.0171, a very small amount. Since R? always increases when a regressor 
is added, it can be difficult to judge whether the increase is telling us anything useful about the 
new regressor. It is particularly hard to interpret a small increase, such as observed in the pull 
strength data. 

Many regression users prefer to use an adjusted R? statistic: 


(12-22) 





Because SS;/(n — p) is the error or residual mean square and SS7/(n — 1) isa constant, R4 will 


only increase when a variable is added to the model if the new variable reduces the error mean 
square. Note that for the multiple regression model for the pull strength data Rs — 0.979 (see the 
Minitab output in Table 12-4), whereas in Example 11-8 the adjusted R? for the one-variable 
model is Rei = 0.962. Therefore, we would conclude that adding x, = die height to the model 
does result in a meaningful reduction in unexplained variability in the response. 
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The adjusted R? statistic essentially penalizes the analyst for adding terms to the model. It is 
an easy way to guard against overfitting, that 1s, including regressors that are not really useful. 
Consequently, it is very useful in comparing and evaluating competing regression models. We 


will use Rs for this when we discuss variable selection in regression in Section 12-6.3. 


12-2.2 Tests on Individual Regression Coefficients 
and Subsets of Coefficients 


EXAMPLE 12-4 


We are frequently interested in testing hypotheses on the individual regression coefficients. Such 
tests would be useful in determining the potential value of each of the regressor variables in the re- 
gression model. For example, the model might be more effective with the inclusion of additional 
variables or perhaps with the deletion of one or more of the regressors presently in the model. 

Adding a variable to a regression model always causes the sum of squares for regression 
to increase and the error sum of squares to decrease (this is why R? always increases when a 
variable is added). We must decide whether the increase in the regression sum of squares is 
large enough to justify using the additional variable in the model. Furthermore, adding an 
unimportant variable to the model can actually increase the error mean square, indicating that 
adding such a variable has actually made the model a poorer fit to the data (this is why Reg is 
a better measure of global model fit then the ordinary R^). 

The hypotheses for testing the significance of any individual regression coefficient, say 


B; are 





If Ho: B; = 0 is not rejected, this indicates that the regressor x; can be deleted from the model. 
The test statistic for this hypothesis is 


(12-24) 





where C; is the diagonal element of (X'X) ! corresponding to Ê.. Notice that the denominator 


of Equation 12-24 is the standard error of the regression coefficient B ;. The null hypothesis Ho: 
B; = 0 is rejected if |f| > ta/2n-p- This is called a partial or marginal test because the 


regression coefficient B; depends on all the other regressor variables x;(i # j) that are in the 
model. More will be said about this in the following example. 


Consider the wire bond pull strength data, and suppose that we want to test the hypothesis that 
the regression coefficient for x, (die height) is zero. The hypotheses are 


Ho: B2 = 0 
Hi: D» X0 
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The main diagonal element of the (XX) ^ matrix corresponding to B- is C54 = 0.0000015, 
so the t-statistic in Equation 12-24 is 


B 0.01253 


fy = Z aS 
? V6^C,  V/(5.2352)(0.0000015) 


— 4.4767 


Note that we have used the estimate of o? reported to four decimal places in Table 12-10. Since 
1502525 = 2.074, we reject Hy: B; = 0 and conclude that the variable x, (die height) con- 
tributes significantly to the model. We could also have used a P-value to draw conclusions. 
The P-value for tọ = 4.4767 is P = 0.0002, so with a = 0.05 we would reject the null hy- 
pothesis. Note that this test measures the marginal or partial contribution of x, given that x, is 
in the model. That is, the t-test measures the contribution of adding the variable x, = die 
height to a model that already contains x, — wire length. Table 12-4 shows the value of the 
t-test computed by Minitab. The Minitab t-test statistic is reported to two decimal places. Note 
that the computer produces a f-test for each regression coefficient in the model. These f-tests 
indicate that both regressors contribute to the model. 


There is another way to test the contribution of an individual regressor variable to the 
model. This approach determines the increase in the regression sum of squares obtained by 
adding a variable x; (say) to the model, given that other variables x;(i 7^ j) are already included 
in the regression equation. 

The procedure used to do this is called the general regression significance test, or the 
extra sum of squares method. This procedure can also be used to investigate the contribution 
of a subset of the regressor variables to the model. Consider the regression model with k 
regressor variables 


y=XPt+e (12-25) 


where y is (n X 1), X is (n X p), Bis(p X 1), eis(n X 1), and p = k + 1. We would like to 
determine if the subset of regressor variables xj, x5, . . . , x, (r < k) as a whole contributes sig- 
nificantly to the regression model. Let the vector of regression coefficients be partitioned as 
follows: 


= ^| 12-26 
P i od) 


where p, is (r X 1) and B, is [(p — r) X 1]. We wish to test the hypotheses 


(12-27) 





where 0 denotes a vector of zeroes. The model may be written as 
y= Xp +E= Xii EE Xp. te (12-28) 


where X, represents the columns of X associated with D, and X, represents the columns of X 
associated with .. 
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For the full model (including both B, and B,), we know that B = (X'X) ! X'y. In 
addition, the regression sum of squares for all variables including the intercept is 


SS,(B) = B'X'y (p =k + 1 degrees of freedom) 


and 


SSp(B) is called the regression sum of squares due to B. To find the contribution of the terms 
in B, to the regression, fit the model assuming the null hypothesis Hp: D, = 0 to be true. The 
reduced model is found from Equation 12-28 as 


y-Xp-e (12-29) 
The least squares estimate of B» is B, = (XX) !X.y, and 
SSp(B2) = B2Xoy — (p — r degrees of freedom) (12-30) 
The regression sum of squares due to B, given that B, is already in the model is 


SSr(Bi|B2) i SSr(B) v SSr(B2) (12-31) 


This sum of squares has r degrees of freedom. It is sometimes called the extra sum of squares 
due to B,. Note that SSp(B;|B2) is the increase in the regression sum of squares due to 
including the variables x,, x», ..., x, in the model. Now SSp(B;|B 2) is independent of MS;, and 
the null hypothesis B, = 0 may be tested by the statistic 


mc SSr(Bi|B2)/r 


12-32 
arse (12-32) 





If the computed value of the test statistic fo > fa,rn-p we reject Ho, concluding that at least one 
of the parameters in D, is not zero and, consequently, at least one of the variables x,, x5,..., x, 
in X, contributes significantly to the regression model. Some authors call the test in Equation 
12-32 a partial F-test. 

The partial F-test is very useful. We can use it to measure the contribution of each indi- 
vidual regressor x; as if it were the last variable added to the model by computing 


SSr(B;| Bos Dis sss car Ditis Bj), J = | RE: 


This is the increase in the regression sum of squares due to adding x; to a model that already 
includes xj, ... , X;-1, Xj+15 - - - » X,. The partial F-test is a more general procedure in that we 
can measure the effect of sets of variables. In Section 12-6.3 we show how the partial F-test 
plays a major role in model building—that 1s, in searching for the best set of regressor vari- 


ables to use in the model. 


EXAMPLE 12-5 
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Consider the wire bond pull strength data in Example 12-1. We will investigate the contribution of 
the variable x, (die height) to the model using the partial F-test approach. That is, we wish to test 


Ho: D» = 0 
H;: D» # (0) 


To test this hypothesis, we need the extra sum of squares due to D», or 


SSr(B2| 81.80) = SSa(Bi.B».Bo) — SSr(B1,Bo0) 
= SSp(B1.B2|Bo) — SSa(Bi| Bo) 


In Example 12-3 we have calculated 


(3) 


SSp(B1,B2|Bo) = B'X'y — T — 5990.7712 (two degrees of freedom) 





and from Example 11-8, where we fit the model Y = By + Bix; + €, we can calculate 


SSr(B1| Bo) = Bi Sry = (2.9027)(2027.7132) 
= 5885.8521 (one degree of freedom) 


Therefore, 


SSp(B>| 81,80) = 5990.7712 — 5885.8521 
104.9191 (one degree of freedom) 


This is the increase in the regression sum of squares due to adding x, to a model already con- 
taining x,. To test Hy: 9; = 0, calculate the test statistic 


SS ,Bo)/1 104.9191/1 
p n(B2|B1.Bo) _ / 2:007 
MS; 5.2352 


Note that the MS; from the full model, using both x, and x, is used in the denominator of the 
test statistic. Since fo.95 1.22 = 4.30, we reject Hy: B2 = 0 and conclude that the regressor die 
height (x2) contributes significantly to the model. 

Table 12-4 shows the Minitab regression output for the wire bond pull strength data. Just 
below the analysis of variance summary in this table the quantity labeled “Seq SS" shows the 
sum of squares obtained by fitting x, alone (5885.9) and the sum of squares obtained by fitting 
x» after xı. Notationally, these are referred to above as SSp(B1| Bo) and SSs(» | B1, 8o). 


Since the partial F-test in the above example involves a single variable, it is equivalent to 
the t-test. To see this, recall from Example 12-5 that the t-test on Hp: 9; = 0 resulted in the test 
statistic f = 4.4767. Furthermore, the square of a t-random variable with v degrees of free- 
dom is an F-random variable with one and v degrees of freedom, and we note that tj = 
(4.4767) = 20.04 = f. 


12-2. More About the Extra Sum of Squares Method (CD Only) 
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EXERCISES FOR SECTION 12-2 


12-13. Consider the regression model fit to the soil shear 

strength data in Exercise 12-1. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Construct the t-test on each regression coefficient. What 
are your conclusions, using a = 0.05? 


12-14. Consider the absorption index data in Exercise 12-2. 

The total sum of squares for y is SS; = 742.00. 

(a) Test for significance of regression using a = 0.01. What 
is the P-value for this test? 

(b) Test the hypothesis Hp: Bı = 0 versus Hj: B, # 0 using 
a = 0.01. What is the P-value for this test? 

What conclusion can you draw about the usefulness of x, as a 

regressor in this model? 


12-15. Consider the NFL data in Exercise 12-4. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Conduct the t-test for each regression coefficient B,, B,, 
and Bs. Using a = 0.05, what conclusions can you draw 
about the variables in this model? 


12-16.  Reconsider the NFL data in Exercise 12-4. 

(a) Find the amount by which the regressor xg (opponents' 
yards rushing) increases the regression sum of squares. 

(b) Use the results from part (a) above and Exercise 12-14 to 
conduct an F-test for Hy: Bg = 0 versus H,: Dg ~ 0 using 
a = 0.05. What is the P-value for this test? What conclu- 
sions can you draw? 


12-17. Consider the gasoline mileage data in Exercise 12-5. 

(a) Test for significance of regression using a = 0.05. What 
conclusions can you draw? 

(b) Find the :-test statistic for both regressors. Using a = 
0.05, what conclusions can you draw? Do both regressors 
contribute to the model? 

12-18. A regression model Y = By + Bix, + Box. + B3x3 + 

€ has been fit to a sample of n = 25 observations. The calcu- 

lated f-ratios B, /se(B;), j71,2,3 are as follows: for Qi, 

fy = 4.82, for B», tj = 8.21 and for B3, to = 0.98. 

(a) Find P-values for each of the f-statistics. 

(b) Using a = 0.05, what conclusions can you draw about 
the regressor x4? Does it seem likely that this regressor 
contributes significantly to the model? 


12-19. Consider the electric power consumption data in 

Exercise 12-6. 

(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? 

(b) Use the t-test to assess the contribution of each regressor 
to the model. Using a = 0.05, what conclusions can you 
draw? 


12-20. Consider the bearing wear data in Exercise 12-7 
with no interaction. 


(a) Test for significance of regression using a = 0.05. What 
is the P-value for this test? What are your conclusions? 

(b) Compute the :-statistics for each regression coefficient. 
Using a = 0.05, what conclusions can you draw? 

(c) Use the extra sum of squares method to investigate the 
usefulness of adding x, = load to a model that already 
contains x, = oil viscosity. Use a = 0.05. 


12-21. Reconsider the bearing wear data from Exercises 

12-7 and 12-20. 

(a) Refit the model with an interaction term. Test for signifi- 
cance of regression using a = 0.05. 

(b) Use the extra sum of squares method to determine 
whether the interaction term contributes significantly to 
the model. Use a = 0.05. 

(c) Estimate o? for the interaction model. Compare this to the 
estimate of o? from the model in Exercise 12-20. 


12-22. Consider the wire bond pull strength data in 

Exercise 12-8. 

(a) Test for significance of regression using a = 0.05. Find 
the P-value for this test. What conclusions can you draw? 

(b) Calculate the t-test statistic for each regression coeffi- 
cient. Using a = 0.05, what conclusions can you draw? 
Do all variables contribute to the model? 


12-23. 

12-9. 

(a) Test for significance of regression using a = 0.05. What 
conclusions can you draw? 

(b) Calcuate the t-test statistic for each regression coefficient. 
Using a = 0.05, what conclusions can you draw? 


Reconsider the semiconductor data in Exercise 


12-24. Exercise 12-10 presents data on heat treating gears. 

(a) Test the regression model for significance of regression. 
Using a = 0.05, find the P-value for the test and draw 
conclusions. 

(b) Evaluate the contribution of each regressor to the model 
using the t-test with a = 0.05. 

(c) Fit a new model to the response PITCH using new 
regressors x, = SOAKTIME X SOAKPCT and x, = 
DIFFTIME X DIFFPCT. 

(d) Test the model in part (c) for significance of regression 
using a = 0.05. Also calculate the t-test for each regres- 
sor and draw conclusions. 

(e) Estimate o? for the model from part (c) and compare this 
to the estimate of o? for the model in part (a). Which 
estimate is smaller? Does this offer any insight regarding 
which model might be preferable? 


12-25. Data on National Hockey League team performance 

was presented in Exercise 12-11. 

(a) Test the model from this exercise for significance of 
regression using a = 0.05. What conclusions can you 
draw? 


12-3 CONFIDENCE INTERVALS IN MULTIPLE LINEAR REGRESSION 437 


(b) Use the t-test to evaluate the contribution of each play goals. Does this seem to be a logical choice of 
regressor to the model. Does it seem that all regressors are regressors, considering your answer to part (b)? Test this 
necessary? Use a = 0.05. new model for significance of regression and evaluate the 

(c) Fit a regression model relating the number of games won contribution of each regressor to the model using the 
to the number of points scored and the number of power t-test. Use a = 0.05. 


12-3 CONFIDENCE INTERVALS IN MULTIPLE LINEAR REGRESSION 


12-3.1 Confidence Intervals on Individual Regression Coefficients 


Definition 


EXAMPLE 12-6 


In multiple regression models, it is often useful to construct confidence interval estimates for 
the regression coefficients | B;). The development of a procedure for obtaining these confi- 
dence intervals requires that the errors [e;) are normally and independently distributed with 
mean zero and variance o^. This is the same assumption required in hypothesis testing. 
Therefore, the observations ( Y,;} are normally and independently distributed with mean Bo + 
3 B;x; and variance g’. Since the least squares estimator B is a linear combination of the 
observations, it follows that B is normally distributed with mean vector B and covariance 
matrix o°(X'X) '. Then each of the statistics 


T2———3 j;=61,...,k (12-33) 
O 


has a ¢ distribution with n — p degrees of freedom, where C; is the jjth element of the (X’X) ~ 
matrix, and G* is the estimate of the error variance, obtained from Equation 12-16. This 
leads to the following 100(1 — a)% confidence interval for the regression coefficient 


Bj =0,1,...,% 


A 100(1 — a) % confidence interval on the regression coefficient B, j = 0, 1,..., 
k in the multiple linear regression model is given by 


©, = tuse VIS Cy EE (Oy = IE dE unes WO Cg (12-34) 





Because V oC; is the standard error of the regression coefficient B, we would also write the 
CI formula as B; — taz n-p se(B;j) = B; = B; + tuo s, se(B;). 


We will construct a 95% confidence interval on the parameter B; in the wire bond pull strength 
problem. The point estimate of B, is B, = 2.74427 and the diagonal element of (X'X) ! 
corresponding to B, is Cj, = 0.001671. The estimate of o? is 6^ = 5.2352, and 1502522 = 2.074. 
Therefore, the 9596 CI on B, is computed from Equation 12-34 as 


2.74427 — (2.074) N/(5.2352)(.001671) = B, = 2.74427 + (2.074) V/(5.2352)(.001671) 


which reduces to 


2.55029 < p, =< 2.93825 
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12-3.2 Confidence Interval on the Mean Response 


Definition 


EXAMPLE 12-7 


We may also obtain a confidence interval on the mean response at a particular point, say, 


Xo» X92 +++» Xoz- LO estimate the mean response at this point, define the vector 


Xk. 
The mean response at this point is E(Y|xo) = yx, = xo» which is estimated by 
Py|x, = XoB (12-35) 


This estimator is unbiased, since E(x)B) = xB = E(Y |x) = Jy|x, and the variance of 


V(fiyjx,) = o^x(X'X)- xo (12-36) 
A 100(1 — a) % CI on py),, can be constructed from the statistic 


Wy) xo z HYJ xo 


V G^x9(X'X) | Xo 


(12-37) 


For the multiple linear regression model, a 100(1 — o)96 confidence interval on the 
mean response at the point xoj, Xo2, . . . , Xog 1S 


Arixo — ba/2n-p V Gx) (X'X) 'x, 
5 HY |x, = Arixo T fa/2,n-p : Gx (X'X) ! xo (12-38) 





Equation 12-38 is a CI about the regression plane (or hyperplane). It is the multiple regression 
generalization of Equation 11-31. 


The engineer in Example 12-1 would like to construct a 95% CI on the mean pull strength for 
a wire bond with wire length x, = 8 and die height x, = 275. Therefore, 
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The estimated mean response at this point is found from Equation 12-35 as 


2.26379 
fiyx, = xoB — [1 8 275]|2.74427 | = 27.66 
0.01253 
The variance of (Ly, is estimated by 
.414653 —.007491 —.000340 l 


G^xy((X'X) !xy = 5.2352 [1 8275] X | —.007491 .001671 —.000019 8 
—.000340 —.000019 .0000015]|| 275 


= 5.2352 (0.04444) = 0.23266 
Therefore, a 95% CI on the mean pull strength at this point is found from Equation 12-38 as 
27.66 — 2.074 V 0.23266 = yy, = 27.66 + 2.074 V0.23266 


which reduces to 
26.66 < Lyk, = 28.66 


Some computer software packages will provide estimates of the mean for a point of interest x 
and the associated CI. Table 12-4 shows the Minitab output for Example 12-7. Both the esti- 
mate of the mean and the 95% CI are provided. 


12-4 PREDICTION OF NEW OBSERVATIONS 


A regression model can be used to predict new or future observations on the response 
variable Y corresponding to particular values of the independent variables, say, xo), xo», --- , Xoz- 
If xp = [1,xo1, Xo», --- , Xo], a point estimate of the future observation Y, at the point xq, 
X025 -<-s XOK 1S 


Po = xo (12-39) 


A 100(1 — a)% prediction interval for this future observation is 


My = SN 6^(1 + xo (X'X) ! xj) 


=== or tape Gn eae xg ex) (12-40) 





This prediction interval is a generalization of the prediction interval given in Equation 11-33 
for a future observation in simple linear regression. If you compare the prediction interval 
Equation 12-40 with the expression for the confidence interval on the mean, Equation 12-38, 
you will observe that the prediction interval is always wider than the confidence interval. The 
confidence interval expresses the error in estimating the mean of a distribution, while the pre- 
diction interval expresses the error in predicting a future observation from the distribution at 
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EXAMPLE 12-8 
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Figure 12-5 An example of extrapolation in 
multiple regression. 


the point xp. This must include the error in estimating the mean at that point, as well as the in- 
herent variability in the random variable Y at the same value x = Xp. 

In predicting new observations and in estimating the mean response at a given point 
Xop Xo» - + + » Xop We must be careful about extrapolating beyond the region containing the 
original observations. It is very possible that a model that fits well in the region of the origi- 
nal data will no longer fit well outside of that region. In multiple regression it is often easy to 
inadvertently extrapolate, since the levels of the variables (x4, x5, ..., xg), i = 1,2, ...,n, 
jointly define the region containing the data. As an example, consider Fig. 12-5, which illus- 
trates the region containing the observations for a two-variable regression model. Note that 
the point (xo;, xo?) lies within the ranges of both regressor variables x, and x, but it is outside 
the region that is actually spanned by the original observations. Thus, either predicting the 
value of a new observation or estimating the mean response at this point is an extrapolation of 
the original regression model. 


Suppose that the engineer in Example 12-1 wishes to construct a 95% prediction interval on 
the wire bond pull strength when the wire length is x, = 8 and the die height is x; = 275. Note 
that x) = [1 8 275], and the point estimate of the pull strength is fọ = x) = 27.66. 
Also, in Example 12-7 we calculated xo (X'X) !x,y = 0.04444. Therefore, from Equation 
12-40 we have 


27.66 — 2.074 V/5.2352(1 + 0.04444) = Y, = 27.66 + 2.074 V/5.2352(1 + 0.04444) 
and the 9596 prediction interval is 
22.81 = Yə = 32.51 
Notice that the prediction interval is wider than the confidence interval on the mean response 


at the same point, calculated in Example 12-7. The Minitab output in Table 12-4 also displays 
this prediction interval. 


EXERCISES FOR SECTIONS 12-3 AND 12-4 


12-26. Consider the soil absorption data in Exercise 12-2. 

(a) Find a 95% confidence interval on the regression coeffi- 
cient By. 

(b) Find a 95% confidence interval on mean soil absorption 
index when x, = 200 and x, = 50. 

(c) Find a 95% prediction interval on the soil absorption in- 
dex when x, = 200 and x, = 50. 


12-27. Consider the NFL data in Exercise 12-4. 

(a) Find a 95% confidence interval on Bg. 

(b) What is the estimated standard error of fly;x, when x; = 
2000 yards, x, = 60%, and x, = 1800 yards? 

(c) Find a 95% confidence interval on the mean number of 
games won when x, = 2000, x, = 60, and x, = 1800. 


12-28. Consider the gasoline mileage data in Exercise 12-5. 

(a) Find 99% confidence intervals on B; and Bg. 

(b) Find a 99% confidence interval on the mean of Y when 
x, = 300 and x, = 4. 

(c) Fit a new regression model to these data using x4, X2, X6, 
and x; as the regressors. Find 99% confidence intervals 
on the regression coefficients in this new model. 

(d) Compare the lengths of the confidence intervals on (9; and 
8, from part (c) with those found in part (a). Which inter- 
vals are longer? Does this offer any insight about adding 
the variables x, and x, to the model? 


12-29. Consider the electric power consumption data in 

Exercise 12-6. 

(a) Find 95% confidence intervals on B,, B2, B3, and B4. 

(b) Find a 95% confidence interval on the mean of Y when 
X, = 75,x, = 24, x, = 90, and x, = 98. 

(c) Find a 95% prediction interval on the power consumption 
when x, = 75, xX, = 24, x, = 90, and x, = 98. 

12-30. Consider the bearing wear data in Exercise 12-7. 

(a) Find 99% confidence intervals on B; and B». 

(b) Recompute the confidence intervals in part (a) after the in- 
teraction term x,x, is added to the model. Compare the 
lengths of these confidence intervals with those computed 
in part (a). Do the lengths of these intervals provide any 
information about the contribution of the interaction term 
in the model? 

12-31. 

12-8. 

(a) Find 95% confidence interval on the regression coeffi- 
cients. 


Consider the wire bond pull strength data in Exercise 


12-5 MODEL ADEQUACY CHECKING 


12-5.1 Residual Analysis 
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(b) Find a 95% confidence interval on mean pull strength 
when x, = 20, x; = 30, x, = 90 and x; = 2.0. 

(c) Find a 95% prediction interval on pull strength when x, = 
20, x4 = 30, x, = 90, and x; = 2.0. 

12-32. Consider the semiconductor data in Exercise 12-9. 

(a) Find 99% confidence intervals on the regression coeffi- 
cients. 

(b) Find a 99% prediction interval on HFE when x, = 14.5, 
x, = 220, and x, = 5.0. 

(c) Find a 99% confidence interval on mean HFE when x, = 
14.5, x, = 220, and x, = 5.0. 

12-33. Consider the heat treating data from Exercise 12-10. 

(a) Find 9596 confidence intervals on the regression coeffi- 
cients. 

(b) Find a 9596 confidence interval on mean PITCH when 
TEMP = 1650, SOAKTIME = 1.00, SOAKPCT = 
1.10, DIFFTIME = 1.00, and DIFFPCT = 0.80. 


12-34. Reconsider the heat treating data in Exercises 

12-10 and 12-24, where we fit a model to PITCH using 

regressors x = SOAKTIME X SOAKPCT and x, = 

DIFFTIME X DIFFPCT. 

(a) Using the model with regressors x, and x;, find a 95% 
confidence interval on mean PITCH when SOAK- 
TIME = 1.00, SOAKPCT = 1.10, DIFFTIME = 1.00, 
and DIFFPCT = 0.80. 

(b) Compare the length of this confidence interval with the 
length of the confidence interval on mean PITCH at 
the same point from Exercise 12-33 part (b), where an 
additive model in SOAKTIME, SOAKPCT, DIFFTIME, 
and DIFFPCT was used. Which confidence interval is 
shorter? Does this tell you anything about which model 
is preferable? 


12-35. Consider the NHL data in Exercise 12-11. 

(a) Find a 95% confidence interval on the regression coeffi- 
cient for the variable “Pts.” 

(b) Fit a simple linear regression model relating the response 
variable “wins” to the regressor “Pts.” 

(c) Find a 95% confidence interval on the slope for the simple 
linear regression model from part (b). 

(d) Compare the lengths of the two confidence intervals com- 
puted in parts (a) and (c). Which interval is shorter? Does 
this tell you anything about which model is preferable? 


The residuals from the multiple regression model, defined by e; = y; — y; play an important 
role in judging model adequacy just as they do in simple linear regression. As noted in Section 
11-7.1, several residual plots are often useful; these are illustrated in Example 12-9. It is also 
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Figure 12-6 Normal probability plot of residuals. 


EXAMPLE 12-9 


MULTIPLE LINEAR REGRESSION 
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Figure 12-7 Plot of residuals against y. 
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helpful to plot the residuals against variables not presently in the model that are possible 
candidates for inclusion. Patterns in these plots may indicate that the model may be improved 
by adding the candidate variable. 


The residuals for the model from Example 12-1 are shown in Table 12-3. A normal probabil- 
ity plot of these residuals is shown in Fig. 12-6. No severe deviations from normality are ob- 
viously apparent, although the two largest residuals (e,; = 5.88 and e,, = 4.33) do not fall 
extremely close to a straight line drawn through the remaining residuals. 

The standardized residuals 


GOAT) 





are often more useful than the ordinary residuals when assessing residual magnitude. The 
standardized residuals corresponding to e,; and e,; are dj; = 5.88/V 5.2352 = 2.57 and 
diz = 4.33/V 4.2352 = 1.89, and they do not seem unusually large. Inspection of the data 
does not reveal any error in collecting observations 15 and 17, nor does it produce any other 
reason to discard or modify these two points. 

The residuals are plotted against y in Fig. 12-7, and against x, and x, in Figs. 12-8 and 
12-9, respectively." The two largest residuals, e,; and e,,, are apparent. Figure 12-8 gives 
some indication that the model underpredicts the pull strength for assemblies with short wire 
length (x, = 6) and long wire length (x; = 15) and overpredicts the strength for assemblies 
with intermediate wire length (7 < x, = 14). The same impression is obtained from Fig. 12-7. 





"There are other methods, described in Montgomery, Peck, and Vining (2001) and Myers (1990), that plot a modified 
version of the residual, called a partial residual, against each regressor. These partial residual plots are useful in dis- 
playing the relationship between the response y and each individual regressor. 
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Figure 12-8 Plot of residuals against x,. Figure 12-9 Plot of residuals against x». 


Either the relationship between strength and wire length is not linear (requiring that a term 
involving x7, say, be added to the model), or other regressor variables not presently in the 
model affected the response. 


In Example 12-9 we used the standardized residuals d; = e,/ Vô? as a measure of 
residual magnitude. Some analysts prefer to plot standardized residuals instead of ordinary 
residuals, because the standardized residuals are scaled so that their standard deviation is 
approximately unity. Consequently, large residuals (that may indicate possible outliers or un- 
usual observations) will be more obvious from inspection of the residual plots. 

Many regression computer programs compute other types of scaled residuals. One of the 
most popular is the studentized residual 


(12-42) 





where A; is the ith diagonal element of the matrix 
H-X(X'X)'!x' 
The H matrix is sometimes called the “hat” matrix, since 
$- XB = X(X'X) 'X'y = By 
Thus H transforms the observed values of y into a vector of fitted values y. 


Since each row of the matrix X corresponds to a vector, say x; = [1, Xj, Xj, ... , Xj], 
another way to write the diagonal elements of the hat matrix is 


(12-43) 





Note that apart from o^, h; is the variance of the fitted value y;. The quantities h; were used in 
the computation of the confidence interval on the mean response in Section 12-3.2. 
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Under the usual assumptions that the model errors are independently distributed with 
mean zero and variance o?, we can show that the variance of the ith residual e; is 


V(e;) = c*(1 E hii), i = 1, 2, — Tt 


Furthermore, the A; elements must fall in the interval 0 < h; = 1. This implies that the stan- 
dardized residuals understate the true residual magnitude; thus, the studentized residuals 
would be a better statistic to examine in evaluating potential outliers. 

To illustrate, consider the two observations identified in Example 12-9 as having residu- 
als that might be unusually large, observations 15 and 17. The standardized residuals are 


€15 5.88 e7 4.33 


d; =-= = —— = 225] and dy = = =" = 
^^ Và 52352 v AU. "veo 


Now A515 = 0.0737 and A;;,, = 0.2593, so the studentized residuals are 





1.89 


€15 5.88 


j Jaga ERE TET TENES 
and 
€17 = 4.33 
621 — hi7)  V5.2352(1 — 0.2593) 








ry7 


Notice that the studentized residuals are larger than the corresponding standardized residuals. 
However, the studentized residuals are still not so large as to cause us serious concern about 
possible outliers. 


12-5.2 Influential Observations 


Figure 12-10 A 
point that is remote in 
x-space. 


When using multiple regression, we occasionally find that some subset of the observations is 
unusually influential. Sometimes these influential observations are relatively far away from 
the vicinity where the rest of the data were collected. A hypothetical situation for two variables 
is depicted in Fig. 12-10, where one observation in x-space is remote from the rest of the data. 
The disposition of points in the x-space is important in determining the properties of the 
model. For example, point (x;;, xp) in Fig. 12-10 may be very influential in determining R’, the 
estimates of the regression coefficients, and the magnitude of the error mean square. 


x2 


Region containing 
all observations 
except the ;th 


EXAMPLE 12-10 
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We would like to examine the influential points to determine whether they control many 
model properties. If these influential points are *bad" points, or erroneous in any way, they should 
be eliminated. On the other hand, there may be nothing wrong with these points, but at least we 
would like to determine whether or not they produce results consistent with the rest of the data. In 
any event, even if an influential point is a valid one, if it controls important model properties, we 
would like to know this, since it could have an impact on the use of the model. 

Montgomery, Peck, and Vining (2001) and Myers (1990) describe several methods for 
detecting influential observations. An excellent diagnostic is the distance measure developed 
by Dennis R. Cook. This is a measure of the squared distance between the usual least squares 
estimate of B based on all n observations and the estimate obtained when the ith point is re- 


^ 


moved, say, B). The Cook distance measure is 





Clearly, 1f the ith point 1s influential, its removal will result in Bo changing considerably from 


the value fà. Thus, a large value of D; implies that the ith point is influential. The statistic D; is 
actually computed using 


(12-44) 





From Equation 12-44 we see that D; consists of the squared studentized residual, which 
reflects how well the model fits the ith observation y; [recall that r; = e;/V ó^(1 — hj)|anda 
component that measures how far that point is from the rest of the data [h,/(1 — hj) is a 
measure of the distance of the ith point from the centroid of the remaining n — 1 points]. A 
value of D; > 1 would indicate that the point is influential. Either component of D; (or both) 
may contribute to a large value. 


Table 12-11 lists the values of the hat matrix diagonals A; and Cook's distance measure D; for 
the wire bond pull strength data in Example 12-1. To illustrate the calculations, consider the 
first observation: 


PO(l—h) 
| Lev V MS&(1 — hi)pr A 
P (1 — hi) 
. [1.57/V5.2352(1 = 0.1973) ^ 0.1573 


3 (1 — 0.1573) 


D, = 


— 0.035 


The Cook distance measure D; does not identify any potentially influential observations in the 
data, for no value of D; exceeds unity. 
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Table 12-11 


Observations Cook's Distance Measure 
i h;; D 
1 0.1573 0.035 
2 0.1116 0.012 
3 0.1419 0.060 
4 0.1019 0.021 
5 0.0418 0.024 
6 0.0749 0.007 
7 0.1181 0.036 
8 0.1561 0.020 
9 0.1280 0.160 
10 0.0413 0.001 
11 0.0925 0.013 
12 0.0526 0.001 
13 0.0820 0.001 


EXERCISES FOR SECTION 12-5 


12-36. Consider the regression model for the NFL data in 

Exercise 12-4. 

(a) What proportion of total variability is explained by this 
model? 

(b) Construct a normal probability plot of the residuals. What 
conclusion can you draw from this plot? 

(c) Plot the residuals versus y and versus each regressor, and 
comment on model adequacy. 

(d) Are there any influential points in these data? 


12-37. Consider the gasoline mileage data in Exercise 12-5. 

(a) What proportion of total variability is explained by this 
model? 

(b) Construct a normal probability plot of the residuals and 
comment on the normality assumption. 

(c) Plot residuals versus y and versus each regressor. Discuss 
these residual plots. 

(d) Calculate Cook's distance for the observations in this data 
set. Are any observations influential? 


12-38. Consider the electric power consumption data in 
Exercise 12-6. 


(a) Calculate R? for this model. Interpret this quantity. 

(b) Plot the residuals versus y. Interpret this plot. 

(c) Construct a normal probability plot of the residuals and 
comment on the normality assumption. 


12-39. Consider the wear data in Exercise 12-7. 

(a) Find the value of R when the model uses the regressors x, 
and x). 

(b) What happens to the value of R* when an interaction term 
X,X, is added to the model? Does this necessarily imply 
that adding the interaction term is a good idea? 


Influence Diagnostics for the Wire Bond Pull Strength Data 2 


Observations Cook's Distance Measure 
i oe D, 
14 0.1129 0.003 
15 0.0737 0.187 
16 0.0879 0.001 
17 0.2593 0.565 
18 0.2929 0.155 
19 0.0962 0.018 
20 0.1473 0.000 
21 0.1296 0.052 
22 0.1358 0.028 
23 0.1824 0.002 
24 0.1091 0.040 
25 0.0729 0.000 


12-40. For the regression model for the wire bond pull 

strength data in Exercise 12-8. 

(a) Plot the residuals versus y and versus the regressors used in 
the model. What information do these plots provide? 

(b) Construct a normal probability plot of the residuals. Are 
there reasons to doubt the normality assumption for this 
model? 

(c) Are there any indications of influential observations in the 
data? 

12-41. 

12-9. 

(a) Plot the residuals from this model versus y. Comment on 
the information in this plot. 

(b) What is the value of R° for this model? 

(c) Refit the model using log HFE as the response variable. 

(d) Plot the residuals versus predicted log HFE for the model 
in part (c). Does this give any information about which 
model is preferable? 

(e) Plot the residuals from the model in part (d) versus the re- 
gressor x3. Comment on this plot. 

(f) Refit the model to log HFE using x, x», and 1/x;, as the re- 
gressors. Comment on the effect of this change in the model. 


Consider the semiconductor HFE data in Exercise 


12-42. Consider the regression model for the heat treating 

data in Exercise 12-10. 

(a) Calculate the percent of variability explained by this model. 

(b) Construct a normal probability plot for the residuals. 
Comment on the normality assumption. 

(c) Plot the residuals versus y and interpret the display. 

(d) Calculate Cook's distance for each observation and pro- 
vide an interpretation of this statistic. 


@ 


@ 
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12-43. In Exercise 12-24 we fit a model to the response 
PITCH in the heat treating data of Exercise 12-10 using new 
regressors xy = SOAKTIME X SOAKPCT and x, = DIFF- 
TIME X DIFFPCT. 

(a) Calculate the R^ for this model and compare it to the 
value of R? from the original model in Exercise 12-10. 
Does this provide some information about which model 
is preferable? 

(b) Plot the residuals from this model versus y and on a 
normal probability scale. Comment on model adequacy. 

(c) Find the values of Cook's distance measure. Are any ob- 
servations unusually influential? 

12-44. Consider the regression model for the NHL data 

from Exercise 12-11. 

(a) Fit a model using “pts” as the only regressor. 

(b) How much variability 1s explained by this model? 
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(c) Plot the residuals versus v» and comment on model 
adequacy. 

(d) Plot the residuals versus “PPG,” the points scored while in 
power play. Does this indicate that the model would be 
better if this variable were included? 


12-45. The diagonal elements of the hat matrix are often 
used to denote leverage—that is, a point that is unusual in its 
location in the x-space and that may be influential. Generally, 
the ith point is called a leverage point if its hat diagonal 
h;; exceeds 2p/n, which 1s twice the average size of all the hat 
diagonals. Recall that p = k + 1. 


(a) Table 12-11 contains the hat diagonal for the wire bond 
pull strength data used in Example 12-1. Find the average 
size of these elements. 

(b) Based on the criterion above, are there any observations 
that are leverage points in the data set? 


12-6 ASPECTS OF MULTIPLE REGRESSION MODELING 


In this section we briefly discuss several other aspects of building multiple regression models. 
For more extensive presentations of these topics and additional examples refer to Montgomery, 
Peck, and Vining (2001) and Myers (1990). 


12-6.1 Polynomial Regression Models 


The linear model y = Xf + eis a general model that can be used to fit any relationship that 
is linear in the unknown parameters p. This includes the important class of polynomial 
regression models. For example, the second-degree polynomial in one variable 


Y = Bo + Bix + Bite 


(12-45) 


and the second-degree polynomial in two variables 


_ 2 2 
Y = Bo + Bix, + Box» + Bixi + Bo2x5 + Bioxio; + € 


are linear regression models. 


(12-46) 


Polynomial regression models are widely used when the response is curvilinear, because 
the general principles of multiple regression can be applied. The following example illustrates 
some of the types of analyses that can be performed. 


EXAMPLE 12-11 


Sidewall panels for the interior of an airplane are formed in a 1500-ton press. The unit man- 


ufacturing cost varies with the production lot size. The data shown below give the average 
cost per unit (in hundreds of dollars) for this product ( y) and the production lot size (x). The 
scatter diagram, shown in Fig. 12-11, indicates that a second-order polynomial may be 


appropriate. 
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Figure 12-11 Data for Example 12-11. 


We will fit the model 


The y vector, X matrix, and B vector are as follows: 


1.8] 
1.70 
1.65 
1.55 
1.48 
1.40 
1.30 
1.26 
1.24 
1.21 
1.20 
1.18 


Y= Bo + Bix + Bua +e 
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Di 


p 


Solving the normal equations X'XB = X'y gives the fitted model 


$ = 2.19826629 — 0.02252236x + 0.00012507x? 


The test for significance of regression is shown in Table 12-12. Since f; = 2171.07 is signifi- 
cant at 1%, we conclude that at least one of the parameters B, and B,, 1s not zero. Furthermore, 
the standard tests for model adequacy do not reveal any unusual behavior, and we would con- 
clude that this is a reasonable model for the sidewall panel cost data. 
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Table 12-12 Test for Significance of Regression for the Second-Order Model in Example 12-11 


Source of Sum of Degrees of Mean 

Variation Squares Freedom Square fo P-value 
Regression 0.5254 p, 0.262700 2171.07 5.18E-15 
Error 0.0011 9 0.000121 
Total 0.5265 11 


In fitting polynomials, we generally like to use the lowest-degree model consistent with 
the data. In this example, it would seem logical to investigate the possibility of dropping the 
quadratic term from the model. That is, we would like to test 


Ho: By; = 0 
Hi: Di +0 


The general regression significance test can be used to test this hypothesis. We need to deter- 
mine the “extra sum of squares” due to B,,, or 


SSr(Bi1| Bi.Bo) — SSr(B15811| Bo) a SSr(B1| Bo) 


The sum of squares SSR(B1,B11| Bo) = 0.5254 from Table 12-12. To find SS&(B,| Bo), we fit a 
simple linear regression model to the original data, yielding 


y = 1.90036320 — 0.00910056x 
It can be easily verified that the regression sum of squares for this model is 
SSr(B1|Bo) = 0.4942 
Therefore, the extra sum of the squares due to B,,, given that B; and By are in the model, is 
SSp(B11|B1-B0) = SSs(Bi.Biil Bo) — SSr(B1| Bo) 
= 0.5254 — 0.4942 


= 0.0312 


The analysis of variance, with the test of Hp: Bj, = 0 incorporated into the procedure, is 
displayed in Table 12-13. Note that the quadratic term contributes significantly to the model. 


Table 12-13 Analysis of Variance for Example 12-11, Showing the Test for Hp: 1, = O 


Source of Degrees of Mean 
Variation Sum of Squares Freedom Square fo P-value 
Regression SSe(B15B11|/Bo) = 0.5254 2 0.262700 2171.07 5.18E-15 
Linear SSR(B1|Bo) = 0.4942 l 0.494200 4084.30 1.17E-15 
Quadratic SSR B11l| Bo,B1) = 0.0312 l 0.031200 258.18 5.51E-9 
Error 0.0011 9 0.00121 


Total 0.5265 11 
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12-6.2 Categorical Regressors and Indicator Variables 


EXAMPLE 12-12 


The regression models presented in previous sections have been based on quantitative vari- 
ables, that is, variables that are measured on a numerical scale. For example, variables such as 
temperature, pressure, distance, and voltage are quantitative variables. Occasionally, we need 
to incorporate categorical, or qualitative, variables in a regression model. For example, sup- 
pose that one of the variables in a regression model is the operator who is associated with each 
observation y;. Assume that only two operators are involved. We may wish to assign different 
levels to the two operators to account for the possibility that each operator may have a differ- 
ent effect on the response. 

The usual method of accounting for the different levels of a qualitative variable 1s to use 
indicator variables. For example, to introduce the effect of two different operators into a 
regression model, we could define an indicator variable as follows: 


E » if the observation 1s from operator 1 
1 if the observation is from operator 2 


In general, a qualitative variable with r-levels can be modeled by r — 1 indicator variables, 
which are assigned the value of either zero or one. Thus, if there are three operators, the 
different levels will be accounted for by the two indicator variables defined as follows: 


Xi X5 

0 0 if the observation 1s from operator 1 
1 0 if the observation is from operator 2 
0 |] if the observation is from operator 3 


Indicator variables are also referred to as dummy variables. The following example [from 
Montgomery, Peck, and Vining (2001)] illustrates some of the uses of indicator variables; for 
other applications, see Montgomery, Peck, and Vining (2001). 


A mechanical engineer is investigating the surface finish of metal parts produced on a lathe 
and its relationship to the speed (in revolutions per minute) of the lathe. The data are shown in 
Table 12-14. Note that the data have been collected using two different types of cutting tools. 
Since the type of cutting tool likely affects the surface finish, we will fit the model 


Y = Bo + By + Box + € 


where Y is the surface finish, x, is the lathe speed in revolutions per minute, and x, is an 
indicator variable denoting the type of cutting tool used; that is, 


+ for tool type 302 
X = 
1, for tool type 416 


The parameters in this model may be easily interpreted. If x; = 0, the model becomes 
Y= Bo T Dix, te 


which is a straight-line model with slope B, and intercept By. However, if x; = 1, the model 
becomes 


Y = Bo + Bixı + Bo(1) + € = (Bo + B2) + Bixı + € 


12-6 ASPECTS OF MULTIPLE REGRESSION MODELING 451 


Table 12-14 Surface Finish Data for Example 12-13 


Observation Surface Finish Type of Cutting Observation Surface Finish Type of Cutting 

Number, i yj RPM Tool Number, i Vi RPM Tool 
1 45.44 229 302 11 33.50 224 416 

2 42.03 200 302 12 31.23 212 416 

3 50.10 250 302 13 37:32 248 416 

4 48.75 245 302 14 37.13 260 416 

5 47.92 235 302 15 34.70 243 416 

6 47.79 237 302 16 33.92 238 416 

7 52.26 265 302 17 32.13 224 416 

8 50.52 259 302 18 35.47 25] 416 

9 45.58 22] 302 19 33.49 234 416 

10 44.78 218 302 20 32.20 216 416 


which is a straight-line model with slope B, and intercept Bo + B2. Thus, the model 
Y= Bo + Bix + Box. + € implies that surface finish is linearly related to lathe speed and 
that the slope B, does not depend on the type of cutting tool used. However, the type of cut- 
ting tool does affect the intercept, and B, indicates the change in the intercept associated with 
a change in tool type from 302 to 416. 

The X matrix and y vector for this problem are as follows: 


1 225 0 45.44 
1 200 0 42.03 
1 250 0 50.10 
1 245 0 48.75 
1 235 0 47.92 
] 237 0 47.79 
1 265 0 52.26 
] 259 Q 50.52 
1 221 0 45.58 
1 218 -0 44.78 
X= |1 224 1 33.50 
] 212 1 31.23 
] 248 |] 31.52 
1 260 1 AE 
1 243 | 34.70 
] 238 | 33.92 
1 224 |] PAE 
] 25% 1 35.47 
IL 232 J 33.49 
1 216 1 32.29 





The fitted model is 


ý = 14.27620 + 0.14115x, — 13.28020x, 
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Table 12-15 Analysis of Variance of Example 12-12 


Source of Degrees of Mean 
Variation Sum of Squares Freedom Square fo P-value 
Regression 1012.0595 2 506.0297 1103.69 1.02E-18 
SSR Bil Bo) 130.6091 l 130.6091 284.87 4.70E-12 
SSp(B2|B1,Bo) 881.4504 l 881.4504 1922.52 6.24E-19 
Error 7.1943 17 0.4508 
Total 1019.8538 19 


The analysis of variance for this model is shown in Table 12-15. Note that the hypothesis 
Ho: 8; = B2 = 0 (significance of regression) would be rejected at any reasonable level of 
significance because the P-value is very small. This table also contains the sums of squares 


SSR = SSr(Bi,B2| Bo) 
= SSr(BilBo) + SSr(B2|B1,Bo0) 


so a test of the hypothesis Hp: B2 = 0 can be made. Since this hypothesis is also rejected, we 
conclude that tool type has an effect on surface finish. 


It is also possible to use indicator variables to investigate whether tool type affects both 
the slope and intercept. Let the model be 


Y = Bo + Bix, + Box. + Bsxixo + € 
where x, is the indicator variable. Now if tool type 302 1s used, x, = 0, and the model is 
Y = Bo + Bix; te 
If tool type 416 is used, x, = 1, and the model becomes 


Y= Bo + Bix, + Bo + B3x, + e 
= (Bo + B5) + (Bi + B3 + € 


Note that B, is the change in the intercept and that B, is the change in slope produced by a 
change in tool type. 

Another method of analyzing these data is to fit separate regression models to the data 
for each tool type. However, the indicator variable approach has several advantages. First, 
only one regression model must be fit. Second, by pooling the data on both tool types, 
more degrees of freedom for error are obtained. Third, tests of both hypotheses on the 
parameters D; and B, are just special cases of the extra sum of squares method. 


12-6.3 Selection of Variables and Model Building 


An important problem in many applications of regression analysis involves selecting the set of 
regressor variables to be used in the model. Sometimes previous experience or underlying 
theoretical considerations can help the analyst specify the set or regressor variables to use in a 
particular situation. Usually, however, the problem consists of selecting an appropriate set of 
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regressors from a set that quite likely includes all the important variables, but we are sure that 
not all these candidate regressors are necessary to adequately model the response Y. 

In such a situation, we are interested in variable selection; that is, screening the candidate 
variables to obtain a regression model that contains the “best” subset of regressor variables. We 
would like the final model to contain enough regressor variables so that in the intended use of the 
model (prediction, for example) it will perform satisfactorily. On the other hand, to keep model 
maintenance costs to a minimum and to make the model easy to use, we would like the model to 
use as few regressor variables as possible. The compromise between these conflicting objectives 
is often called finding the "best" regression equation. However, in most problems, no single 
regression model is “best” in terms of the various evaluation criteria that have been proposed. A 
great deal of judgment and experience with the system being modeled is usually necessary to 
select an appropriate set of regressor variables for a regression equation. 

No single algorithm will always produce a good solution to the variable selection problem. 
Most of the currently available procedures are search techniques, and to perform satisfactorily, 
they require interaction with judgment by the analyst. We now briefly discuss some of the more 
popular variable selection techniques. We assume that there are K candidate regressors, x;, x», 
... , Xx, and a single response variable y. All models will include an intercept term Bo, so the 
model with a// variables included would have K + 1 terms. Furthermore, the functional form of 
each candidate variable (for example, x, — 1/x, x; — In x, etc.) is correct. 


All Possible Regressions 

This approach requires that the analyst fit all the regression equations involving one candidate 
variable, all regression equations involving two candidate variables, and so on. Then these 
equations are evaluated according to some suitable criteria to select the "best" regression 
model. If there are K candidate regressors, there are 2^ total equations to be examined. For 
example, if K = 4, there are 2^ = 16 possible regression equations; while if K = 10, there are 
2'° = 1024 possible regression equations. Hence, the number of equations to be examined 
increases rapidly as the number of candidate variables increases. However, there are some 
very efficient computing algorithms for all possible regressions available and they are widely 
implemented in statistical software, so it is a very practical procedure unless the number of 
candidate regressors 1s fairly large. 

Several criteria may be used for evaluating and comparing the different regression mod- 
els obtained. A commonly used criterion is based on the value of R? or the value of the 
adjusted R^, Rs. Basically, the analyst continues to increase the number of variables in the 
model until the increase in R° or the adjusted A7, is small. Often, we will find that the R7, will 
stabilize and actually begin to decrease as the number of variables in the model increases. 
Usually, the model that maximizes Rea is considered to be a good candidate for the best re- 
gression equation. Because we can write Rz; = 1 — (MSz/[SSz/(n — 1)]) and SSz/(n — 1) 
is a constant, the model that maximizes the Ra value also minimizes the mean square error, 
so this is a very attractive criterion. 

Another criterion used to evaluate regression models is the C, statistic, which is a meas- 
ure of the total mean square error for the regression model. We define the total standardized 
mean square error for the regression model as 


| Merc" 
D, = Ge Ale 7 EY 
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EXAMPLE 12-13 


We use the mean square error from the full K + 1 term model as an estimate of o°; that is, 
6^ = MSg(K + 1). Then an estimator of I’, is [see Montgomery, Peck, and Vining (2001) or 
Myers (1990) for the details]: 





If the p-term model has negligible bias, it can be shown that 
E(C,| zero bias) = p 


Therefore, the values of C, for each regression model under consideration should be evaluated 
relative to p. The regression equations that have negligible bias will have values of C, that are 
close to p, while those with significant bias will have values of C, that are significantly greater 
than p. We then choose as the "best" regression equation either a model with minimum C, or 
a model with a slightly larger C,, that does not contain as much bias (1.e., C, = p). 

The PRESS statistic can also be used to evaluate competing regression models. PRESS is 
an acronym for Prediction Error Sum of Squares, and it is defined as the sum of the squares of 
the differences between each observation y; and the corresponding predicted value based on a 
model fit to the remaining n — 1 points, say y. So PRESS provides a measure of how well 
the model is likely to perform when predicting new data, or data that was not used to fit the 
regression model. The computing formula for PRESS is 





where e; = y; — y; is the usual residual. Thus PRESS is easy to calculate from the standard 
least squares regression results. Models that have small values of PRESS are preferred. 


Table 12-16 presents data on taste-testing 38 brands of pinot noir wine (the data were first 
reported in an article by Kwan, Kowalski, and Skogenboe in an article in the Journal of Agri- 
cultural and Food Chemistry, Vol. 27, 1979, and it also appears as one of the default data sets 
in Minitab). The response variable is y — quality, and we wish to find the "best" regression 
equation that relates quality to the other five parameters. 

Figure 12-12 is the matrix of scatter plots for the wine quality data, as constructed by 
Minitab. We notice that there are some indications of possible linear relationships between 
quality and the regressors, but there is no obvious visual impression of which regressors 
would be appropriate. Table 12-16 lists the all possible regressions output from Minitab. In 
this analysis, we asked Minitab to present the best three equations for each subset size. Note 
that Minitab reports the values of R’, Ria. C,, and S = VMS; for each model. From Table 
12-17 we see that the three-variable equation with x, = aroma, x, = flavor, and x; = oaki- 
ness produces the minimum C, equation, whereas the four-variable model, which adds 
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Table 12-16 Wine Quality Data 


X X2 X3 X4 Xs V 
Clarity Aroma Body Flavor Oakiness Quality 

1 1.0 3.3 2.8 3.1 4.] 9.8 

2 1.0 4.4 4.9 3.5 3.9 12.6 

3 1.0 3.9 5.3 4.8 4.7 11.9 

4 1.0 3.9 2.6 3.1 3.6 11.1 

5 1.0 5.6 5.1 2. 54 13.3 

6 1.0 4.6 4.7 5.0 4.1 12.8 

7 1.0 4.8 4.8 4.8 3.3 12.8 

8 1.0 5.3 4.5 4.3 5.2 12.0 

9 1.0 4.3 4.3 3.9 2.9 13.6 
10 1.0 4.3 3.9 4.7 3.9 13.9 
11 1.0 SA 4.3 4.5 3.6 14.4 
12 0.5 3.3 5.4 4.3 3.6 12.3 
13 0.8 5.9 5.7 7.0 4.] 16.1 
14 0.7 Ta 6.6 6.7 3.7 16.1 
15 1.0 7.1 4.4 5.8 4.1 15.5 
16 0.9 5.3 5.6 5.6 4.4 15.5 
17 1.0 6.3 5.4 4.8 4.6 13.8 
18 1.0 5.0 SES) 205 4.] 13.8 
19 1.0 4.6 4.1 4.3 3.1 11.3 
20 0.9 3.4 5.0 3.4 3.4 7.9 
21 0.9 6.4 5.4 6.6 4.8 15.1 
22 1.0 2: 5.3 213 3.8 13.5 
23 0.7 4.7 4.] 5.0 3.7 10.8 
24 0.7 4.1 4.0 4.1 4.0 9.5 
25 1.0 6.0 5.4 5.7 4.7 12.7 
26 1.0 4.3 4.6 4.7 4.9 11.6 
27 1.0 3.9 4.0 5.1 5.1 11.7 
28 1.0 5.1 4.9 5.0 5.1 11.9 
29 1.0 3.9 4.4 5.0 4.4 10.8 
30 1.0 4.5 37 2.9 3.9 8.5 
31 1.0 2:2 4.3 5.0 6.0 10.7 
32 0.8 4.2 3.8 3.0 4.7 9.1 
33 1.0 3.3 3.5 4.3 4.5 12.1 
34 1.0 6.8 5.0 6.0 DZ 14.9 
35 0.8 5.0 5.4 5.5 4.8 13:5 
36 0.8 3.5 4.7 4.2 3.3 12.2 
37 0.8 4.3 5.5 3.9 5.8 10.3 
38 0.8 5.2 4.8 5.7 3.9 13:2 


x, = clarity to the previous three regressors, results in maximum Rea (or minimum MSp). The 


three-variable model is 
y = 6.47 + 0.580x, + 1.20x4 — 0.602x; 
and the four-variable model is 


y = 4.99 + 1.79x, + 0.530», + 1.26x4 — 0.659x;5 
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Table 12-17 Minitab All Possible Regressions Output for the Wine Quality Data 


Best Subsets Regression: Quality versus Clarity, Aroma, ... 


Response is Quality 


Vars 


mÓ 


Un BB BBW wWWN NN YK — 


R-Sq 
62.4 
50.0 
30.1 
66.1 
65.9 
63.3 
70.4 
68.0 
66.5 
Tis 
70.5 
69.3 
72.1 


R-Sq (adj) 
61.4 
48.6 
28.2 
64.2 
63.9 
61.2 
67.8 
65.2 
63.5 
68.0 
66.9 
65.6 
67.7 
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These models should now be evaluated further using residuals plots and the other tech- 
niques discussed earlier in the chapter, to see if either model is satisfactory with respect to the 
underlying assumptions and to determine if one of them is preferable. It turns out that the 
residual plots do not reveal any major problems with either model. The value of PRESS for 
the three-variable model 1s 56.0525 and for the four-variable model it is 60.3927. Since 
PRESS is smaller in the model with three regressors, and since it is the model with the fewest 
predictors, it would likely be the preferred choice. 


Stepwise Regression 

Stepwise regression is probably the most widely used variable selection technique. The pro- 
cedure iteratively constructs a sequence of regression models by adding or removing variables 
at each step. The criterion for adding or removing a variable at any step is usually expressed 
in terms of a partial F-test. Let f, be the value of the F-random variable for adding a variable 
to the model, and let fo, be the value of the F-random variable for removing a variable from 
the model. We must have fia = fo, and usually fin = fout- 

Stepwise regression begins by forming a one-variable model using the regressor variable 
that has the highest correlation with the response variable Y. This will also be the regressor 
producing the largest F-statistic. For example, suppose that at this step, x, 1s selected. At the 
second step, the remaining K — 1 candidate variables are examined, and the variable for 
which the partial F-statistic 


u SSr(B;|B1,B0) 


is a maximum is added to the equation, provided that f; > f... In equation 12-48, MS; (xj, x1) 
denotes the mean square for error for the model containing both x, and x; Suppose that this 
procedure indicates that x, should be added to the model. Now the stepwise regression algo- 
rithm determines whether the variable x, added at the first step should be removed. This is 
done by calculating the F-statistic 


— SSp(B1|B2.B0) 
F, = MSg(x, xj) m (12-49) 


If the calculated value f, < fu the variable x, is removed; otherwise it is retained, and we 
would attempt to add a regressor to the model containing both x, and x,. 

In general, at each step the set of remaining candidate regressors is examined, and the 
regressor with the largest partial F-statistic is entered, provided that the observed value of 
f exceeds f,,. Then the partial F-statistic for each regressor in the model is calculated, and the 
regressor with the smallest observed value of F is deleted if the observed f < fut The 
procedure continues until no other regressors can be added to or removed from the model. 

Stepwise regression is almost always performed using a computer program. The analyst 
exercises control over the procedure by the choice of fj, and fu- Some stepwise regression 
computer programs require that numerical values be specified for f,, and faut Since the num- 
ber of degrees of freedom on MS, depends on the number of variables in the model, which 
changes from step to step, a fixed value of f,, and fu causes the type I and type II error rates 
to vary. Some computer programs allow the analyst to specify the type I error levels for f;, and 
Fou. However, the “advertised” significance level is not the true level, because the variable 
selected is the one that maximizes (or minimizes) the partial F-statistic at that stage. 
Sometimes it is useful to experiment with different values of f;, and fa (or different advertised 
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EXAMPLE 12-14 


Table 12-18 Minitab Stepwise Regression Output for the 
Wine Quality Data 


Stepwise Regression: Quality versus Clarity, Aroma, ... 
Alpha-to-Enter: 0.15 Alpha-to-Remove: 0.15 


Response is Quality on 5 predictors, with N = 38 


Step l p 3 
Constant 4.941 6.912 6.467 
Flavor 1.57 1.64 1.20 
T-Value 173 8.25 4.36 
P-Value 0.000 0.000 0.000 
Oakiness — 0.54 — 0.60 
T-Value — 1.95 —2.28 
P-Value 0.059 0.029 
Aroma 0.58 
T-Value 2.21 
P-Value 0.034 
S 1:27 1:22 1.16 
R-Sq 62.42 66.11 70.38 
R-Sq(ad]) 61.37 64.17 67.76 
C-p 9.0 6.8 3.9 


type I error rates) in several different runs to see if this substantially affects the choice of the 
final model. 


Table 12-18 gives the Minitab stepwise regression output for the wine quality data. Minitab uses 
fixed values of a for entering and removing variables. The default level is a = 0.15 for both de- 
cisions. The output in Table 12-18 uses the default value. Notice that the variables were entered in 
the order Flavor (step 1), Oakiness (step 2), and Aroma (step 3) and that no variables were re- 
moved. No other variable could be entered, so the algorithm terminated. This is the three-variable 
model found by all possible regressions that results in a minimum value of C,,. 


Forward Selection 

The forward selection procedure is a variation of stepwise regression and is based on the 
principle that regressors should be added to the model one at a time until there are no remain- 
ing candidate regressors that produce a significant increase in the regression sum of squares. 
That is, variables are added one at a time as long as their partial F-value exceeds f;,. Forward 
selection is a simplification of stepwise regression that omits the partial F-test for deleting 
variables from the model that have been added at previous steps. This is a potential weakness 
of forward selection; that is, the procedure does not explore the effect that adding a regressor 
at the current step has on regressor variables added at earlier steps. Notice that if we were to 
apply forward selection to the wine quality data, we would obtain exactly the same results as 
we did with stepwise regression in Example 12-14, since stepwise regression terminated 
without deleting a variable. 
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Table 12-19 Minitab Backward Elimination Output for the 
Wine Quality Data 


Stepwise Regression: Quality versus Clarity, Aroma, ... 


Backward elimination. Alpha-to-Remove: 0.1 


Response is Quality on 5 predictors, with N = 38 


Step l 2 3 
Constant 3.997 4.986 6.467 
Clarity 2.3 1.8 
T-Value 1.35 1.12 
P-Value 0.187 0.269 
Aroma 0.48 0.53 0.58 
T-Value 1.77 2.00 2.21 
P-Value 0.086 0.054 0.034 
Body 0.27 
T-Value 0.82 
P-Value 0.418 
Flavor 1.17 1.26 1.20 
T-Value 3.84 4.52 4.36 
P-Value 0.001 0.000 0.000 
Oakiness —0.68 — 0.66 —0.60 
T-Value =2.52 —2.46 —2.28 
P-Value 0.017 0.019 0.029 
S 1.16 1.16 1.16 
R-Sq 72.06 71.47 70.38 
R-Sq(ad]) 67.69 68.01 67.76 
C-p 6.0 4.7 3.9 


Backward Elimination 
The backward elimination algorithm begins with all K candidate regressors in the model. Then 
the regressor with the smallest partial F-statistic is deleted 1f this F-statistic is insignificant, that 
is, iff < four. Next, the model with K — 1 regressors is fit, and the next regressor for potential 
elimination is found. The algorithm terminates when no further regressor can be deleted. 

Table 12-19 shows the Minitab output for backward elimination applied to the wine quality 
data. The a value for removing a variable is & = 0.10. Notice that this procedure removes Body at 
step 1 and then Clarity at step 2, terminating with the three-variable model found previously. 


Some Comments on Final Model Selection 

We have illustrated several different approaches to the selection of variables in multiple linear 
regression. The final model obtained from any model-building procedure should be subjected 
to the usual adequacy checks, such as residual analysis, lack-of-fit testing, and examination of 
the effects of influential points. The analyst may also consider augmenting the original set of 
candidate variables with cross-products, polynomial terms, or other transformations of the 
original variables that might improve the model. A major criticism of variable selection meth- 
ods such as stepwise regression is that the analyst may conclude there is one "best" regression 
equation. Generally, this 1s not the case, because several equally good regression models can 
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often be used. One way to avoid this problem is to use several different model-building tech- 
niques and see if different models result. For example, we have found the same model for the 
wine quality data using stepwise regression, forward selection, and backward elimination. The 
same model was also one of the two best found from all possible regressions. The results from 
variable selection methods frequently do not agree, so this is a good indication that the three- 
variable model is the best regression equation. 

If the number of candidate regressors is not too large, the all-possible regressions method 
is recommended. We usually recommend using the minimum MS; and C, evaluation criteria 
in conjunction with this procedure. The all-possible regressions approach can find the “best” 
regression equation with respect to these criteria, while stepwise-type methods offer no such 
assurance. Furthermore, the all-possible regressions procedure is not distorted by dependen- 
cies among the regressors, as stepwise-type methods are. 


12-6.4 Multicollinearity 


In multiple regression problems, we expect to find dependencies between the response 
variable Y and the regressors x;. In most regression problems, however, we find that there 
are also dependencies among the regressor variables x;. In situations where these depend- 
encies are strong, we say that multicollinearity exists. Multicollinearity can have serious 
effects on the estimates of the regression coefficients and on the general applicability of 
the estimated model. 

The effects of multicollinearity may be easily demonstrated. The diagonal elements of the 
matrix C = (X'X) can be written as 


l 
(nc 
= xem 


jJ=1,2,...,k 

where R$ is the coefficient of multiple determination resulting from regressing x, on the 

other k — 1 regressor variables. Clearly, the stronger the linear dependency of x; on the re- 

maining regressor variables, and hence the stronger the multicollinearity, the larger the 

value of R will be. Recall that vB) = ø°Cy. Therefore, we say that the variance of B; is 
"inflated" iy the quantity (1 — R; yo . Consequently, we define the variance inflation 

factor for B; as 





(12-50) 





These factors are an important measure of the extent to which multicollinearity is present. 

Although the estimates of the regression coefficients are very imprecise when multi- 
collinearity 1s present, the fitted model equation may still be useful. For example, suppose we 
wish to predict new observations on the response. If these predictions are interpolations in the 
original region of the x-space where the multicollinearity 1s in effect, satisfactory predictions 
will often be obtained, because while individual B; may be poorly estimated, the function 
X j=1 ST j may be estimated quite well. On the other hand, if the prediction of new obseva- 
tions requires extrapolation beyond the original region of the x-space where the data were col- 
lected, generally we would expect to obtain poor results. Extrapolation usually requires good 
estimates of the individual model parameters. 
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Multicollinearity arises for several reasons. It will occur when the analyst collects data 
such that a linear constraint holds approximately among the columns of the X matrix. For ex- 
ample, if four regressor variables are the components of a mixture, such a constraint will 
always exist because the sum of the components is always constant. Usually, these constraints 
do not hold exactly, and the analyst might not know that they exist. 

The presence of multicollinearity can be detected 1n several ways. Two of the more easily 
understood of these will be discussed briefly. 


1. The variance inflation factors, defined in equation 12-50, are very useful measures 
of multicollinearity. The larger the variance inflation factor, the more severe the mul- 
ticollinearity. Some authors have suggested that 1f any variance inflation factor ex- 
ceeds 10, multicollinearity is a problem. Other authors consider this value too liberal 
and suggest that the variance inflation factors should not exceed 4 or 5. Minitab will 
calculate the variance inflation factors. Table 12-4 presents the Minitab multiple re- 
gression output for the wire bond pull strength data. Since both VIF, and VIF, are 
small, there is no problem with multicollinearity. 


2. Ifthe F-test for significance of regression is significant, but tests on the individual 
regression coefficients are not significant, multicollinearity may be present. 


Several remedial measures have been proposed for solving the problem of multi- 
collinearity. Augmenting the data with new observations specifically designed to break up the 
approximate linear dependencies that currently exist is often suggested. However, this is 
sometimes impossible because of economic reasons or because of the physical constraints that 
relate the x;. Another possibility is to delete certain variables from the model, but this approach 
has the disadvantage of discarding the information contained in the deleted variables. 

Since multicollinearity primarily affects the stability of the regression coefficients, it would 
seem that estimating these parameters by some method that is less sensitive to multicollinearity 
than ordinary least squares would be helpful. Several methods have been suggested. One alterna- 
tive to ordinary least squares, ridge regression, can be useful in combating multicollinearity. For 
more details on ridge regression, see Section 12-6.5 on the CD material or the more extensive pre- 
sentations in Montgomery, Peck, and Vining (2001) and Myers (1990). 


12-6.5 Ridge Regression (CD Only) 
12-6.6 Nonlinear Regression (CD Only) 


EXERCISES FOR SECTION 12-6 


12-46. An article entitled “A Method for Improving the 
Accuracy of Polynomial Regression Analysis" in the Journal 
of Quality Technology (1971, pp. 149—155) reported the fol- 
lowing data on y — ultimate shear strength of a rubber com- 
pound (psi) and x = cure temperature (°F). 


y 770 800 840 810 


xX 280 284 207 295 
590 560 


x 298 305 308 315 


(a) Fit a second-order polynomial to these data. 
(b) Test for significance of regression using a = 0.05. 
(c) Test the hypothesis that Bj, = Ousinga = 0.05. 


(d) Compute the residuals from part (a) and use them to eval- 
uate model adequacy. 


12-47. Consider the following data, which result from an 
experiment to determine the effect of x = test time in hours at 
a particular temperature on y — change in oil viscosity: 





y — 1.42 —]1.39 =o —1.89 —2.43 
x 2D .50 215 1.00 1.25 
y —3.15 —4.05 229215 —6.43 — 7.89 





x 1.50 1:75 2.00 2.22 2.50 


(a) Fit a second-order polynomial to the data. 
(b) Test for significance of regression using a = 0.05. 
(c) Test the hypothesis that Dj, = 0 using a = 0.05. 
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(d) Compute the residuals from part (a) and use them to eval- 
uate model adequacy. 


12-48. When fitting polynomial regression models, we 
often subtract x from each x value to produce a “centered” 
regressor x' — x — x. This reduces the effects of dependencies 
among the model terms and often leads to more accurate esti- 
mates of the regression coefficients. Using the data from 
Exercise 12-46, fit the model Y = B, + Bix’ + Bx)? + e. 
Use the results to estimate the coefficients in the uncentered 
model Y = By + Bix + Bix? + e. 


12-49. Suppose that we use a standardized variable 
x' = (x — x)/s,, where s, is the standard deviation of x, in 
constructing a polynomial regression model. Using the data in 
Exercise 12-46 and the standardized variable approach, fit the 
model Y = By + Bix’ + Bix)? + e. 

(a) What value of y do you predict when x = 285°F? 

(b) Estimate the regression coefficients in the unstandardized 
model Y = By + Bix + Bix? + e. 

(c) What can you say about the relationship between SS; and 
R? for the standardized and unstandardized models? 

(d) Suppose that y' — (y — y)/s, is used in the model along 
with x'. Fit the model and comment on the relationship 
between SS, and R? in the standardized model and the 
unstandardized model. 

12-50. The following data shown were collected during an 

experiment to determine the change in thrust efficiency (y, in 

percent) as the divergence angle of a rocket nozzle (x) changes: 





x 4.0 4.0 4.0 5.0 5.0 6.0 


y 67.11 67.24 67.15 77.87 80.11 84.67 
x 6.5 6.5 6.75 7.0 Tal 7.3 


(a) Fit a second-order model to the data. 

(b) Test for significance of regression and lack of fit using 
a = 0.05. 

(c) Test the hypothesis that B,, = 0, using a = 0.05. 

(d) Plot the residuals and comment on model adequacy. 

(e) Fit a cubic model, and test for the significance of the cubic 
term using a = 0.05. 


12-51. An article in the Journal of Pharmaceuticals 
Sciences (Vol. 80, 1991, pp. 971—977) presents data on the ob- 
served mole fraction solubility of a solute at a constant tem- 
perature and the dispersion, dipolar, and hydrogen bonding 
Hansen partial solubility parameters. The data are as shown in 
the following table, where y is the negative logarithm of the 
mole fraction solubility, x, 1s the dispersion partial solubility, 
x, is the dipolar partial solubility, and x4 is the hydrogen 
bonding partial solubility. 
(a) Fit the model Y = Bo + Bix, 5x5 + B3x3 + 

By rx 1X2 + Braxixs + Bosxoxs + Brix + Baox5 + Basi + €. 
(b) Test for significance of regression using a = 0.05. 
(c) Plot the residuals and comment on model adequacy. 


Observation 
Number y Xi Xp X3 
l 0.22200 7.3 0.0 0.0 
2 0.39500 8.7 0.0 0.3 
3 0.42200 8.8 0.7 1.0 
4 0.43700 8.1 4.0 0.2 
5 0.42800 9.0 0.5 1.0 
6 0.46700 8.7 1.5 2.8 
7 0.44400 9.3 2l 1.0 
8 0.37800 7.6 5.1 3.4 
9 0.49400 10.0 0.0 0.3 
10 0.45600 8.4 3.7 4.1 
11 0.45200 9.3 3.6 2.0 
12 0.11200 T 2.8 TA 
13 0.43200 9.8 4.2 2.0 
14 0.10100 7.3 2.9 6.8 
15 0.23200 8.5 2.0 6.6 
16 0.30600 9.5 2.5 5.0 
17 0.09230 7.4 2.8 7.8 
18 0.11600 7.8 2.8 7.7 
19 0.07640 7.7 3.0 8.0 
20 0.43900 10.3 Nr) 4.2 
21 0.09440 7.8 3.3 8.5 
22 0.11700 7.1 3.9 6.6 
23 0.07260 Tel 4.3 9.5 
24 0.04120 7.4 6.0 10.9 
25 0.25100 7.3 2.0 2.2 
26 0.00002 7.6 7.8 20.7 


(d) Use the extra sum of squares method to test the contribu- 
tion of the second-order terms using a = 0.05. 


12-52. Consider the gasoline mileage data in Exercise 12-5. 

(a) Discuss how you would model the information about the 
type of transmission in the car. 

(b) Fit a regression model to the gasoline mileage using 
engine displacement, horsepower, and the type of trans- 
mission in the car as the regressors. 

(c) Is there evidence that the type of transmission affects 
gasoline mileage performance? 


12-53. Consider the tool life data in Example 12-12. Test 
the hypothesis that two different regression models (with dif- 
ferent slopes and intercepts) are required to adequately model 
the data. Use indicator variables in answering this question. 
12-54. Use the National Football League Team Performance 
data in Exercise 12-4 to build regression models using the 
following techniques: 

(a) All possible regressions. Find the equations that minimize 

MS; and that minimize C,,. 
(b) Stepwise regression. 


SZ 
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(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the various models obtained. Which model 
seems “best,” and why? 


12-55. Use the gasoline mileage data in Exercise 12-5 to 

build regression models using the following techniques: 

(a) All possible regressions. Find the minimum C, and mini- 
mum MS; equations. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the various models obtained. 


12-56. Consider the electric power data in Exercise 12-6. 

Build regression models for the data using the following 

techniques: 

(a) All possible regressions. 

(b) Stepwise regression. 

(c) Forward selection. 

(d) Backward elimination. 

(e) Comment on the models obtained. Which model would 
you prefer? 

12-57. Consider the wire bond pull strength data in 

Exercise 12-8. Build regression models for the data using the 

following methods: 

(a) Stepwise regression. 

(b) Forward selection. 

(c) Backward elimination. 

(d) Comment on the models obtained. Which model would 
you prefer? 

12-58. Consider the NHL data in Exercise 12-11. Build 

regression models for these data using the following methods: 

(a) Stepwise regression. 

(b) Forward selection. 

(c) Backward elimination. 

(d) Which model would you prefer? 


12-59. Consider the data in Exercise 12-51. Use all the terms 

in the full quadratic model as the candidate regressors. 

(a) Use forward selection to identify a model. 

(b) Use backward elimination to identify a model. 

(c) Compare the two models obtained in parts (a) and (b). 
Which model would you prefer and why? 


12-60. Find the minimum C, equation and the equation that 
maximizes the adjusted R° statistic for the wire bond pull 
strength data in Exercise 12-8. Does the same equation satisfy 
both criteria? 


12-61. For the NHL data in Exercise 12-11. 

(a) Find the equation that minimizes C,. 

(b) Find the equation that minimizes M$;. 

(c) Find the equation that maximizes the adjusted R°. Is this 
the same equation you found in part (b)? 

12-62. We have used a sample of 30 observations to fit a 


regression model. The full model has nine regressors, the vari- 
ance estimate is 6? = MS; = 100, and R? = 0.92. 
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(a) Calculate the F-statistic for testing significance of regres- 
sion. Using a = 0.05, what would you conclude? 

(b) Suppose that we fit another model using only four of the 
original regressors and that the error sum of squares for 
this new model is 2200. Find the estimate of o? for this 
new reduced model. Would you conclude that the reduced 
model is superior to the old one? Why? 

(c) Find the value of C, for the reduced model in part (b). 
Would you conclude that the reduced model is better than 
the old model? 


12-63. A sample of 25 observations is used to fit a regres- 
sion model in seven variables. The estimate of o? for this full 
model is MS, = 10. 

(a) A forward selection algorithm has put three of the original 
seven regressors in the model. The error sum of squares 
for the three-variable model is SS; = 300. Based on C,, 
would you conclude that the three-variable model has any 
remaining bias? 

After looking at the forward selection model in part (a), 
suppose you could add one more regressor to the model. 
This regressor will reduce the error sum of squares to 
275. Will the addition of this variable improve the 
model? Why? 


(b 


— 


Supplemental Exercises 


12-64. The data shown in the table on page 464 represent 
the thrust of a jet-turbine engine (y) and six candidate 
regressors: x, = primary speed of rotation, x, = secondary 
speed of rotation, x, = fuel flow rate, x, = pressure, x; = 
exhaust temperature, and x; = ambient temperature at time 
of test. 

(a) Fit a multiple linear regression model using x4 — fuel flow 
rate, x, = pressure, and x4 = exhaust temperature as the 
regressors. 

(b) Test for significance of regression using a = 0.01. Find 
the P-value for this test. What are your conclusions? 

(c) Find the t-test statistic for each regressor. Using a = 0.01, 
explain carefully the conclusion you can draw from these 
statistics. 

(d) Find R? and the adjusted statistic for this model. 

(e) Construct a normal probability plot of the residuals and 
interpret this graph. 

(f) Plot the residuals versus y. Are there any indications of 
inequality of variance or nonlinearity? 

(g) Plot the residuals versus x4. Is there any indication of 
nonlinearity? 

(h) Predict the thrust for an engine for which x, = 1670, 
x, = 170, and x; = 1589. 

12-65. Consider the engine thrust data in Exercise 12-64. 

Refit the model using y* = In y as the response variable and 

x3 = In x4 as the regressor (along with x, and x;). 

(a) Test for significance of regression using a = 0.01. Find 
the P-value for this test and state your conclusions. 


es 
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Observation 
Number X, n X3 Xj X5 y 
l 3 3 3 0 0.787 
2 8 30 8 0 0.293 
3 3 6 6 6 0 1.710 
4 4 4 4 12 0 0.203 
5 8 7 6 5 0 0.806 
6 10 20 5 5 0 4.713 
f 8 6 3 3 25 0.607 
8 24 4 4 25 9.107 
9 4 10 12 4 25 9.210 
10 16 12 8 4 25 1.365 
11 3 10 8 8 25 4.554 
12 8 3 3 3 25 0.293 
13 3 6 3 3 50 2.292 
14 3 8 8 3 50 9.167 
15 4 8 4 8 50 0.694 
16 5 2 2 2 50 0.379 
17 2 2 2 3 50 0.485 
18 10 15 3 3 50 3.345 
19 15 6 2 3 50 0.208 
20 15 6 2 3 75 0.201 
21 10 4 3 3 78 0.329 
22. 3 8 2 2 75 4.966 
23 6 6 6 4 T5 1.362 
24 2 3 8 6 75 1.515 
25 3 3 8 8 75 0.751 


(b) Use the t-statistic to test Hp: B; = 0 versus H,: B; # 0 for 
each variable in the model. Ifa = 0.01, what conclusions 
can you draw? 

(c) Plot the residuals versus ?* and versus x3. Comment on 
these plots. How do they compare with their counterparts 
obtained in Exercise 12-64 parts (f) and (g)? 


12-66. The transient points of an electronic inverter are 
influenced by many factors. Table 12-20 gives data on the 
transient point ( y, in volts) of PMOS-NMOS inverters and five 
candidate regressors:, x, = width of the NMOS device, x, = 
length of the NMOS device, x; = width of the PMOS device, 
x, = length of the PMOS device, and x; = temperature (°C). 
(a) Fit the multiple linear regression model to these data. Test 

for significance of regression using a = 0.01. Find the 

P-value for this test and use it to draw your conclusions. 

(b) Test the contribution of each variable to the model using 
the t-test with a = 0.05. What are your conclusions? 

(c) Delete x; from the model. Test the new model for signifi- 
cance of regression. Also test the relative contribution of 
each regressor to the new model with the t-test. Using 
a = 0.05, what are your conclusions? 


(d) Notice that the MS, for the model in part (c) is smaller 
than the MS, for the full model in part (a). Explain why 
this has occurred. 

(e) Calculate the studentized residuals. Do any of these seem 
unusually large? 

(f) Suppose that you learn that the second observation was in- 
correctly recorded. Delete this observation and refit the 
model using x,, x», x4, and x, as the regressors. Notice that 
the R? for this model is considerably higher than the R? for 
either of the models fitted previously. Explain why the R? 
for this model has increased. 

(g) Test the model from part (f) for significance of regression 
using a = 0.05. Also investigate the contribution of each 
regressor to the model using the t-test with a = 0.05. 
What conclusions can you draw? 

(h) Plot the residuals from the model in part (f) versus y and 
versus each of the regressors x,, x2, xs, and x4. Comment 
on the plots. 


12-67. Consider the inverter data in Exercise 12-66. Delete 
observation 2 from the original data. Define new variables as fol- 
lows: y* = Iny, xt =1/Vx,, xj = Vx,xi = 1/V%, 
and x. = Nx. 

(a) Fit a regression model using these transformed regressors 
(do not use x;). 

(b) Test the model for significance of regression using a = 
0.05. Use the t-test to investigate the contribution of 
each variable to the model (a — 0.05). What are your 
conclusions? 

(c) Plot the residuals versus $t and versus each of the trans- 
formed regressors. Comment on the plots. 


12-68. Following are data on y = green liquor (g/l) and x = 


paper machine speed (feet per minute) from a Kraft paper [ ) 


machine. (The data were read from a graph in an article in the 
Tappi Journal, March 1986.) 


y 16.0 15.8 15.6 15:5 14.8 
x 1700 1720 1730 1740 1750 
y 14.0 13.5 13.0 12.0 11.0 
x 1760 1770 1780 1790 1795 


(a) Fit the model Y= By + Bix + Box? + € using least 
squares. 

(b) Test for significance of regression using a = 0.05. What 
are your conclusions? 

(c) Test the contribution of the quadratic term to the model, 
over the contribution of the linear term, using an F-statistic. 
If a = 0.05, what conclusion can you draw? 

(d) Plot the residuals from the model in part (a) versus y. 
Does the plot reveal any inadequacies? 

(e) Construct a normal probability plot of the residuals. 
Comment on the normality assumption. 
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Observation 
Number 


= = 
Fe O O oco -)10 tan Ut: r2 — 


12-20 


12-6 ASPECTS OF MULTIPLE REGRESSION MODELING 


20640 
20280 
19860 
18980 
18100 
20740 
20305 
19961 
18916 
18012 
20520 
20130 
19780 
19020 
18030 
20680 
20340 
19860 
18950 
18700 
20500 
20050 
19680 
18890 
17870 
20540 
20160 
19750 
18890 
17870 
20460 
20010 
19640 
18710 
17780 
20520 
20150 
19750 
18850 
17910 


30250 
30010 
29780 
29330 
28960 
30083 
29831 
29604 
29088 
28675 
30120 
29920 
29720 
29370 
28940 
30160 
29960 
29710 
29250 
28890 
30190 
29960 
29770 
29360 
28960 
30160 
29940 
29760 
29350 
28910 
30180 
29940 
29750 
29360 
28900 
30170 
29950 
29740 
29320 
28910 
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184 
164 
144 
216 
206 
196 
171 
149 
195 
190 
180 
161 
139 
208 
199 
187 
164 
145 
193 
183 
173 
153 
134 
193 
184 
173 
153 
133 
198 
186 
178 
156 
136 
197 
188 
178 
156 
137 


1732 
1697 
1662 
1598 
1541 
1709 
1669 
1640 
1572 
1522 
1740 
1711 
1682 
1630 
1532 
1704 
1679 
1642 
1576 
1528 
1748 
1713 
1684 
1624 
1569 
1746 
1714 
1679 
1621 
1561 
1729 
1692 
1667 
1609 
1552 
1758 
1729 
1690 
1616 
1569 
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100 


101 
100 
100 
100 
101 
98 
96 
94 
94 
94 
101 
100 
100 
99 
100 
99 
99 
99 
99 
99 
102 
101 
101 
101 
101 
100 
99 
99 
99 
100 
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12-69. Consider the jet engine thrust data in Exercise 12-64. 

(a) Use all possible regressions to select the best regression 
equation, where the model with the minimum value of 
MS; is to be selected as “best.” 

(b) Repeat part (a) using the Cp criterion to identify the best 
equation. 

(c) Use stepwise regression to select a subset regression model. 

(d) Compare the models obtained in parts (a), (b), and (c) 
above. 


12-70. Consider the electronic inverter data in Exercise 12-66 

and 12-67. Define the response and regressors variables as in 

Exercise 12-67, and delete the second observation in the sample. 

(a) Use all possible regressions to find the equation that mini- 
mizes C,. 

(b) Use all possible regressions to find the equation that 
minimizes MSp. 

(c) Use stepwise regression to select a subset regression model. 

(d) Compare the models you have obtained. 


12-71. Consider the three-variable regression model for the 
jet engine thrust data in Exercise 12-65. Calculate the variance 
inflation factors for this model. Would you conclude that mul- 
ticollinearity is a problem in this model? 


12-72. A multiple regression model was used to relate y = 

viscosity of a chemical product to x, = temperature and x, = 

reaction time. The data set consisted of n = 15 observations. 

(a) The estimated regression coefficients were By = 300.00, 
B, = 0.85, and B, = 10.40. Calculate an estimate of 
mean viscosity when x, = 100?F and x, = 2 hours. 

(b) The sums of squares were SS; = 1230.50 and SS; = 
120.30. Test for significance of regression using a — 
0.05. What conclusion can you draw? 


(c) What proportion of total variability in viscosity is 
accounted for by the variables in this model? 

(d) Suppose that another regressor, x4 — stirring rate, is added 
to the model. The new value of the error sum of squares 1s 
SS; = 117.20. Has adding the new variable resulted in a 
smaller value of MS,? Discuss the significance of this 
result. 

(e) Calculate an F-statistic to assess the contribution of x; to 
the model. Using a — 0.05, what conclusions do you 
reach? 


12-75.  Anarticle in the Journal of the American Ceramics 
Society (Vol. 75, 1992, pp. 112-116) describes a process for 
immobilizing chemical or nuclear wastes in soil by dissolving 
the contaminated soil into a glass block. The authors mix CaO 
and Na,O with soil and model viscosity and electrical conduc- 
tivity. The electrical conductivity model involves six regres- 
sors, and the sample consists of n = 14 observations. 

(a) For the six-regressor model, suppose that SS; = 0.50 and 
R? = 0.94. Find SS, and SSp, and use this information to 
test for significance of regression with a = 0.05. What are 
your conclusions? 

(b) Suppose that one of the original regressors is deleted from 
the model, resulting in R? — 0.92. What can you conclude 
about the contribution of the variable that was removed? 
Answer this question by calculating an F-statistic. 

(c) Does deletion of the regressor variable in part (b) result in 
a smaller value of MS; for the five-variable model, in 
comparison to the original six-variable model? Comment 
on the significance of your answer. 
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MIND-EXPANDING EXERCISES 


12-74. Consider a multiple regression model with K re- 
gressors. Show that the test statistic for significance of 
regression can be written as 


rM R?/k 
07 (1 = R2y(n — k- 1) 

Suppose that n = 20, k = 4, and R? = 0.90. Ifa = 0.05, 

what conclusion would you draw about the relationship 

between y and the four regressors? 


12-75. A regression model is used to relate a response 

y to k = A regressors with n = 20. What is the smallest 
value of R? that will result in a significant regression if 
a = 0.05? Use the results of the previous exercise. Are 
you surprised by how small the value of R? is? 


12-76. Show that we can express the residuals from a 
multiple regression model ase = (I — H)y, where H = 
X(X'X) 'X'’. 

12-77. Show that the variance of the ith residual e, in 
a multiple regression model is o(1 — h;) and that the 
covariance between e; and e; is — o?h;, where the h’s are 
the elements of H = X(X X) !X'. 

12-78. Consider the multiple linear regression model 
y = Xp + e. If f denotes the least squares estimator of 
B, show that Ê = B + Re, where R = (X'X) !X'. 
12-79. Constrained Least Squares. Suppose we wish 
to find the least squares estimator of B in the model y = 
Xp + e subject to a set of equality constraints, say, 
TB =c. 


ij? 


IMPORTANT TERMS AND CONCEPTS 


Confidence interval on 
the mean response 


In the E-book, click on any 
term or concept below to 


(a) Show that the estimator is 


B. = Ê + (XX)! 
x T''T(X'X)'T'] (c — TB) 


where B = (X’X)!X’y. 
(b) Discuss situations where this model might be appro- 
priate. 


12-80. Piecewise Linear Regression (I). Suppose 
that y is piecewise linearly related to x. That is, different 
linear relationships are appropriate over the intervals 
—o <x x x* and x* < x < o, Show how indicator 
variables can be used to fit such a piecewise linear re- 
gression model, assuming that the point x* is known. 


12-81. Piecewise Linear Regression (II). Consider 
the piecewise linear regression model described in 
Exercise 12-79. Suppose that at the point x* a disconti- 
nuity occurs in the regression function. Show how 
indicator variables can be used to incorporate the dis- 
continuity into the model. 


12-82.  Piecewise Linear Regression (IIT). Consider 
the piecewise linear regression model described in 
Exercise 12-79. Suppose that the point x* is not known 
with certainty and must be estimated. Suggest an ap- 
proach that could be used to fit the piecewise linear 
regression model. 


Significance of 
regression 


in multiple 
regression 
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go to that subject. 
All possible regressions 
Analysis of variance 
test in multiple 
regression 
Categorical variables as 
regressors 


Extra sum of squares 
method 

Inference (test and 
intervals) on individ- 
ual model parameters 

Influential observations 

Model parameters and 
their interpretation 


Outliers 

Polynomial terms in a 
regression model 

Prediction interval on a 
future observation 

Residual analysis and 
model adequacy 
checking 


Stepwise regression and 
related methods 


CD MATERIAL 

Ridge regression 

Nonlinear regression 
models 


12-2.3 More about the Extra Sum of Squares Method (CD Only) 


The extra sum of squares method for evaluating the contribution of one or more terms to a 
model is a very useful technique. Basically, one considers how much the regression or model 
sum of squares increases upon adding terms to a basic model. The expanded model is called 
the full model, and the basic model is called the reduced model. Although the development in 
the text is quite general, Example 12-5 illustrates the simplest case, one in which there is only 
one additional parameter in the full model. In this case, the partial F-test based on the extra 
sum of squares is equivalent to a t-test. When there is more than one additional parameter in 
the full model, the partial F-test is not equivalent to a t-test. 

The extra sum of squares method is often used sequentially when fitting a polynomial 
model, such as 


y = Bo + Bix, + Box, + B3x3 + 0 
Here SSp(B; | Bo) would measure the contribution of the linear term over and above a model 
containing only a mean (Bo), SS(B2 | Bo,8;) would measure the contribution of the quadratic 
terms over and above the linear, and SSp(B3 | Bo,81,;) would measure the contribution of the 
cubic terms over and above the linear and the quadratic. This can be very useful in selecting 
the order of the polynomial to fit. Notice from Table 12-4 that Minitab automatically pro- 


duces this sequential computation. Also, note that in a sequential partition of the model or re- 
gression sum of squares, 


SSr(B1,B2,83, | Bo) = SSa(Bi | Bo) + SSr(B2 | Bo.B1) + SSe(B3 | Bo.Bi.Bz). 


However, if we consider each variable as if it were the last to be added, 


SSr(B 182,83 | Bo) zs SSr(Bi | Bo,B2,B3) + SSr(Bo | Bo. B1. B3) SSr(B3 | Bo,B1,B2) 


As another illustration of the extra sum of squares method, consider the model 


_ 2 2 
y = Bo + Bixi + Box? + Bioxio + By xT + Boo + € 


Suppose that we are uncertain about the contribution of the second-order terms. We could 
evaluate this with a partial F-test by fitting the reduced model 


y = Bo + Bixi + Box. + € 


and computing 


SSR(B12B11B22 | Bo.B1.B2) = SSR(B1B2B12B11B22 | Bo) — SSr(B1,B2 | Bo) 


Finally, note that we have expressed the extra sum of squares as the difference in the regres- 
sion sum of squares between the full model and the reduced model: 


SS, (Extra) = SSp(Full Model) — SS; (Reduced Model) 


Some authors write SS,(Extra) as the difference between error or residual sums of squares for 
the two model. 
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Now 

SS, = SSp(Full Model) + SSz(Full Model) 
and 

SS; = SSp(Reduced Model) + SS (Reduced Model) 
SO 


SSp(Extra) = SSp(Full Model) — SSp(Reduced Model) 
= SS; — SS,-(Full Model) — [ SS; — SS pg(Reduced Model) | 
= SS;,(Reduced Model) — SS;(Full Model) 


Therefore, this is an equivalent way to write the extra sum of squares. 


12-6.5 Ridge Regression (CD Only) 


Since multicollinearity primarily affects the stability of the regression coefficients, it would 
seem that estimating these parameters by some method that is less sensitive to multicollinear- 
ity than ordinary least squares would be helpful. Several methods have been suggested. One 
alternative to ordinary least squares, ridge regression, can be useful in combating multi- 
collinearity. In ridge regression, the parameter estimates are obtained from 


p*(0) = (X'X + oD) !X'y (S12-1) 


where 0 = 0 is a constant. Generally, values of 0 in the interval 0 S 0 < 1 are appropriate. 
The ridge estimator B*(0) is not an unbiased estimator of p, as is the ordinary least squares 
estimator f$. Thus, ridge regression seeks to find a set of regression coefficients that are more 
"stable," in the sense that they have a small mean square error. Since multicollinearity usually 
results in ordinary least squares estimators that may have extremely large variances, ridge 
regression is suitable for situations where the multicollinearity problem exists. 

To obtain the ridge regression estimator from Equation S12-1, we must specify a value 
for the constant 0. Generally, there is an “optimum” 0 for any problem, but the simplest ap- 
proach is to solve Equation S12-1 for several values of 0 in the interval 0 = 0 S 1. Then a 
plot of the value of B *(0) is constructed. This display is called a ridge trace. The approximate 
value for 0 is chosen subjectively by inspection of the ridge trace. Typically, its value 1s cho- 
sen to obtain stable parameter estimates. Generally, the variance of BB *(0) is a decreasing func- 
tion of 0, while the squared bias [B — E(B*(0))]* is an increasing function of 0. Choosing the 
value of 0 involves trading off these two properties of B *(0). 

Extensive practical discussions of the use of ridge regression are in Montgomery, 
Peck, and Vining (2001) and Myers (1990). In addition, several other biased estimation 
techniques have been proposed for dealing with multicollinearity. Many regression com- 
puter packages incorporate ridge regression capability. SAS PROC REG will fit ridge 
regression models. 

To illustrate ridge regression, consider the data in Table 812-1, which shows the heat gen- 
erated in calories per gram for a particular type of cement as a function of the quantities of four 
additives (w,, w», w3, and w,). We wish to fit a multiple linear regression model to these data. 
This is some very “classical” regression data, first analyzed by Anders Hald. Refer to 
Montgomery, Peck, and Vining (2001) for sources and more details. 
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Table S12-1 Cement Data 


Observation 

Number y W| W» Wa W4 
1 28.25 10 31 5 45 

2 24.80 12 35 5 52 

3 11.86 5 15 3 24 

4 36.60 17 42 9 65 

5 15.80 8 6 5 19 

6 16.23 6 17 3 25 

T 29.50 12 36 6 55 

8 28.75 10 34 5 50 

9 43.20 18 40 10 70 

10 38.47 23 50 10 80 

11 10.14 16 3T 5 61 

12 38.92 20 40 11 70 

13 36.70 15 45 8 68 

14 15.31 7 22 2 30 

15 8.40 9 12 3 24 


EY m i=1,2,...,15 j=1,2,3,4 


J. 


where S; = Ew = wj is the corrected sum of squares of the levels of w,. The coded data 


are shown in Table S12-2. This transformation makes the column of ones in X orthogonal to the 


Table S12-2 Coded Cement Data 


Observation 
Number y Xi 9 às X4 
1 28.25 = 12515 .00405 —.09206 —.05538 
2 24.80 = 02635 08495 — .09206 03692 
3 11.86 = 37/217 = 3195) — 27614 — .03692 
4 36.60 .22066 .22653 .27617 —.33226 
5 15.80 — 22396 = 50161 — .09206 — 39819 
6 16.23 = 322760 —21912 = 21617 = 31907 
7 29.50 —.02635 .10518 .00000 .07647 
8 28.75 = 12515 .06472 —.09206 .01055 
9 43.20 .27007 .18608 .36823 .27425 
10 38.47 .51709 .38834 .36823 .40609 
11 10.14 .17126 .12500 — .09206 15556 
12 38.92 .36887 .18608 .46029 .27425 
13 36.70 .12186 .28721 .18411 .24788 
14 15.31 —.27336 —.17799 — 36523 —25315 


15 8.40 — .17456 =,30025 = 2101) 7.99220 
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other columns, so the intercept in this model will always be estimated by y. The (4 X 4) X'X 
matrix for the four coded variables 1s the correlation matrix 


1.00000 0.84894 0.91412 0.93367 
0.84894 1.00000 0.76899 0.97567 
0.91412 0.76899 1.00000 0.86784 
0.93367 0.97567 0.86784 1.00000 


/ 


This matrix contains several large correlation coefficients, and this may indicate significant 
multicollinearity. The inverse of X'X is 


20.769 25.813 —0.608 | —44.042 
25.813 74.486 12.597 —107.710 
— 0.608 12,597 8.274 | —18.903 
—44.042  —107.710 —18.903 163.620 


[XX i 


The variance inflation factors are the main diagonal elements of this matrix. Note that three of 
the variance inflation factors exceed 10, a good indication that multicollinearity is present. 

Equation 812-1 was solved for various values of 0, and the results are summarized in 
Table S12-3. The ridge trace is shown in Fig. S12-1. The instability of the least squares estimates 
p;*(0 = 0) is evident from inspection of the ridge trace. It is often difficult to choose a value of 
0 from the ridge trace that simultaneously stabilizes the estimates of all regression coefficients. 
Because the bias in the coefficients increases as 0 increases, it 1s usually best to choose 0 as small 
as possible; yet we want 0 to be large enough to provide reasonable stability in the coefficients. 
In our example, most of the change in the regression coefficients has occurred when 0.05 < 0 
< 0.1. We will choose 0 = 0.064, which implies that the regression model is 


y = 25.53 — 18.0566x, + 17.2202x; + 36.0743x4 + 4.7242x, 
using By = y = 25.53. Converting the model to the original variables w;, we have 
y = 2.9913 — 0.8920w, + 0.3483w, + 3.3209w, — 0.0623w, 


This is the ridge regression model for the cement data. 


Table S12-3 Ridge Regression Estimates for the Cement Data 


0 Bi(@) B5(9) B3(0) Ba(0) 
.000 —28.3318 65.9996 64.0479 — 57.2491 
.001 — 31.0360 57.0244 61.9645 —44.0901 
.002 — 32.6441 50.9649 60.3899 —35.3088 
.004 —34.1071 43.2358 58.0266 — 24.3241 
.008 —34.3195 35.1426 54.7018 — 13.3348 
.016 =31.9710 27.9534 50.0949 —4.5489 
.032 —26.3451 22.0347 43.8309 1.2950 
.064 — 18.0566 17:2202 36.0743 4.7242 
.128 —9.1786 13.4944 277.9363 6.5914 
.256 — 1.9896 10.9160 20.8028 7.5076 
212 2.4922 9.2014 15.3197 7.1224 
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Figure 512-1 Ridge trace for the cement data. 


12-6.6 Nonlinear Regression 


Linear regression models provide a rich and flexible framework that work extremely well in 
many problems in engineering and science. However, linear regression models are not appro- 
priate for all situations. There are many problems where the response variable and the predic- 
tor variables are related through a known nonlinear function. This leads to a nonlinear 
regression model. When the method of least squares is applied to such models, the resulting 
normal equations are nonlinear and, in general, difficult to solve. The usual approach is to 
directly minimize the residual sum of squares by an iterative procedure. We now give a very 
brief introduction to nonlinear regression models. 


Linear or Nonlinear Models 
We have focused in Chapter 12 on the linear regression model 


Y= Bo + Dix, + Dx» PH eee ee Dix, +e (S12-2) 


These models can include not only the first-order relationships, such as Equation S12-2, but 
polynomial models, and other more complex relationships as well. In fact, we could write the 
linear regression model as 


Y= Do + Dizi F oz» FP cues F B.Z, te (812-3) 


where z; represents any function of the original regressors xj, x», ... , X}, including trans- 
formations such as exp(x;), Vx; and sin(x;). These models are called linear regression models 
because they are linear in the unknown parameters, the B, j = 1, 2,..., K. 

We may write the linear regression model (Equation S12-2) in a general form as 


Y-x'B-e 


f(x, B) + « (S12-4) 
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where x’ = [1, xj, X2, ... , X4]. Since the expected value of the model errors is zero, the ex- 
pected value of the response variable is 


E(Y) = E|f(x, B) + €] 
= f(x B) 


We usually refer to f(x, B) as the expectation function for the model. Obviously, the expec- 
tation function here is just a linear function of the unknown parameters. 

Any model that is not linear in the unknown parameters is a nonlinear regression model. 
For example, the model 


Y = 04e" + € (S12-5) 


is not linear in the unknown parameters 0, and 0,. We will use the symbol 6 to represent a pa- 
rameter in a nonlinear model to emphasize the difference between the linear and the nonlinear 
case. Nonlinear models often arise in cases where the relationship between the response and 
the regressors 1s a differential equation or the solution to a differential equation. 

In general, we will write the nonlinear regression model as 


Y - f(x, 0) - € (S12-6) 


where 9 is a p X 1 vector of unknown parameters, and e is an uncorrelated random error term 
with E(e) = 0 and Var(e) = o°. We also typically assume that the errors are normally distrib- 
uted, as in linear regression. Since 


= f(x, 8) (S12-7) 


we call f(x, 0) the expectation function for the nonlinear regression model. This is very sim- 
ilar to the linear regression case, except that now the expectation function is a nonlinear func- 
tion of the parameters. 

In a nonlinear regression model, at least one of the derivatives of the expectation function 
with respect to the parameters depends on at least one of the parameters. In linear regression, 
these derivatives are not functions of the unknown parameters. To illustrate these points, con- 
sider a linear regression model 


Y = Bo + Pp + Doo + +e + Bx +e 


with expectation function f(x, B) = Bo + Ši Bix; Now 


af (x, B) — m 
3B, = ys j=0,1,...,k 


where x; = 1. Notice that in the linear case the derivatives are not functions of the B’s. 
Now consider the nonlinear model 


y=f(x,0) +e 
= 0,e* te 
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The derivatives of the expectation function with respect to 0, and 0, are 


of (x, 8) i 
00, 
and 
af (x, 0) T 
30, — xe 


Since the derivatives are a function of the unknown parameters 0, and 0,, the model is nonlinear. 


Parameter Estimation in a Nonlinear Model 

Suppose that we have a sample of n observations on the response and the regressors, say y;, 
Xi Xp, Xg, fori = 1,2,..., n. We have observed previously that the method of least squares 
in linear regression involves minimizing the least-squares function 


n k 2 
L(B) = 2, b» E (6, + 2 23] 
Because this is a linear regression model, when we differentiate S(B) with respect to the 
unknown parameters and equate the derivatives to zero, the resulting normal equations are 
linear equations, and consequently, they are easy to solve. 

Now consider the nonlinear regression situation. The model is 


Y; = f(x; 9) + ej i= 1,2,...,n 


where now x; = [1, x3, x5, ..., Xg] fori = 1, 2,..., n. The least squares function is 


n 


L(9) = X v; - f(x, 9) (S12-8) 
i=l 
To find the least squares estimates we must differentiate Equation S12-8 with respect to each 
element of 0. This will provide a set of p normal equations for the nonlinear regression situa- 
tion. The normal equations are 


-0  foj-12,..,p (S12-9) 


In a nonlinear regression model the derivatives in the large square brackets will be functions 
of the unknown parameters. Furthermore, the expectation function is also a nonlinear func- 
tion, so the normal equations can be very difficult to solve. 

To illustrate this point, consider the nonlinear regression model in Equation S12-5: 


Y= 0,e*2* te 


The least squares normal equations for this model are 


n 


> by = 6, c9» pori = () 


i=] 
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After simplification, the normal equations are 


n n 
6x; A 26,x, — 
Sues — 6, Hehe =o 
i=l i=l 
n 


n 
ó Xj — A 26 X; — 
M yixie : 0, X xe zuo 0 


i=l i=l 


These equations are not linear in 6, and 6,5, and no simple closed-form solution exists. In gen- 
eral, iterative methods must be used to find the values of 6, and 6). To further complicate the 
problem, sometimes there are multiple solutions to the normal equations. That is, there are 
multiple stationary values for the residual sum of squares function L(@). 

A widely used method for nonlinear regression 1s linearization of the nonlinear function 
followed by the Gauss-Newton iteration method of parameter estimation. Linearization 1s ac- 
complished by a Taylor series expansion of f(x; 0) about the point 0) = [0;5, 929, ... , 9,0] 
with only the linear terms retained. This yields 


Of (x;, 0 
fxi 9) = f(x; 09) + > EM (0; — 80) (812-10) 
j= J 0—0, 
If we set 

i = f (xj, 90) 

Bj = 6; — % 
0 _ E 
Í 00; Je=0, 


we note that the nonlinear regression model can be written as 
p 
yJ = X BZ} te,  i=1l,2,...,n (S12-11) 
yc 


That is, we now have a linear regression model. We usually call 0, the starting values for the 
parameters. 
We may write Equation 812-10 as 


Yo = Zopo + € (S12-12) 
so the estimate of p, is 
Bo = (ZoZo) 'Zoyo 
= (ZoZo) ‘Loy — fo) 
Now since Bo = 0 — 0,, we could define 
0, = Bo + 6, 


as revised estimates of 0. Sometimes Bo is called the vector of increments. We may now 
replace the revised estimates 0, in Equation 812-10 (in the same roles played by the initial 
estimates 05) and then produce another set of revised estimates, say 05 and so forth. 
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In general, we have at the kth iteration 


where 


DoD Bui 
0, = [bie 9245 ..., Opl 


This iterative process continues until convergence, that is, until 
(0 44 B 059/95. = 0, J = D. eee 


where 8 is some small number, say, 1.0 X 10 6. At each iteration the residual sum of squares 
L(6,) should be evaluated to ensure that a reduction in its value has been obtained. This 
scheme is available in many software packages. The SAS procedure is PROC NLIN. There 
are also several widely used variations of this procedure. For details and examples refer to 
Montgomery, Peck, and Vining (2001) and Myers (1990). 
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After careful study of this chapter, you should be able to do the following: 


1. Design and conduct engineering experiments involving a single factor with an arbitrary number 
of levels 


NW OM BB W WN 


. Assess model adequacy with residual plots 


. Understand how the analysis of variance is used to analyze the data from these experiments 


. Use multiple comparison procedures to identify specific differences between means 
. Make decisions about sample size in single-factor experiments 
. Understand the difference between fixed and random factors 


. Estimate variance components in an experiment involving random factors 
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8. Understand the blocking principle and how it is used to isolate the effect of nuisance factors 


9. Design and conduct experiments involving the randomized complete block design 


CD MATERIAL 
10. Use operating characteristic curves to make sample size decisions in single-factor random effects 
experiment 


11. Use Tukey's test, orthogonal contrasts and graphical methods to identify specific differences 
between means. 


Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-Text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e- Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e-Text immediately following the section they accompany. 


13-1 DESIGNING ENGINEERING EXPERIMENTS 


Experiments are a natural part of the engineering and scientific decision-making process. 
Suppose, for example, that a civil engineer 1s investigating the effects of different curing methods 
on the mean compressive strength of concrete. The experiment would consist of making up sev- 
eral test specimens of concrete using each of the proposed curing methods and then testing the 
compressive strength of each specimen. The data from this experiment could be used to determine 
which curing method should be used to provide maximum mean compressive strength. 

If there are only two curing methods of interest, this experiment could be designed and 
analyzed using the statistical hypothesis methods for two samples introduced in Chapter 10. 
That is, the experimenter has a single factor of interest—curing methods—and there are only 
two levels of the factor. If the experimenter is interested in determining which curing method 
produces the maximum compressive strength, the number of specimens to test can be deter- 
mined from the operating characteristic curves in Appendix Chart VI, and the t-test can be 
used to decide if the two means differ. 

Many single-factor experiments require that more than two levels of the factor be con- 
sidered. For example, the civil engineer may want to investigate five different curing methods. 
In this chapter we show how the analysis of variance (frequently abbreviated ANOVA) can 
be used for comparing means when there are more than two levels of a single factor. We will 
also discuss randomization of the experimental runs and the important role this concept plays 
in the overall experimentation strategy. In the next chapter, we will show how to design and 
analyze experiments with several factors. 

Statistically based experimental design techniques are particularly useful in the engineering 
world for improving the performance of a manufacturing process. They also have extensive 
application in the development of new processes. Most processes can be described in terms of 
several controllable variables, such as temperature, pressure, and feed rate. By using designed 
experiments, engineers can determine which subset of the process variables has the greatest 
influence on process performance. The results of such an experiment can lead to 


1. Improved process yield 


2. Reduced variability in the process and closer conformance to nominal or target 
requirements 


Reduced design and development time 


Reduced cost of operation 
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Experimental design methods are also useful in engineering design activities, where new 
products are developed and existing ones are improved. Some typical applications of statisti- 
cally designed experiments in engineering design include 


Evaluation and comparison of basic design configurations 
Evaluation of different materials 


Selection of design parameters so that the product will work well under a wide vari- 
ety of field conditions (or so that the design will be robust) 


4. Determination of key product design parameters that affect product performance 


The use of experimental design in the engineering design process can result in products that 
are easier to manufacture, products that have better field performance and reliability than their 
competitors, and products that can be designed, developed, and produced in less time. 

Designed experiments are usually employed sequentially. That is, the first experiment 
with a complex system (perhaps a manufacturing process) that has many controllable variables 
is often a screening experiment designed to determine which variables are most important. 
Subsequent experiments are used to refine this information and determine which adjustments 
to these critical variables are required to improve the process. Finally, the objective of the ex- 
perimenter is optimization, that 1s, to determine which levels of the critical variables result in 
the best process performance. 

Every experiment involves a sequence of activities: 


1. Conjecture—the original hypothesis that motivates the experiment. 
2. Experiment—the test performed to investigate the conjecture. 

3. Analysis—the statistical analysis of the data from the experiment. 
4 


Conclusion—what has been learned about the original conjecture from the experi- 
ment. Often the experiment will lead to a revised conjecture, and a new experiment, 
and so forth. 


The statistical methods introduced in this chapter and Chapter 14 are essential to good 
experimentation. All experiments are designed experiments; unfortunately, some of them 
are poorly designed, and as a result, valuable resources are used ineffectively. Statistically 
designed experiments permit efficiency and economy in the experimental process, and the 
use of statistical methods in examining the data results in scientific objectivity when draw- 
ing conclusions. 


13-2 THE COMPLETELY RANDOMIZED SINGLE-FACTOR 


EXPERIMENT 


13-2.1 An Example 


A manufacturer of paper used for making grocery bags is interested in improving the tensile 
strength of the product. Product engineering thinks that tensile strength is a function of the 
hardwood concentration in the pulp and that the range of hardwood concentrations of practi- 
cal interest is between 5 and 20%. A team of engineers responsible for the study decides to in- 
vestigate four levels of hardwood concentration: 596, 10%, 15%, and 20%. They decide to 
make up six test specimens at each concentration level, using a pilot plant. All 24 specimens 
are tested on a laboratory tensile tester, in random order. The data from this experiment are 
shown in Table 13-1. 
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Table 13-1 Tensile Strength of Paper (psi) 


Hadrood Observations 
Concentration (%) 1 2 3 E 5 6 Totals Averages 
3 7 8 15 11 9 10 60 10.00 
10 12 17 13 18 19 15 94 15:67 
15 14 18 19 17 16 18 102 17.00 
20 19 25 22 23 18 20 127 21.17 


383 15.96 


This is an example of a completely randomized single-factor experiment with four levels 
of the factor. The levels of the factor are sometimes called treatments, and each treatment has 
six observations or replicates. The role of randomization in this experiment is extremely im- 
portant. By randomizing the order of the 24 runs, the effect of any nuisance variable that may 
influence the observed tensile strength is approximately balanced out. For example, suppose 
that there is a warm-up effect on the tensile testing machine; that is, the longer the machine is 
on, the greater the observed tensile strength. If all 24 runs are made in order of increasing 
hardwood concentration (that is, all six 5% concentration specimens are tested first, followed 
by all six 1096 concentration specimens, etc.), any observed differences in tensile strength 
could also be due to the warm-up effect. 

It is important to graphically analyze the data from a designed experiment. Figure 13-1(a) 
presents box plots of tensile strength at the four hardwood concentration levels. This figure 
indicates that changing the hardwood concentration has an effect on tensile strength; specif- 
ically, higher hardwood concentrations produce higher observed tensile strength. 
Furthermore, the distribution of tensile strength at a particular hardwood level is reasonably 
symmetric, and the variability in tensile strength does not change dramatically as the hard- 
wood concentration changes. 


30 


25 


20 


15 


Tensile strength (psi) 


10 


0 5 10 15 20 ur H + T2 u H + 73 U + T4 
Hardwood concentration (%) PI U2 u3 U4 
(a) (b) 


Figure 13-1 (a) Box plots of hardwood concentration data. (b) Display of the model in Equation 13-1 for the completely ran- 
domized single-factor experiment. 
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Graphical interpretation of the data is always useful. Box plots show the variability 
of the observations within a treatment (factor level) and the variability between treatments. 
We now discuss how the data from a single-factor randomized experiment can be analyzed 
statistically. 


13-2.2 The Analysis of Variance 


Suppose we have a different levels of a single factor that we wish to compare. Sometimes, 
each factor level is called a treatment, a very general term that can be traced to the early 
applications of experimental design methodology in the agricultural sciences. The response 
for each of the a treatments is a random variable. The observed data would appear as shown 
in Table 13-2. An entry in Table 13-2, say y;, represents the jth observation taken under treat- 
ment i. We initially consider the case in which there are an equal number of observations, n, 
on each treatment. 
We may describe the observations in Table 13-2 by the linear statistical model 


] LO d 


-atuteli y. (13-1) 


where Y; is a random variable denoting the (ij)th observation, p is a parameter common to all 
treatments called the overall mean, 7; is a parameter associated with the ith treatment called 
the ith treatment effect, and €, is a random error component. Notice that the model could 
have been written as 


Y " i—1,2,...,a 
JE LEO TTE 


where p; = u + 7; is the mean of the ith treatment. In this form of the model, we see 
that each treatment defines a population that has mean u;, consisting of the overall mean p 
plus an effect v; that is due to that particular treatment. We will assume that the errors e; 
are normally and independently distributed with mean zero and variance o^. Therefore, 
each treatment can be thought of as a normal population with mean w; and variance o?. See 
Fig. 13-1(b). 

Equation 13-1 is the underlying model for a single-factor experiment. Furthermore, since 
we require that the observations are taken in random order and that the environment (often 
called the experimental units) in which the treatments are used is as uniform as possible, this 
experimental design is called a completely randomized design. 


Table 13-2 Typical Data for a Single-Factor Experiment 


Treatment Observations Totals Averages 
l yu X12 ee Yin Yı- Yı- 
2 Yn 22 ee Yan X2. Y2- 
a Yal Ya2 eas Yan Ya Vo 


13-2 THE COMPLETELY RANDOMIZED SINGLE-FACTOR EXPERIMENT 473 


The a factor levels in the experiment could have been chosen in two different ways. 
First, the experimenter could have specifically chosen the a treatments. In this situation, we 
wish to test hypotheses about the treatment means, and conclusions cannot be extended to 
similar treatments that were not considered. In addition, we may wish to estimate the treat- 
ment effects. This is called the fixed-effects model. Alternatively, the a treatments could be 
a random sample from a larger population of treatments. In this situation, we would like to 
be able to extend the conclusions (which are based on the sample of treatments) to all treat- 
ments in the population, whether or not they were explicitly considered in the experiment. 
Here the treatment effects 7; are random variables, and knowledge about the particular ones 
investigated is relatively unimportant. Instead, we test hypotheses about the variability of 
the 7; and try to estimate this variability. This is called the random effects, or components 
of variance, model. 

In this section we develop the analysis of variance for the fixed-effects model. The 
analysis of variance is not new to us; it was used previously in the presentation of regression 
analysis. However, in this section we show how it can be used to test for equality of treatment 
effects. In the fixed-effects model, the treatment effects 7; are usually defined as deviations 
from the overall mean p, so that 


a 


a 
| 
c 


(13-2) 
Ml 

Let y; represent the total of the observations under the ith treatment and y;. represent the average 

of the observations under the ith treatment. Similarly, let y.. represent the grand total of all obser- 

vations and y.. represent the grand mean of all observations. Expressed mathematically, 


n 
Yi = X yj V; = y;./n i—-1,2,...,a 
j=l 


a n 


y. = >» > Vij Y.. = y../N (13-3) 
Pel. ye 
where N = an is the total number of observations. Thus, the “dot” subscript notation implies 
summation over the subscript that it replaces. 
We are interested in testing the equality of the a treatment means p, W2, ... , Wa Using 
Equation 13-2, we find that this 1s equivalent to testing the hypotheses 


Ao: 74 =m =: =7,=0 
H,:1,# 0 for at least one; (13-4) 


Thus, if the null hypothesis is true, each observation consists of the overall mean p plus a 
realization of the random error component e;. This is equivalent to saying that all N 
observations are taken from a normal distribution with mean p and variance o^. Therefore, 
if the null hypothesis is true, changing the levels of the factor has no effect on the mean 
response. 

The ANOVA partitions the total variability in the sample data into two component parts. 
Then, the test of the hypothesis in Equation 13-4 is based on a comparison of two independ- 
ent estimates of the population variance. The total variability in the data is described by the to- 
tal sum of squares 


a n 


SS; = > by (yi T y.y 


i=1 j=l 
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Definition 


The partition of the total sum of squares is given in the following definition. 


The sum of squares identity is 


a n n 


py p» (yi E yer je . > (Vi EE var 5.5) 


= =] j=l 


or symbolically 


SST = 5) Treatments Ys SSE 





The identity in Equation 13-5 (which is developed in Section 13-4.4 on the CD) shows 
that the total variability in the data, measured by the total corrected sum of squares SS;, can be 
partitioned into a sum of squares of differences between treatment means and the grand mean 
denoted SSy,eatments and a sum of squares of differences of observations within a treatment from 
the treatment mean denoted SS;. Differences between observed treatment means and the 
grand mean measure the differences between treatments, while differences of observations 
within a treatment from the treatment mean can be due only to random error. 

We can gain considerable insight into how the analysis of variance works by examining 
the expected values of SStyeatments and SS. This will lead us to an appropriate statistic for test- 
ing the hypothesis of no differences among treatment means (or all 7; — 0). 


The expected value of the treatment sum of squares is 


a 


JE SIS Treaments) = (a = 1)o? a p T? 
i-l 


and the expected value of the error sum of squares 1s 


E(SSz) = a(n — 1)o* 





There is also a partition of the number of degrees of freedom that corresponds to the sum 
of squares identity in Equation 13-5. That is, there are an = N observations; thus, SS; has 
an — | degrees of freedom. There are a levels of the factor, so SStreatments has a — 1 degrees of 
freedom. Finally, within any treatment there are n replicates providing n — 1 degrees of free- 
dom with which to estimate the experimental error. Since there are a treatments, we have 
a(n — 1) degrees of freedom for error. Therefore, the degrees of freedom partition 1s 


an— Lea ian 1) 
The ratio 


MS Treatments — SS Treatments! (a B 1) 


is called the mean square for treatments. Now if the null hypothesis Hg: T) = tT = 

-= 7, = Q is true, MSrreaments iS an unbiased estimator of o? because X.,7; = 0. However, 
if H, is true, MSrreaments estimates o? plus a positive term that incorporates variation due to the 
systematic difference in treatment means. 


Definition 
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Note that the error mean square 
MS; = SS;/{a(n — 1)] 
is an unbiased estimator of o? regardless of whether or not H is true. We can also show that 


MS treatments 2d MS; are independent. Consequently, we can show that if the null hypothesis Ho 
Is true, the ratio 


_ SS Treatments/(@ — 1) a MS Treatments 


on SSz/[a(n — 1)] |»... MSg 





has an F-distribution with a — 1 and a(n — 1) degrees of freedom. Furthermore, from the ex- 
pected mean squares, we know that MS; is an unbiased estimator of o?. Also, under the null 
hypothesis, MSreaments is an unbiased estimator of o”. However, if the null hypothesis is false, 
the expected value of MSyreatments is greater than o7. Therefore, under the alternative hypothe- 
sis, the expected value of the numerator of the test statistic (Equation 13-7) is greater than the 
expected value of the denominator. Consequently, we should reject Hj if the statistic is large. 
This implies an upper-tail, one-tail critical region. Therefore, we would reject H if 
Jo > fa;a-1,a(n-1) Where fo is the computed value of F from Equation 13-7. 

Efficient computational formulas for the sums of squares may be obtained by expanding 
and simplifying the definitions of SStreatments and SS. This yields the following results. 


The sums of squares computing formulas for the ANOVA with equal sample sizes in 
each treatment are 


a n 


2 
Ve 
Soe = p» p» DE 77 (13-8) 
i=] j=l 


yf 


Ve 
S Treatments EE n N (139) 


The error sum of squares is obtained by subtraction as 


SSE = SST a DS Treatments (13-10) 





The computations for this test procedure are usually summarized in tabular form as shown in 
Table 13-3. This is called an analysis of variance (or ANOVA) table. 


Table 13-3 The Analysis of Variance for a Single-Factor Experiment, Fixed-Effects Model 


Source of Degrees of 
Variation Sum of Squares Freedom Mean Square Fo 

MS Treatment 
Treatments OS treatments a—l MS peatments —MS. 
Error SS, a(n — 1) MS; 


Total SS, an =| 
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EXAMPLE 13-1 


Consider the paper tensile strength experiment described in Section 13-2.1. We can use the 
analysis of variance to test the hypothesis that different hardwood concentrations do not affect 
the mean tensile strength of the paper. 

The hypotheses are 


Ho: Ti = T = T3 = 14 = O 


H: q; # 0 for atleast one i . 


We will use a = 0.01. The sums of squares for the analysis of variance are computed from 
Equations 13-8, 13-9, and 13-10 as follows: 


E. ys 
SSr= >, bi MES 


2 2 2 ( 83) 
— (7) + (8) + + (20) = 24 = 512.96 
4 2 2 
Yi Y 
SS Treatments — 2, n o- N 
60) + (94) + (102 + (127) (383) 
LP MP OR a CEP Oa 


SSE m SST — SS Treatments 
= 512.96 — 382.79 = 130.17 


The ANOVA is summarized in Table 13-4. Since fo; 559 = 4.94, we reject Hy and conclude 
that hardwood concentration in the pulp significantly affects the mean strength of the paper. 
We can also find a P-value for this test statistic as follows: 


P = P(F39 > 19.60) = 3.59 x 107° 


Since P = 3.59 X 10 is considerably smaller than a = 0.01, we have strong evidence to 
conclude that Ho is not true. 


Minitab Output 

Many software packages have the capability to analyze data from designed experiments using 
the analysis of variance. Table 13-5 presents the output from the Minitab one-way analysis of 
variance routine for the paper tensile strength experiment in Example 13-1. The results agree 
closely with the manual calculations reported previously in Table 13-4. 


Table 13-4 ANOVA for the Tensile Strength Data 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square ho P-value 
Hardwood 

concentration 382.79 3 127.60 19.60 3.59 E-6 
Error 130.17 20 6.51 


Total 512.96 29 
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Table 13-5 Minitab Analysis of Variance Output for Example 13-1 


One-Way ANOVA: Strength versus CONC 
Analysis of Variance for Strength 











Source DF SS MS F P 
Conc 3 382.79 127.60 19.61 0.000 
Error 20 130.17 6.51 
Total 23 512.96 Individual 95% CIs For Mean 
Based on Pooled StDev 
Level N Mean StDev — + ——- + - + - +— 
5 6 10.000 2.828 (—*—) 
10 6 15.667 2.805 (—*—) 
15 6 17.000 1.789 (—*—) 
20 6 21.167 2.639 (—*—) 
—- + -+ - + -+- 
Pooled StDev = 2:33] 10.0 15.0 20.0 25.0 


Fisher's pairwise comparisons 


Family error rate — 0.192 
Individual error rate — 0.0500 


Critical value = 2.086 


Intervals for (column level mean) — (row level mean) 


5 10 15 
10 — 8.739 
— 2.594 
15 — 10.072 —4.406 
9:928 1:759 
20 — 14.239 mu rp —1.239 


— 8.094 —2.428 — 1.094 


The Minitab output also presents 95% confidence intervals on each individual treatment 
mean. The mean of the ith treatment is defined as 


Mj—7 ptg i=1,2,...,a 


A point estimator of w; is ù; = Y;. Now, if we assume that the errors are normally distributed, 
each treatment average is normally distributed with mean p; and variance o?/n. Thus, if o? 
were known, we could use the normal distribution to construct a CI. Using MS; as an estima- 
tor of o? (The square root of MS; is the *Pooled StDev" referred to in the Minitab output), we 
would base the CI on the ¢-distribution, since 


T= Y. — W 
7 VMS k/ n 

has a t-distribution with a(n — 1) degrees of freedom. This leads to the following definition 
of the confidence interval. 
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Definition 
A 100(1 — a) percent confidence interval on the mean of the ith treatment p; is 
- MSp = MS; 
Yi 7 ly /2,a(n—1) n S wu Sy, t fa/2,a(n—1) n S 

Equation 13-11 is used to calculate the 95% CIs shown graphically in the Minitab output of 
Table 13-5. For example, at 20% hardwood the point estimate of the mean is y4. = 21.167, 
MS; = 6.51, and to02520 = 2.086, so the 95% CI is 

[Va X to.025,20 VMSg/n] 

[21.167 + (2.086)V6.51/6] 
or 

19.00 psi S py S 23.34 psi 

It can also be interesting to find confidence intervals on the difference in two treatment means, 

say, w; — p. The point estimator of w; — p; 1s Y;. — Y;., and the variance of this estimator is 

E E a. © E 207 

WE.- Qa t£ ^ 

Now if we use MS; to estimate o^, 

T= Y, — Y, — (M; — |j) 
has a t-distribution with a(n — 1) degrees of freedom. Therefore, a CI on p; — p; may be 
based on the f-distribution. 

Definition 


A 100(1 — a) percent confidence interval on the difference in two treatment means 
W; — pls 


s 2M5; UE IMS 
Yi — Yj T fa/2,a(n—1) n = WT By S ye T Vie t ta/2an-) T 


GBD) 





A 95% CI on the difference in means p, — p» is computed from Equation 13-12 as follows: 


[ y3. — Y». X fo2520 V 2MSg/n |] 
[17.00 — 15.67 + (2.086) V2(6.51)/6] 
Or 
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Since the CI includes zero, we would conclude that there is no difference in mean tensile 
strength at these two particular hardwood levels. 

The bottom portion of the computer output in Table 13-5 provides additional information con- 
cerning which specific means are different. We will discuss this in more detail in Section 13-2.3. 


An Unbalanced Experiment 

In some single-factor experiments, the number of observations taken under each treatment 
may be different. We then say that the design is unbalanced. In this situation, slight 
modifications must be made in the sums of squares formulas. Let n; observations be taken 
under treatment i (i = 1,2,...,a), and let the total number of observations N = Y, n; The 
computational formulas for SS and SSy,eatments are as shown in the following definition. 


Definition 
The sums of squares computing formulas for the ANOVA with unequal sample sizes 
n; in each treatment are 


(13-13) 


(13-14) 


SSE = SST m bee eR (Sela) 





Choosing a balanced design has two important advantages. First, the ANOVA is relatively 
insensitive to small departures from the assumption of equality of variances if the sample sizes 
are equal. This is not the case for unequal sample sizes. Second, the power of the test is max- 
imized if the samples are of equal size. 


13-2.5 Multiple Comparisons Following the ANOVA 


When the null hypothesis Hp: 7; = 70 = + = 7, = O is rejected in the ANOVA, we know 
that some of the treatment or factor level means are different. However, the ANOVA doesn't 
identify which means are different. Methods for investigating this issue are called multiple 
comparisons methods. Many of these procedures are available. Here we describe a very 
simple one, Fisher’s least significant difference (LSD) method. In Section 13-2.4 on the 
CD, we describe three other procedures. Montgomery (2001) presents these and other methods 
and provides a comparative discussion. 

The Fisher LSD method compares all pairs of means with the null hypotheses Ho: |; = y; 
(for all i # j) using the t-statistic 


In — Hp 


lj = ——— 
2MS 
n 


Assuming a two-sided alternative hypothesis, the pair of means w; and jy; would be declared 
significantly different if 





480 CHAPTER 13 DESIGN AND ANALYSIS OF SINGLE-FACTOR EXPERIMENTS: THE ANALYSIS OF VARIANCE 


EXAMPLE 13-2 


where LSD, the least significant difference, is 


2MS; 
LSD = 2an DN ES 


If the sample sizes are different in each treatment, the LSD is defined as 





1 i 
LSD = a/2,N—a MSp e T =) 


We will apply the Fisher LSD method to the hardwood concentration experiment. There are 
a = 4 means, n = 6, MS; = 6.51, and to.025,20 = 2.086. The treatment means are 


yı. = 10.00 psi 
y2. = 15.67 psi 
ys. = 17.00 psi 
y4. = 21.17 psi 


The value of LSD is LSD = 1905559 V2M Sz /n = 2.086 V/2(6.51)/6 = 3.07. Therefore, any 
pair of treatment averages that differs by more than 3.07 implies that the corresponding pair 
of treatment means are different. 

The comparisons among the observed treatment averages are as follows: 


4 vs. ] = 21.17 — 10.00 = 11.17 > 3.07 
4vs.2 = 21.17 — 15.67 = 5.50 > 3.07 
4vs.3 = 21.17 — 17.00 = 4.17 > 3.07 
3 vs. 1 = 17.00 — 10.00 = 7.00 > 3.07 
3vs.2 = 17.00 — 15.67 = 1.33 < 3.07 

. 1 = 15.67 — 10.00 = 5.67 > 3.07 


2 VS 


From this analysis, we see that there are significant differences between all pairs of means 
except 2 and 3. This implies that 10 and 15% hardwood concentration produce approximately 
the same tensile strength and that all other concentration levels tested produce different tensile 
strengths. It is often helpful to draw a graph of the treatment means, such as in Fig. 13-2, with 
the means that are not different underlined. This graph clearly reveals the results of the exper- 
iment and shows that 20% hardwood produces the maximum tensile strength. 


The Minitab output in Table 13-5 shows the Fisher LSD method under the heading 
“Fisher’s pairwise comparisons.” The critical value reported is actually the value of 1505559 = 


5% 10% 15% 20% 
e e——e e 
0 5 10 15 20 25 psi 


Figure 13-2 Results of Fisher’s LSD method in Example 13-2. 
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2.086. Minitab implements Fisher's LSD method by computing confidence intervals on all 
pairs of treatment means using Equation 13-12. The lower and upper 95% confidence limits 
are shown at the bottom of the table. Notice that the only pair of means for which the 
confidence interval includes zero is for wọ and ps. This implies that wio and p; are not 
significantly different, the same result found in Example 13-2. 

Table 13-5 also provides a "family error rate," equal to 0.192 in this example. When all 
possible pairs of means are tested, the probability of at least one type I error can be much 
greater than for a single test. We can interpret the family error rate as follows. The probability 
is 1 — 0.192 — 0.808 that there are no type I errors in the six comparisons. The family error 
rate in Table 13-5 is based on the distribution of the range of the sample means. See 
Montgomery (2001) for details. Alternatively, Minitab permits you to specify a family error 
rate and will then calculate an individual error rate for each comparison. 


13-2.4 More About Multiple Comparisons (CD Only) 
13-2.5 Residual Analysis and Model Checking 


The analysis of variance assumes that the observations are normally and independently dis- 
tributed with the same variance for each treatment or factor level. These assumptions should 
be checked by examining the residuals. A residual is the difference between an observation y;; 
and its estimated (or fitted) value from the statistical model being studied, denoted as y;;. For 
the completely randomized design y; = y;. and each residual is e; = y; — Yi» that is, the dif- 
ference between an observation and the corresponding observed treatment mean. The residuals 
for the paper tensile strength experiment are shown in Table 13-6. Using y;. to calculate each 
residual essentially removes the effect of hardwood concentration from the data; consequently, 
the residuals contain information about unexplained variability. 

The normality assumption can be checked by constructing a normal probability plot of 
the residuals. To check the assumption of equal variances at each factor level, plot the residu- 
als against the factor levels and compare the spread in the residuals. It is also useful to plot the 
residuals against y;. (sometimes called the fitted value); the variability in the residuals should 
not depend in any way on the value of y,. Most statistics software packages will construct 
these plots on request. When a pattern appears in these plots, it usually suggests the need for 
a transformation, that is, analyzing the data in a different metric. For example, if the variabil- 
ity in the residuals increases with y;., a transformation such as log y or Vy should be consid- 
ered. In some problems, the dependency of residual scatter on the observed mean y;. is very 
important information. It may be desirable to select the factor level that results in maximum 
response; however, this level may also cause more variation in response from run to run. 

The independence assumption can be checked by plotting the residuals against the time 
or run order in which the experiment was performed. A pattern in this plot, such as sequences 
of positive and negative residuals, may indicate that the observations are not independent. 


Table 13-6 Residuals for the Tensile Strength Experiment 


Hardwood 
Concentration (96) Residuals 
5 —3.00 — 2.00 5.00 1.00 — 1.00 0.00 
10 —3.67 hoo =207 2:33 3.33 0.67 
15 — 3.00 1.00 2.00 0.00 — 1.00 1.00 


20 =2.17 3.83 0.83 1.83 =) 17 EL 
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Figure 13-5 Plot of residuals versus y; 


This suggests that time or run order is important or that variables that change over time are 
important and have not been included in the experimental design. 

A normal probability plot of the residuals from the paper tensile strength experiment is 
shown in Fig. 13-3. Figures 13-4 and 13-5 present the residuals plotted against the factor 
levels and the fitted value y;. respectively. These plots do not reveal any model inadequacy or 
unusual problem with the assumptions. 


13-2.6 Determining Sample Size 


In any experimental design problem, the choice of the sample size or number of replicates to 
use 1s important. Operating characteristic curves can be used to provide guidance in mak- 
ing this selection. Recall that the operating characteristic curve is a plot of the probability ofa 


EXAMPLE 13-3 
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type II error (B) for various sample sizes against a measure of the difference in means that it 
is important to detect. Thus, if the experimenter knows the magnitude of the difference in 
means that is of potential importance, the operating characteristic curves can be used to deter- 
mine how many replicates are required to achieve adequate sensitivity. 

The power of the ANOVA test is 


1 — B = P{Reject Hp | Ho is false} 
- P{Fo Jad dae) | Ho is false} (13-17) 


To evaluate this probability statement, we need to know the distribution of the test statistic Fo 
if the null hypothesis is false. It can be shown that, if Ho is false, the statistic Fy = 
MS yeamens/ MS, is distributed as a noncentral F random variable, with a — 1 and a(n — 1) 
degrees of freedom and a noncentrality parameter 6. If 6 = 0, the noncentral F-distribution 
becomes the usual or central F-distribution. 

Operating characteristic curves are used to evaluate Q defined in Equation 13-17. These 
curves plot B against a parameter ®, where 





p =, (13-18) 


The parameter ® is (apart from n) the noncentrality parameter 8. Curves are available for 
a = 0.05 and a = 0.01 and for several values of the number of degrees of freedom for nu- 
merator (denoted v,) and denominator (denoted v,). Figure 13-6 gives representative O.C. 
curves, one for a = 4 (v, = 3) and one for a = 5 (v, = 4) treatments. Notice that for each 
value of a there are curves for a = 0.05 and a = 0.01. O.C. curves for other values of a are 
in Section 13-2.7 on the CD. 

In using the operating curves, we must define the difference in means that we wish to 
detect in terms of 97, 77. Also, the error variance o? is usually unknown. In such cases, we 
must choose ratios of €; ,77/o^ that we wish to detect. Alternatively, if an estimate of o? 
is available, one may replace o? with this estimate. For example, if we were interested in 
the sensitivity of an experiment that has already been performed, we might use MS; as the 
estimate of o°. 


Suppose that five means are being compared in a completely randomized experiment with 
a = 0.01. The experimenter would like to know how many replicates to run if it is impor- 
tant to reject Hy with probability at least 0.90 if p - 17/0^ = 5.0. The parameter d? is, in 
this case, 





and for the operating characteristic curve with v = a — 1 = 5 — 1 = 4,and v, = a(n — 1) = 
5(n — 1) error degrees of freedom refer to the lower curve in Figure 13-6. As a first guess, try 
n = 4 replicates. This yields X? = 4, ® = 2, and v, = 5(3) = 15 error degrees of freedom. 
Consequently, from Figure 13-6, we find that B = 0.38. Therefore, the power of the test is 
approximately 1 — B = 1 — 0.38 = 0.62, which is less than the required 0.90, and so we 
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Figure 13-6 Two Operating characteristic curves for the fixed-effects model analysis of variance. 
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conclude that n = 4 replicates is not sufficient. Proceeding in a similar manner, we can con- 


struct the following table: 


n 
q 4 2.00 
5 5 2.24 
6 6 2.45 


a(n — 1) B Power = (1 — B) 
15 0.38 0.62 
20 0.18 0.82 
25 0.06 0.94 


Thus, at least n — 6 replicates must be run in order to obtain a test with the required power. 


13-2.7 Technical Details about the Analysis of Variance (CD Only) 


EXERCISES FOR SECTION 13-2 


13-1. In Design and Analysis of Experiments, 5th edition 
(John Wiley & Sons, 2001) D. C. Montgomery describes an 
experiment in which the tensile strength of a synthetic fiber is 
of interest to the manufacturer. It is suspected that strength is 
related to the percentage of cotton in the fiber. Five levels of 
cotton percentage are used, and five replicates are run in 
random order, resulting in the data below. 


Cotton SSCS 
Percentage 1 2 3 4 5 
15 7 7 15 11 9 

20 12 17 12 18 18 

25 14 18 18 19 19 

30 19 25 22 19 23 

35 7 10 11 15 11 


(a) Does cotton percentage affect breaking strength? Draw 
comparative box plots and perform an analysis of vari- 
ance. Use a = 0.05. 

(b) Plot average tensile strength against cotton percentage 
and interpret the results. 


(c) Analyze the residuals and comment on model adequacy. 


13-2. In *Orthogonal Design for Process Optimization and Its 
Application to Plasma Etching" (Solid State Technology, May 
1987), G. Z. Yin and D. W. Jillie describe an experiment to de- 
termine the effect of C,F, flow rate on the uniformity of the etch 
on a silicon wafer used in integrated circuit manufacturing. 
Three flow rates are used in the experiment, and the resulting 
uniformity (in percent) for six replicates is shown below. 


CF; Flow Observations 
(SCCM) l 2 3 4 5 6 
125 2J 4.6 2.6 3.0 3.2 3.8 
160 4.9 4.6 5.0 4.2 3.6 A? 
200 4.6 3.4 2.9 3.5 4.1 5.1 


(a) Does C,F, flow rate affect etch uniformity? Construct box 
plots to compare the factor levels and perform the analysis 
of variance. Use a = 0.05. 

(b) Do the residuals indicate any problems with the underly- 
ing assumptions? 

13-3. The compressive strength of concrete is being stud- 

ied, and four different mixing techniques are being investi- 

gated. The following data have been collected. 


Mixing 
Technique Compressive Strength (psi) 
l 3129 3000 2865 2890 
2 3200 3300 2975 3150 
3 2800 2900 2985 3050 
E 2600 2700 2600 2765 


(a) Test the hypothesis that mixing techniques affect the 
strength of the concrete. Use a = 0.05. 

(b) Find the P-value for the F-statistic computed in part (a). 

(c) Analyze the residuals from this experiment. 

13-4. An experiment was run to determine whether four 

specific firing temperatures affect the density of a certain type 

of brick. The experiment led to the following data. 


Temperature 

(°F) Density 

100 21.8 219 21.7 21.6 21.7 21. 21.8 
125 21.7 214 21.5 215 — — — 
150 21.9 21.8 218 216 215 — — 
175 21.9 21.7 218 217 216 218 — 


(a) Does the firing temperature affect the density of the 
bricks? Use a — 0.05. 

(b) Find the P-value for the F-statistic computed in part (a). 

(c) Analyze the residuals from the experiment. 


SZ 
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13-5. An electronics engineer is interested in the effect on 
tube conductivity of five different types of coating for cathode 
ray tubes in a telecommunications system display device. The 
following conductivity data are obtained. 


Coating 
Type Conductivity 
l 143 141 150 146 
2 152 149 137 143 
3 134 133 132 127 
4 129 127 132 129 
5 147 148 144 142 


(a) Is there any difference in conductivity due to coating 
type? Use a = 0.01. 

(b) Analyze the residuals from this experiment. 

(c) Construct a 95% interval estimate of the coating type 1 
mean. Construct a 99% interval estimate of the mean dif- 
ference between coating types 1 and 4. 

13-6. Theresponse time in milliseconds was determined for 

three different types of circuits in an electronic calculator. The 

results are recorded here. 


Circuit 
Type Response 
1 19 22 20 18 25 
2 20 2l 33 27 40 
3 16 15 18 26 17 


(a) Using a = 0.01, test the hypothesis that the three circuit 
types have the same response time. 

(b) Analyze the residuals from this experiment. 

(c) Find a 95% confidence interval on the response time for 
circuit three. 


13-7. An article in the ACI Materials Journal (Vol. 84, 
1987, pp. 213—216) describes several experiments investi- 
gating the rodding of concrete to remove entrapped air. A 
3-inch X 6-inch cylinder was used, and the number of times 
this rod was used is the design variable. The resulting com- 
pressive strength of the concrete specimen is the response. 
The data are shown in the following table. 


Rodding 
evel Compressive Strength 
10 1530 1530 1440 
15 1610 1650 1500 
20 1560 1730 1530 
25 1500 1490 1510 


(a) Is there any difference in compressive strength due to the 
rodding level? 
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(b) Find the P-value for the F-statistic in part (a). 

(c) Analyze the residuals from this experiment. What con- 
clusions can you draw about the underlying model 
assumptions? 


13-8. An article in Environment International (Vol. 18, 
No. 4, 1992) describes an experiment in which the amount of 
radon released in showers was investigated. Radon-enriched 
water was used in the experiment, and six different orifice 
diameters were tested in shower heads. The data from the 
experiment are shown in the following table. 


Orifice 
Diameter Radon Released (96) 
0.37 80 83 83 85 
0.51 T9 T3 79 79 
0.71 74 73 76 TI 
1.02 67 74 74 74 
1.40 62 62 67 69 
1.99 60 61 64 66 


(a) Does the size of the orifice affect the mean percentage of 
radon released? Use a = 0.05. 

(b) Find the P-value for the F-statistic in part (a). 

(c) Analyze the residuals from this experiment. 

(d) Find a 95% confidence interval on the mean percent of 
radon released when the orifice diameter is 1.40. 


13-9. A paper in the Journal of the Association of Asphalt 
Paving Technologists (Vol. 59, 1990) describes an experi- 
ment to determine the effect of air voids on percentage 
retained strength of asphalt. For purposes of the experiment, 
air voids are controlled at three levels; low (2-4%), medium 
(4—6%), and high (6—896). The data are shown in the follow- 
ing table. 


Air Voids Retained Strength (96) 

Low 106 90 103 90 79 88 92 95 
Medium 80 69 94 91 70 83 S87 83 
High 78 80 62 69 76 85 69 85 


(a) Do the different levels of air voids significantly affect 
mean retained strength? Use a = 0.01. 

(b) Find the P-value for the F-statistic in part (a). 

(c) Analyze the residuals from this experiment. 

(d) Find a 95% confidence interval on mean retained strength 
where there is a high level of air voids. 


(e) Find a 95% confidence interval on the difference 
in mean retained strength at the low and high levels of 
air voids. 

13-10. An article in the Materials Research Bulletin (Vol. 26, 

No. 11, 1991) investigated four different methods of preparing 

the superconducting compound PbMo;,Sg. The authors contend 


ae 


that the presence of oxygen during the preparation process 
affects the material’s superconducting transition temperature T,. 
Preparation methods 1 and 2 use techniques that are designed to 
eliminate the presence of oxygen, while methods 3 and 4 allow 
oxygen to be present. Five observations on T, (in °K) were made 
for each method, and the results are as follows: 


Preparation 
Method Transition Temperature 7,(CK) 
1 14.8 14.8 14.7 14.8 14.9 
2 14.6 15.0 14.9 14.8 14.7 
3 12 11.6 12.4 12.7 12:1 
4 14.2 14.4 14.4 12.2 11.7 


(a) Is there evidence to support the claim that the presence of 
oxygen during preparation affects the mean transition 
temperature? Use a = 0.05. 

(b) What is the P-value for the F-test in part (a)? 

(c) Analyze the residuals from this experiment. 

(d) Find a 95% confidence interval on mean T, when method 
1 is used to prepare the material. 


13-11. Use Fisher’s LSD method with a = 0.05 to analyze 
the means of the five different levels of cotton content in 
Exercise 13-1. 


13-12. Use Fisher's LSD method with a = 0.05 test to 
analyze the means of the three flow rates in Exercise 13-2. 


13-3 THE RANDOM-EFFECTS MODEL 


13-3.1 Fixed versus Random Factors 
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13-13. Use Fisher’s LSD method with a = 0.05 to analyze 
the mean compressive strength of the four mixing techniques 
in Exercise 13-3. 


13-14. Use Fisher's LSD method to analyze the five means 
for the coating types described in Exercise 13-5. Usea = 0.01. 


13-15. Use Fishers LSD method to analyze the mean 
response times for the three circuits described in Exercise 
13-6. Usea = 0.01. 


13-16. Use Fisher’s LSD method to analyze the mean 
amounts of radon released in the experiment described in 
Exercise 13-8. Use a = 0.05. 


13-17. Apply Fisher’s LSD method to the air void experi- 
ment described in Exercise 13-9. Using a = 0.05, which 
treatment means are different? 


13-18. Apply Fisher’s LSD method to the superconducting 
material experiment described in Exercise 13-10. Which 
preparation methods differ, if a = 0.05? 


13-19. Suppose that four normal populations have common 
variance o? = 25 and means p; = 50, jj = 60, u, = 50, and 
u4 = 60. How many observations should be taken on each 
population so that the probability of rejecting the hypothesis 
of equality of means is at least 0.90? Use a = 0.05. 


13-20. Suppose that five normal populations have common 
variance o? = 100 and means u; = 175, uj = 190, u, = 160, 
u4 = 200, and u; = 215. How many observations per popula- 
tion must be taken so that the probability of rejecting the 
hypothesis of equality of means is at least 0.95? Use a = 0.01. 


In many situations, the factor of interest has a large number of possible levels. The analyst is 
interested in drawing conclusions about the entire population of factor levels. If the experimenter 
randomly selects a of these levels from the population of factor levels, we say that the factor is a 
random factor. Because the levels of the factor actually used in the experiment were chosen ran- 
domly, the conclusions reached will be valid for the entire population of factor levels. We will 
assume that the population of factor levels is either of infinite size or is large enough to be con- 
sidered infinite. Notice that this is a very different situation than we encountered in the fixed 
effects case, where the conclusions apply only for the factor levels used in the experiment. 


13-3.2 ANOVA and Variance Components 


The linear statistical model is 


Wyo nta te 


i= 1,2,...,a 


Ino (13-19) 


where the treatment effects t; and the errors e; are independent random variables. Note that 
the model is identical in structure to the fixed-effects case, but the parameters have a different 


@ 


@ 


@ 


@ 


@ 
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interpretation. If the variance of the treatment effects 7, is 02, by independence the variance of 
the response is 


VY) = oio! 


The variances o? and o? are called variance components, and the model, Equation 13-19, is 
called the components of variance model or the random-effects model. To test hypotheses 
in this model, we assume that the errors e; are normally and independently distributed with 
mean 0 and variance o? and that the treatment effects 7; are normally and independently dis- 
tributed with mean zero and variance o2.* 

For the random-effects model, testing the hypothesis that the individual treatment effects 
are zero is meaningless. It is more appropriate to test hypotheses about o2. Specifically, 


Ho: o? = 0 
Hio: > 0 


If o? = 0, all treatments are identical; but if o2 > 0, there is variability between treatments. 
The ANOVA decomposition of total variability is still valid; that is, 


SST = DO Treats + SSE (13-20) 


However, the expected values of the mean squares for treatments and error are somewhat 
different than in the fixed-effect case. 


In the random-effects model for a single-factor, completely randomized experiment, 
the expected mean square for treatments 1s 


SS Treatment 
EMS Teea) =E (H sanens ) 


o? + no? (13-21) 


E(MSg) = 


(13-22) 


From examining the expected mean squares, it is clear that both MS, and MS. 
estimate o? when Hy: o? = 0 is true. Furthermore, MS; and MSreamens are independent. 


Consequently, the ratio 


u MS Treatments 


— 13-23 
Eoo (13-23) 





*The assumption that the (7) are independent random variables implies that the usual assumption of $;., 7; = 0 
from the fixed-effects model does not apply to the random-effects model. 


EXAMPLE 13-4 





13-3 THE RANDOM-EFFECTS MODEL 489 


is an F random variable with a — 1 and a(n — 1) degrees of freedom when Hj, is true. The null 
hypothesis would be rejected at the a-level of significance if the computed value of the test 
statistic fo > fo,a—1,a(n-1): 

The computational procedure and construction of the ANOVA table for the random- 
effects model are identical to the fixed-effects case. The conclusions, however, are quite dif- 
ferent because they apply to the entire population of treatments. 

Usually, we also want to estimate the variance components (o? and o2) in the model. The 
procedure that we will use to estimate o? and o2 is called the analysis of variance method 
because it uses the information in the analysis of variance table. It does not require the nor- 
mality assumption on the observations. The procedure consists of equating the expected mean 
squares to their observed values in the ANOVA table and solving for the variance components. 
When equating observed and expected mean squares in the one-way classification random- 
effects model, we obtain 


E. 2 = 22 
MS treatments = 0 + no? and MS; =o 


Therefore, the estimators of the variance components are 


6 = MS; (13-24) 


O EM MS treatments — MSE 


(13-25) 


On m 


Sometimes the analysis of variance method produces a negative estimate of a variance 
component. Since variance components are by definition nonnegative, a negative estimate of a 
variance component is disturbing. One course of action is to accept the estimate and use it as 
evidence that the true value of the variance component is zero, assuming that sampling variation 
led to the negative estimate. While this approach has intuitive appeal, it will disturb the statisti- 
cal properties of other estimates. Another alternative is to reestimate the negative variance com- 
ponent with a method that always yields nonnegative estimates. Still another possibility is to 
consider the negative estimate as evidence that the assumed linear model is incorrect, requiring 
that a study of the model and its assumptions be made to find a more appropriate model. 


In Design and Analysis of Experiments, 5th edition (John Wiley, 2001), D. C. Montgomery de- 
scribes a single-factor experiment involving the random-effects model in which a textile man- 
ufacturing company weaves a fabric on a large number of looms. The company is interested 
in loom-to-loom variability in tensile strength. To investigate this variability, a manufacturing 
engineer selects four looms at random and makes four strength determinations on fabric sam- 
ples chosen at random from each loom. The data are shown in Table 13-7 and the ANOVA is 
summarized in Table 13-8. 

From the analysis of variance, we conclude that the looms in the plant differ significantly 
in their ability to produce fabric of uniform strength. The variance components are estimated 
by 6^ = 1.90 and 


., 29.73 — 1.90 


62 ; — 6.96 
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Table 13-7 Strength Data for Example 13-4 


Observations 
Lom 1 2 3 4 Total Average Table 13-8 Analysis of Variance for the Strength Data 
1 98 97 99 96 390 97.5 Source of  Sumof  Degreesof | Mean 
5 91 90 93 92 366 91.5 Variation Squares Freedom Square fo P-value 
3 96 95 097 95 383 95.8 Looms 89.19 3 20.73 15.68 1.88 E-4 
4 95 96 99 98 388 97.0 Error 22.79 12 1.90 
1527 95.45 Total 111.94 15 


Therefore, the variance of strength in the manufacturing process is estimated by 


[NS 
W(Y;) = 62 + 6? = 6.96 + 1.90 = 8.86 


ij 
Most of this variability is attributable to differences between looms. 


This example illustrates an important application of the analysis of variance—the iso- 
lation of different sources of variability in a manufacturing process. Problems of excessive 
variability in critical functional parameters or properties frequently arise in quality- 
improvement programs. For example, in the previous fabric strength example, the process 
mean is estimated by y = 95.45 psi, and the process standard deviation is estimated by 
gy m V V(Y;) = V8.86 = 2.98 psi. If strength is approximately normally distributed, the 
distribution of strength in the outgoing product would look like the normal distribution 
shown in Fig. 13-7(a). If the lower specification limit (LSL) on strength is at 90 psi, a sub- 
stantial proportion of the process output is fallout—that is, scrap or defective material that 
must be sold as second quality, and so on. This fallout is directly related to the excess vari- 
ability resulting from differences between looms. Variability in loom performance could be 
caused by faulty setup, poor maintenance, inadequate supervision, poorly trained operators, 
and so forth. The engineer or manager responsible for quality improvement must identify 
and remove these sources of variability from the process. If this can be done, strength vari- 
ability will be greatly reduced, perhaps as low as ay = Vô? = N/1.90 = 1.38 psi, as 
shown in Fig. 13-7(b). In this improved process, reducing the variability in strength has 
greatly reduced the fallout, resulting in lower cost, higher quality, a more satisfied cus- 
tomer, and enhanced competitive position for the company. 


13-3.3 Determining Sample Size in the Random Model (CD Only) 


Process 
fallout 
80 85 90 95 100 105 110 psi 80 85 90 95 100 105 110 psi 
LSL LSL 
(a) (b) 


Figure 13-7 The distribution of fabric strength. (a) Current process, (b) improved process. 
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EXERCISES FOR SECTION 13-3 


13-21. A textile mill has a large number of looms. Each 
loom is supposed to provide the same output of cloth per 
minute. To investigate this assumption, five looms are chosen 
at random, and their output is measured at different times. The 
following data are obtained: 


Loom Output (Ib/min) 
l 40 4.1 4.2 4.0 4.1 
2 3.9 3.8 3.9 4.0 4.0 
3 4.1 4.2 4.1 4.0 3:9 
4 3.6 3.8 4.0 3.9 3.7 
S 3.8 3.6 3.9 3.8 4.0 


(a) Are the looms similar in output? Use a = 0.05. 

(b) Estimate the variability between looms. 

(c) Estimate the experimental error variance. 

(d) Analyze the residuals from this experiment and check for 
model adequacy. 


13-22. Anarticle in the Journal of the Electrochemical Society 
(Vol. 139, No. 2, 1992, pp. 524—532) describes an experiment to 
investigate the low-pressure vapor deposition of polysilicon. The 
experiment was carried out in a large-capacity reactor at 
Sematech in Austin, Texas. The reactor has several wafer posi- 
tions, and four of these positions are selected at random. The 
response variable is film thickness uniformity. Three replicates of 
the experiment were run, and the data are as follows: 


Water 
Position Uniformity 
l 2.76 5.67 4.49 
2 1.43 1.70 2.19 
3 2.34 1.97 1.47 
4 0.94 1.36 1.65 
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(a) Is there a difference in the wafer positions? Use a = 0.05. 

(b) Estimate the variability due to wafer positions. 

(c) Estimate the random error component. 

(d) Analyze the residuals from this experiment and comment 
on model adequacy. 


13-23. An article in the Journal of Quality Technology 
(Vol. 13, No. 2, 1981, pp. 111—114) describes an experiment 
that investigates the effects of four bleaching chemicals on 
pulp brightness. These four chemicals were selected at ran- 
dom from a large population of potential bleaching agents. 
The data are as follows: 


Chemical Pulp Brightness 
l 77.199 74.466 92.746 76.208 82.876 
2 80.522 79.306 81.914 80.346 73.385 
3 79.417 78.017 91.596 80.802 80.626 
4 78.001 78.358 77.544 77.364 77.386 


(a) Is there a difference in the chemical types? Use a = 0.05. 

(b) Estimate the variability due to chemical types. 

(c) Estimate the variability due to random error. 

(d) Analyze the residuals from this experiment and comment 
on model adequacy. 


13-24. Consider the vapor-deposition experiment described 

in Exercise 13-22. 

(a) Estimate the total variability in the uniformity response. 

(b) How much of the total variability in the uniformity 
response is due to the difference between positions in the 
reactor? 

(c) To what level could the variability in the uniformity re- 
sponse be reduced, if the position-to-position variability 
in the reactor could be eliminated? Do you believe this is 
a significant reduction? 
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13-4.1 Design and Statistical Analysis 


In many experimental design problems, it is necessary to design the experiment so that the 
variability arising from a nuisance factor can be controlled. For example, consider the sit- 
uation of Example 10-9, where two different methods were used to predict the shear 
strength of steel plate girders. Because each girder has different strength (potentially), and 
this variability in strength was not of direct interest, we designed the experiment by using 
the two test methods on each girder and then comparing the average difference in strength 
readings on each girder to zero using the paired t-test. The paired t-test is a procedure for 
comparing two treatment means when all experimental runs cannot be made under 
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Block 1 Block 2 Block 3 Block4 ‘Table 13-9 A Randomized Complete Block Design 


EIS Block (Girder) 
e 2 3 E 
1 J12 J 13 J14 


l 

v1 
Figure 13-8 A randomized complete Jn y J23 Yr 
block design. Ji X32 J33 X34 


homogeneous conditions. Alternatively, we can view the paired f-test as a method for re- 
ducing the background noise in the experiment by blocking out a nuisance factor effect. 
The block is the nuisance factor, and in this case, the nuisance factor is the actual experi- 
mental unit—the steel girder specimens used in the experiment. 

The randomized block design is an extension of the paired t-test to situations where 
the factor of interest has more than two levels; that 1s, more than two treatments must be 
compared. For example, suppose that three methods could be used to evaluate the strength 
readings on steel plate girders. We may think of these as three treatments, say t4, t, and t4. 
If we use four girders as the experimental units, a randomized complete block design 
would appear as shown in Fig. 13-8. The design is called a randomized complete block 
design because each block is large enough to hold all the treatments and because the actual 
assignment of each of the three treatments within each block is done randomly. Once the 
experiment has been conducted, the data are recorded in a table, such as is shown in 
Table 13-9. The observations in this table, say y;; represent the response obtained when 
method i is used on girder j. 

The general procedure for a randomized complete block design consists of selecting 5 
blocks and running a complete replicate of the experiment in each block. The data that re- 
sult from running a randomized complete block design for investigating a single factor 
with a levels and 5 blocks are shown in Table 13-10. There will be a observations (one per 
factor level) in each block, and the order in which these observations are run is randomly 
assigned within the block. 

We will now describe the statistical analysis for a randomized complete block design. 
Suppose that a single factor with a levels is of interest and that the experiment is run in b 
blocks. The observations may be represented by the linear statistical model 


E i=1,2,...,a 
Yy — ptt; + B+ eg je ue aid (13-26) 
Table 13-10 A Randomized Complete Block Design with a Treatments and b Blocks 
Blocks 

Treatments 1 2 e b Totals Averages 

l vit Yı2 zr Vib Yı- Yı. 

2 Y» Y22 vee Y2b Va ya. 

a Yal Va2 uy Jab Ya. Ja: 

Totals y Y.2 TE Yap y.. 


Averages y4 y yis y.b y.. 
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where p is an overall mean, 7; is the effect of the ith treatment, B; is the effect of the jth 
block, and e;; is the random error term, which is assumed to be normally and independently 
distributed with mean zero and variance o?. Treatments and blocks will initially be consid- 
ered as fixed factors. Furthermore, the treatment and block effects are defined as deviations 
from the overall mean, so X. T; = 0 and > p; = 0. We also assume that treatments and 
blocks do not interact. That is, the effect of treatment i is the same regardless of which block 
(or blocks) it is tested in. We are interested in testing the equality of the treatment effects. 
That is 


Ho: 74 =m = =T= 
H: T; # 0 at least one i 
Testing the hypothesis that all the treatment effects 7; are equal to zero is equivalent to 


testing the hypothesis that the treatment means are equal. To see this, note that the mean of the 
ith treatment is p,;, defined as 





b 
2, 
d l| I 
pi = E\ S— ]} = 5 Dd Kf) 7, 2, En + v + Bj; * ej) 
Jel il 
12 bs 
=F daw tutB)-utut.l B; 
Jal Jai 


and since br B; = 0, we have the mean of the ith treatment defined as 


w=wtt i-71,2,...,a 


Therefore, testing the hypothesis that the a treatment means are equal is equivalent to testing 
that all the treatment effects 7; are equal to zero. 

The analysis of variance can be extended to the randomized complete block design. The 
procedure uses a sum of squares identity that partitions the total sum of squares into three 
components. 


The sum of squares identity for the randomized complete block design is 


(13-27) 


or symbolically 


SST x SOTE T SS Blocks a SSE 





Furthermore, the degrees of freedom corresponding to these sums of squares are 


ab — 1 = (a — 1) + (b — 1) + (a — Do — 1) 
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Definition 


For the randomized block design, the relevant mean squares are 


S S Treatments 


M. S Treatments ~ 





Therefore, if the null hypothesis Hj, is true so that all treatment effects 7; = 0, MStyeatments 18 an 
unbiased estimator of o?, while if Hp is false, MStreatments overestimates o°. The mean square 
for error is always an unbiased estimate of o^. To test the null hypothesis that the treatment ef- 
fects are all zero, we use the ratio 


MS Treatments 
o = — MS; (13-28) 
which has an F-distribution with a — 1 and (a — 1)(b — 1) degrees of freedom if the null 
hypothesis is true. We would reject the null hypothesis at the a-level of significance if the 
computed value of the test statistic in Equation 13-28 is fo > fs, 1 (5 o1 
In practice, we compute SS7, SStreatments aNd SSBiocks and then obtain the error sum of 
squares SS; by subtraction. The appropriate computing formulas are as follows. 


The computing formulas for the sums of squares in the analysis of variance for a ran- 
domized complete block design are 


(13-29) 
(13-30) 


SS plecks = a (13-31) 
j=l 


SSE = SST v SPESE v OS Blocks (13-32) 
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Table 13-11 ANOVA for a Randomized Complete Block Design 








Source of Degrees of 

Variation Sum of Squares Freedom Mean Square Fo 

Treatments SS Treatments a—1 SS Treatments MST reatments 

a=] MS; 

SS 

Blocks SS Blocks p= I pu 
b—1 

E SS; (by subtracti b — 1 a: 

rror z (by subtraction) (a x ) CIE hes) 
Total SS ab — | 


The computations are usually arranged in an ANOVA table, such as is shown in Table 13-11. 
Generally, a computer software package will be used to perform the analysis of variance for 
the randomized complete block design. 


EXAMPLE 13-5 An experiment was performed to determine the effect of four different chemicals on the 
strength of a fabric. These chemicals are used as part of the permanent press finishing 
process. Five fabric samples were selected, and a randomized complete block design 
was run by testing each chemical type once in random order on each fabric sample. The 
data are shown in Table 13-12. We will test for differences in means using an ANOVA with 
a = 0.01. 


The sums of squares for the analysis of variance are computed as follows: 








; ; ; 119927 
—(L3) (LO PB 20 — 25.69 
4 2 2 
yi ya 
S S Treatments aH >, 3 B p 
(5.7 + (8.8)? + (6.9) + (17.59* (39.2) "— 
i 5 20 7 
Table 13-12 Fabric Strength Data—Randomized Complete Block Design 
Treatment Treatment 
Fabric Sample Totals Averages 
Chemical Type I 2 B 4 5 Vj. Vj. 
1 1.3 1.6 0.5 1.2 Ll 5. 1.14 
2 22 2.4 0.4 2.0 1.8 8.8 1.76 
S 1.8 1.7 0.6 1:5 1.3 6.9 1.38 
4 3.9 4.4 2.0 4.1 3.4 17.8 3.56 
Block totals y.; 9.2 10.1 3.5 8.8 7.6 39.2(y..) 


Block averages y.; 2.30 2:23 0.88 2.20 1.90 1.96(y..) 
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Table 13-13 Analysis of Variance for the Randomized Complete Block Experiment 





Source of Degrees of 
Variation Sum of Squares Freedom Mean Square fo P-value 
Chemical types 
(treatments) 18.04 3 6.01 75.13 4.79 E-8 
Fabric samples 
(blocks) 6.69 4 1.67 
Error 0.96 12 0.08 
Total 25.09 19 
2 
SS E y 
Blocks = a ab 
(9.2 + (10.1 + (3.5)? + (8.8)? + (7.6)?  (392y 
= TQ — Hn = 6.69 


SSE — SST E SSBlocks v DO Teamen 
= 25.69 — 6.69 — 18.04 = 0.96 


The ANOVA is summarized in Table 13-13. Since fo = 75.13 > fo.o13,12 = 5.95 (the P-value 
is 4.79 X 1075), we conclude that there is a significant difference in the chemical types so far 
as their effect on strength is concerned. 


When Is Blocking Necessary? 

Suppose an experiment is conducted as a randomized block design, and blocking was not 
really necessary. There are ab observations and (a — 1)(6 — 1) degrees of freedom for error. 
If the experiment had been run as a completely randomized single-factor design with b repli- 
cates, we would have had a(b — 1) degrees of freedom for error. Therefore, blocking has cost 
a(b — 1) — (a — 1)(6 — 1) = b — 1 degrees of freedom for error. Thus, since the loss in 
error degrees of freedom is usually small, if there is a reasonable chance that block effects may 
be important, the experimenter should use the randomized block design. 

For example, consider the experiment described in Example 13-5 as a single-factor experi- 
ment with no blocking. We would then have 16 degrees of freedom for error. In the randomized 
block design, there are 12 degrees of freedom for error. Therefore, blocking has cost only 4 
degrees of freedom, which is a very small loss considering the possible gain in information that 
would be achieved if block effects are really important. The block effect in Example 13-5 is 
large, and if we had not blocked, SSbiocks would have been included in the error sum of squares 
for the completely randomized analysis. This would have resulted in a much larger MS,, making 
it more difficult to detect treatment differences. As a general rule, when in doubt as to the 
importance of block effects, the experimenter should block and gamble that the block effect does 
exist. If the experimenter is wrong, the slight loss in the degrees of freedom for error will have a 
negligible effect, unless the number of degrees of freedom is very small. 


Computer Solution 

Table 13-14 presents the computer output from Minitab for the randomized complete block 
design in Example 13-5. We used the analysis of variance menu for balanced designs to solve 
this problem. The results agree closely with the hand calculations from Table 13-13. Notice 
that Minitab computes an F-statistic for the blocks (the fabric samples). The validity of this ra- 
tio as a test statistic for the null hypothesis of no block effects is doubtful because the blocks 
represent a restriction on randomization; that is, we have only randomized within the 
blocks. If the blocks are not chosen at random, or if they are not run in random order, the 
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Table 13-14 Minitab Analysis of Variance for the Randomized Complete 
Block Design in Example 13-5 


Analysis of Variance (Balanced Designs) 


Factor Type Levels Values 
Chemical fixed 4 ] 2 3 4 
Fabric S fixed 5 l 2 3 4 5 


Analysis of Variance for strength 


Source DF SS MS F P 
Chemical 3 18.0440 6.0147 75.89 0.000 
Fabric S 4 6.6930 1.6733 21.11 0.000 
Error 17 0.9510 0.0792 

Total 19 25.6880 


F-test with denominator: Error 
Denominator MS — 0.079250 with 12 degrees of freedom 


Numerator DF MS F P 
Chemical 3 6.015 75.89 0.000 
Fabric S 4 1.673 21.11 0.000 


F-ratio for blocks may not provide reliable information about block effects. For more discus- 
sion see Montgomery (2001, Chapter 4). 


13-4.2 Multiple Comparisons 


When the ANOVA indicates that a difference exists between the treatment means, we may 
need to perform some follow-up tests to isolate the specific differences. Any multiple com- 
parison method, such as Fisher’s LSD method, could be used for this purpose. 

We will illustrate Fisher’s LSD method. The four chemical type averages from 
Example 13-5 are: 


Each treatment average uses b = 5 observations (one from each block). We will use a = 
0.05, so 1555515 = 2.179. Therefore the value of the LSD is 


2M5; 2(0.08) 
LSD = 10.025,12 B = 2.179 "5 — = 0.39 


Any pair of treatment averages that differ by 0.39 or more indicates that this pair of treatment 
means is significantly different. The comparisons are shown below: 


4 vs. 1 = 74. — y. = 3.56 — 1.14 = 2.42 > 0.39 
4 vs. 3 = Jy. — Jy. = 3.56 — 1.38 = 2.18 > 0.39 
4 vs. 2 = ya. — Jo. = 3.56 — 1.76 = 1.80 > 0.39 
2 vs. 1 = y. — yı. = 1.76 — 1.14 = 0.62 > 0.39 
2vs.3 = Jo. — X. = 1.76 — 1.38 = 0.38 < 0.39 
3 vs. 1 = Js. — yy. = 1.38 — 1.14 = 0.24 < 0.39 


| 
= 
| 
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Chemical type 


HE ud 


2 3 4 5 6 





Figure 13-9 Results of Fisher’s LSD method. 


Figure 13-9 presents the results graphically. The underlined pairs of means are not different. 
The LSD procedure indicates that chemical type 4 results in significantly different strengths 
than the other three types do. Chemical types 2 and 3 do not differ, and types 1 and 3 do not 
differ. There may be a small difference in strength between types 1 and 2. 


13-4.3 Residual Analysis and Model Checking 


In any designed experiment, it is always important to examine the residuals and to check for 
violation of basic assumptions that could invalidate the results. As usual, the residuals for the 
randomized complete block design are just the difference between the observed and estimated 
(or fitted) values from the statistical model, say, 


and the fitted values are 


The fitted value represents the estimate of the mean response when the ith treatment is run in 
the jth block. The residuals from the chemical type experiment are shown in Table 13-15. 

Figures 13-10, 13-11, 13-12, and 13-13 present the important residual plots for the ex- 
periment. These residual plots are usually constructed by computer software packages. There 
is some indication that fabric sample (block) 3 has greater variability in strength when treated 
with the four chemicals than the other samples. Chemical type 4, which provides the greatest 
strength, also has somewhat more variability in strength. Followup experiments may be nec- 
essary to confirm these findings, if they are potentially important. 


13-4.4 Randomized Complete Block Design with Random Factors 
(CD Only) 


Table 13-15 Residuals from the Randomized Complete Block Design 


Fabric Sample 


Chemical M a M OP 
Type 1 2 3 4 5 

1 —0.18 —0.10 0.44 —0.18 0.02 

2 0.10 0.08 —0.28 0.00 0.10 

3 0.08 — 0.24 0.30 —0.12 — 0.02 

4 0.00 0.28 —0.48 0.30 —0.10 





Figure 13-10 Normal probability plot of 
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Figure 13-11  Residuals by treatment. 
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Figure 13-12  Residuals by block. 


EXERCISES FOR SECTION 13-4 


13-25. 


In *The Effect of Nozzle Design on the Stability and 


Performance of Turbulent Water Jets" (Fire Safety Journal, 
Vol. 4, August 1981), C. Theobald describes an experiment in 
which a shape measurement was determined for several differ- 
ent nozzle types at different levels of jet efflux velocity. 
Interest in this experiment focuses primarily on nozzle type, 
and velocity is a nuisance factor. The data are as follows: 


Nims Jet Efflux Velocity (m/s) 
Type 11.73 14.37 16.59 20.43 23.46 28.74 
l 0.78 0.80 0.81 0.75 | 0.77 0.78 
2 0.85 0.85 0.92 0.86 0.81 0.83 
3 0.93 0.92 0.95 0.89 0.89 0.83 
4 1.14 0.97 0.98 0.88 0.86 0.83 
5 0.97 0.86 0.78 0.76 0.76 0.75 


0.5 
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e e e 
eij " e e * [o] i 
0 z’ " | Vij 
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ee e 
eo 
" e 
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—0.5 


Figure 13-13 Residuals versus j;;. 


(a) Does nozzle type affect shape measurement? Compare the 
nozzles with box plots and the analysis of variance. 

(b) Use Fisher’s LSD method to determine specific differ- 
ences between the nozzles. Does a graph of the average 
(or standard deviation) of the shape measurements versus 
nozzle type assist with the conclusions? 

(c) Analyze the residuals from this experiment. 


13-26. In Design and Analysis of Experiments, 5th edition 
(John Wiley & Sons, 2001), D. C. Montgomery describes an 


experiment that determined the effect of four different types e 


of tips in a hardness tester on the observed hardness of a 
metal alloy. Four specimens of the alloy were obtained, and 
each tip was tested once on each specimen, producing the 
following data: 


(a) Is there any difference in hardness measurements between 
the tips? 
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Type of Specimen 
Tip 1 2 5 4 
l 9.3 9.4 9.6 10.0 
2 9.4 9.3 9.8 9.9 
3 92 9.4 9.5 9 
4 9.7 9.6 10.0 10.2 


(b) Use Fisher's LSD method to investigate specific differ- 
ences between the tips. 

(c) Analyze the residuals from this experiment. 

13-27. An article in the American Industrial Hygiene 

Association Journal (Vol. 37, 1976, pp. 418—422) describes 

a field test for detecting the presence of arsenic in urine sam- 

ples. The test has been proposed for use among forestry 


ax Workers because of the increasing use of organic arsenics in 
c— that industry. The experiment compared the test as per- 


formed by both a trainee and an experienced trainer to an 
analysis at a remote laboratory. Four subjects were selected 
for testing and are considered as blocks. The response vari- 
able is arsenic content (in ppm) in the subject’s urine. The 
data are as follows: 


Subject 
Test 1 2 3 4 
Trainee 0.05 0.05 0.04 0.15 
Trainer 0.05 0.05 0.04 0.17 
Lab 0.04 0.04 0.03 0.10 


(a) Is there any difference in the arsenic test procedure? 

(b) Analyze the residuals from this experiment. 

13-28. An article in the Food Technology Journal (Vol. 10, 
1956, pp. 39—42) describes a study on the protopectin content 


em of tomatoes during storage. Four storage times were selected, 
<3 and samples from nine lots of tomatoes were analyzed. The 


protopectin content (expressed as hydrochloric acid soluble 
fraction mg/kg) is in the following table. 


Storage Lot 
Time jl D 2 4 
0 days 1694.0 989.0 917.3 346.1 
7 days 1802.0 1074.0 278.8 1375.0 
14 days 1568.0 646.2 1820.0 1150.0 
21 days 415.5 845.4 377.6 279.4 
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(a) The researchers in this study hypothesized that mean pro- 
topectin content would be different at different storage 
times. Can you confirm this hypothesis with a statistical 
test using a = 0.05? 

(b) Find the P-value for the test in part (a). 

(c) Which specific storage times are different? Would you 
agree with the statement that protopectin content de- 
creases as storage time increases? 

(d) Analyze the residuals from this experiment. 


13-29. An experiment was conducted to investigate leak- 
ing current ina SOS MOSFETS device. The purpose of the 
experiment was to investigate how leakage current varies as 
the channel length changes. Four channel lengths were se- 
lected. For each channel length, five different widths were 
also used, and width is to be considered a nuisance factor. 
The data are as follows: 


Channel Width 

Length l 2 3 4 5 
l 0.7 0.8 0.8 0.9 1.0 
2 0.8 0.8 0.9 0.9 1.0 
3 0.9 1.0 1.7 2.0 4.0 
4 1.0 1.5 2.0 3.0 20.0 


(a) Test the hypothesis that mean leakage voltage does not 
depend on the channel length, using a = 0.05. 

(b) Analyze the residuals from this experiment. Comment on 
the residual plots. 


13-30. Consider the leakage voltage experiment described 
in Exercise 13-29. The observed leakage voltage for channel 
length 4 and width 5 was erroneously recorded. The correct 
observation is 4.0. Analyze the corrected data from this exper- 
iment. Is there evidence to conclude that mean leakage voltage 
increases with channel length? 


Supplemental Exercises 


13-31. An article in the ZEEE Transactions on 
Components, Hybrids, and Manufacturing Technology (Vol. 
15, No. 2, 1992, pp. 146-153) describes an experiment in 
which the contact resistance of a brake-only relay was studied 


5 6 7i 8 9 
1260.0 965.6 1123.0 1106.0 1116.0 
544.0 672.2 818.0 406.8 461.6 
983.7 395.3 422.3 420.0 409.5 
447.8 ZU 394.1 356.4 391,2 


for three different materials (all were silver-based alloys). 
The data are as follows. 


Alloy Contact Resistance 
l 95 97 99 98 99 
99 99 94 95 98 
2 104 102 102 105 99 
102 111 103 100 103 
3 119 130 132 136 141 


172 145 150 144 135 


(a) Does the type of alloy affect mean contact resistance? Use 
a = 0.01. 

(b) Use Fisher's LSD method to determine which means differ. 

(c) Find a 99% confidence interval on the mean contact 
resistance for alloy 3. 

(d) Analyze the residuals for this experiment. 

13-32. An article in Lubrication Engineering (December 

1990) describes the results of an experiment designed to 

investigate the effects of carbon material properties on 


em the progression of blisters on carbon face seals. The carbon 
—7 face seals are used extensively in equipment such as air 


turbine starters. Five different carbon materials were tested, 
and the surface roughness was measured. The data are as 
follows: 


Carbon 

Material 

Type Surface Roughness 

EC10 0.50 055 0.55 0.36 

ECI10A 0.31 0.07 025 0.18 0.56 0.20 
EC4 0.20 0.28 0.12 

ECI 0.10 0.16 


(a) Does carbon material type have an effect on mean surface 
roughness? Use a = 0.05. 

(b) Find the residuals for this experiment. Does a normal 
probability plot of the residuals indicate any problem with 
the normality assumption? 

(c) Plot the residuals versus y;. Comment on the plot. 

(d) Find a 95% confidence interval on the difference between 
the mean surface roughness between the EC10 and the 
ECI carbon grades. 


13-33. Apply the Fisher LSD method to the experiment in 


mm Exercise 13-32. Summarize your conclusions regarding the 
S effect of material type on surface roughness. 


13-34. An article in the Journal of Quality Technology 
(Vol. 14, No. 2, 1982, pp. 80—89) describes an experiment in 
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which three different methods of preparing fish are evalu- 
ated on the basis of sensory criteria and a quality score is 
assigned. Assume that these methods have been randomly 
selected from a large population of preparation methods. 
The data are in the following table: 


Method Score 
l 24.4 23.2 25.0 19.7 
222 24.4 23.8 18.0 
2 Z2 19.5 17.3 19.7 
22.3 23.2 21.4 22.6 
3 23.3 22.8 22.4 2591 
20.4 23.5 20.8 24.1 


(a) Is there any difference in preparation methods? Use 
a = 0.05. 

(b) Calculate the P-value for the F-statistic in part (a). 

(c) Analyze the residuals from this experiment and comment 
on model adequacy. 

(d) Estimate the components of variance. 


13-35. An article in the Journal of Agricultural 
Engineering Research (Vol. 52, 1992, pp. 53—76) describes 
an experiment to investigate the effect of drying temperature 
of wheat grain on the baking quality of bread. Three temper- 
ature levels were used, and the response variable measured 
was the volume of the loaf of bread produced. The data are 
as follows: 


Temperature (°C) Volume (CC) 
70.0 1245 1235 1285 1245 1235 
75.0 1235 1240 1200 1220 1210 
80.0 1225 1200 1170 1155 1095 


(a) Does drying temperature affect mean bread volume? Use 
a = 0.01. 

(b) Find the P-value for this test. 

(c) Use the Fisher's LSD method to determine which means 
are different. 

(d) Analyze the residuals from this experiment and comment 
on model adequacy. 


13-36. An article in Agricultural Engineering (December 
1964, pp. 672-673) describes an experiment in which 
the daily weight gain of swine is evaluated at different 
levels of housing temperature. The mean weight of each 
group of swine at the start of the experiment 1s considered to 
be a nuisance factor. The data from this experiment are as 
follows: 
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Nenn Housing Air Temperatures 


Weight (degrees F) 

(lbs) 50 60 70 80 90 100 
100 1.37 1.58 2.00 1.97 1.40 0.39 
150 1.47 Le 2.16 1.82 1.14 "D. T9 
200 1.19 1.91 222 1.67 0.88 —0.77 


(a) Does housing air temperature affect mean weight gain? 
Use a = 0.05. 

(b) Use Fisher’s LSD method to determine which tempera- 
ture levels are different. 

(c) Analyze the residuals from this experiment and comment 
on model adequacy. 


13-37. An article in Communications of the ACM (Vol. 30, 
No. 5, 1987) studied different algorithms for estimating 
software development costs. Six algorithms were applied to 
eight software development projects and the percent error in 
estimating the development cost was observed. The data are in 
the table at the bottom of the page. 


Algorithm l 2 

I(SLIM) 1244 21 
2(COCOMO-A) 281 129 
3(COCOMO-R) 220 84 
4(COCOMO-C) 229 83 
S(FUNCTION POINTS) 19 11 
6(ESTIMALS) =20 35 


CHAPTER 13 DESIGN AND ANALYSIS OF SINGLE-FACTOR EXPERIMENTS: THE ANALYSIS OF VARIANCE 


(a) Do the algorithms differ in their mean cost estimation 
accuracy? Use a = 0.05. 

(b) Analyze the residuals from this experiment. 

(c) Which algorithm would you recommend for use in 
practice? 

13-38. Consider an ANOVA situation with a = 4 means 

u; = 1, po = 5, u, = 8, and py = 4. Suppose that o? = 4, 

n = 4,anda = 0.05. 

(a) Find the power of the ANOVA F-test. 

(b) How large would the sample size have to be if we want the 
power of the F-test for detecting this difference in means 
to be at least 0.90? 


13-39. Consider an ANOVA situation with a = 5 treat- 

ments. Let o? = 9 and a = 0.05, and suppose that n = 4. 

(a) Find the power of the ANOVA F-test when p, = p; 
W, = 1, wy = 3, and p; = 2. 

(b) What sample size is required if we want the power of the 
F-test in this situation to be at least 0.90? 


Project 

- 5 6 7 8 
2221 905 839 32 122 
1306 336 910 473 199 

543 300 794 488 142 

352 291 826 509 153 

121 15 103 87 -—IJ 

170 104 199 142 41 
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MIND-EXPANDING EXERCISES 


13-40. Show that in the fixed-effects model analysis 
of variance E(MS,) = o°. How would your develop- 
ment change if the random-effects model had been 
specified? 

13-41. Consider testing the equality of the means of 
two normal populations where the variances are 
unknown but are assumed equal. The appropriate test 
procedure is the two-sample t-test. Show that the two- 
sample t-test is equivalent to the single-factor analysis of 
variance F-test. 


13-42. Consider the ANOVA with a = 2 treatments. 
Show that the MS; in this analysis is equal to the 
pooled variance estimate used in the two-sample 
t-test. 


13-43. Show that the variance of the linear combina- 
tion 

D c;Y; is o? > no. 

=I S] 
13-44. Ina fixed-effects model, suppose that there are 


n observations for each of four treatments. Let Of, Q5, 
and Q5 be single-degree-of-freedom sums of squares for 
the orthogonal contrasts. Prove that SSteatments = 0 + 
Q; + Qs. 

13-45. Consider the single-factor completely ran- 
domized design with a treatments and 7 replicates. 
Show that if the difference between any two treatment 
means is as large as D, the minimum value that the OC 
curve parameter Q? can take on is 





To nD? 


2ao? 


13-46. Consider the single-factor completely ran- 
domized design. Show that a 100(1 — «) percent confi- 
dence interval for o? is 


(N — aMSg " (N — a)MS; 


2 Æ 


D 2 
Xa/2,N-a X1—a/2,N—a 

where N is the total number of observations in the 
experimental design. 


13-47. | Consider the random-effect model for the 
single-factor completely randomized design. Show that 


a 100(1 — a)% confidence interval on the ratio of vari- 
ance components 02/6? is given by 


ve 
lL ZÆ —5 = U 
o 
where 
_! Ase T (——) - 1 
H MS; Jacq 
and 











= l E ( l )- 1 
n MSg fi QU2 ql Ng 


13-48. Consider a random-effects model for the 
single-factor completely randomized design. Show that 
a 100(1 — o)96 confidence interval on the ratio o?/ 
(c? + a?) is 


ib, o? U 
— < ——__< 
1-L oto 14+0U 





where L and U are as defined in Exercise 13-47. 


13-49. Continuation of Exercise 13-48. Use the 
results of Exercise 13-48 to find a 100(1 — a)% confi- 
dence interval for o?/(o? + o?). 


13-50. Consider the fixed-effect model of the com- 
pletely randomized single-factor design. The model 
parameters are restricted by the constraint 9; ., t; = 0. 
(Actually, other restrictions could be used, but this one is 
simple and results in intuitively pleasing estimates for 
the model parameters.) For the case of unequal sample 
size Ny, Ny, ... , Na the restriction is 9; , nr; = 0. Use 
this to show that 


y ae nenE) 2 o? p sep 


Does this suggest that the null hypothesis in this model 
is Ho: nj, = MoT, = +++ = NTa = 0? 

13-51. Sample Size Determination. In the single- 
factor completely randomized design, the accuracy of a 
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100(1 — a)% confidence interval on the difference in (b) Suppose that in comparing a = 5 means we have a 


any two treatment means 18 ty/7 «4 — 1 V 2ZMSg/n. preliminary estimate of o? of 4. If we want the 95% 
(a) Show that if A is the desired accuracy of the interval, confidence interval on the difference in means to 
the sample size required is have an accuracy of 2, how many replicates should 

we use? 


E ia OR 


n 4 2 


IMPORTANT TERMS AND CONCEPTS 


In the E-book, click on any Fisher's least significant Randomized complete CD MATERIAL 
term or concept below to difference method block design Graphical comparison 
go to that subject. Fixed factor Residual analysis and of means 
Analysis of variance Multiple comparisons model adequacy Orthogonal contrasts 
Blocking Nuisance factors checking Tukey's test 
Complete randomized Random factor Sample size and replica- 
experiment Bandomieatión tion in an experiment 


Expected mean squares Variance component 


13-2.4 More About Multiple Comparisons 


As noted in the previous section, there are many ways to investigate the treatment means 
following rejection of the null hypothesis with an ANOVA. The Fisher LSD method is easy 
and very widely used. It is consider to be a very “liberal” procedure in that although each 
test 1s at significance level o, the type I error for the entire set of comparisons (called the 
experimentwise error rate) is much greater than a. In this section we briefly describe three 
other approaches. 


Graphical Comparison of Means 

It is easy to compare treatment means graphically, following the analysis of variance. Suppose 
that the factor has a levels and that y,., y>.,..., Yq. are the observed averages for these factor 
levels. Each treatment average has standard deviation o/ Vn, where ø is the standard devia- 
tion of an individual observation. If all treatment means are equal, the observed means y;. 
would behave as if they were a set of observations drawn at random from a normal distribu- 
tion with mean p. and standard deviation o/ Vn. 

Visualize this normal distribution capable of being slid along an axis below which the 
treatment means y4., y>.,..., Y,- are plotted. If all treatment means are equal, there should 
be some position for this distribution that makes it obvious that the y;. values were drawn 
from the same distribution. If this is not the case, the y;. values that do not appear to have 
been drawn from this distribution are associated with treatments that produce different 
mean responses. 

The only flaw in this logic is that c is unknown. However, we can use V MS; from 
the analysis of variance to estimate o. This implies that a f-distribution should be used 
instead of the normal in making the plot, but since the ¢ looks so much like the normal, 
sketching a normal curve that is approximately 6 V MS;/n. units wide will usually work 
very well. 

Figure 813-1 shows this arrangement for the hardwood concentration experiment in 
Example 13-1. The standard deviation of this normal distribution is 


V MSg/n = V6.51/6 = 1.04 


If we visualize sliding this distribution along the horizontal axis, we note that there is no lo- 
cation for the distribution that would suggest that all four observations (the plotted means) are 
typical, randomly selected values from that distribution. This, of course, should be expected, 
because the analysis of variance has indicated that the means differ, and the display in 
Fig. S13-1 is just a graphical representation of the analysis of variance results. The figure does 


Gl n = 1.04 


1 2 d 4 
e o e 

O 5 10 15 20 25 30 
Figure S13-1 Tensile strength averages from the hardwood concentration 
experiment in relation to a normal distribution with standard deviation 


VMSg/n = V651/6 = 1.04. 


13-1 


13-2 


indicate that treatment 4 (20% hardwood) produces paper with higher mean tensile strength 
than do the other treatments, and treatment 1 (596 hardwood) results in lower mean tensile 
strength than do the other treatments. The means of treatments 2 and 3 (10 and 15% hard- 
wood, respectively) do not differ. 

This simple procedure is a rough but very effective multiple comparison technique. We 
now briefly describe two other procedures: orthogonal contrasts and Tukey's method. 


Orthogonal Contrasts 

Many multiple comparison procedures use the idea of a contrast. Consider the hardwood con- 
centration experiment presented in Example 13-1. Since the hypothesis Hp: 7, = 7; = 74 = 
14 = 0 was rejected, we know that some hardwood concentrations produce different tensile 
strengths than others, but which ones actually cause this difference? At the outset of the 
experiment, we might suspect that hardwood concentrations 3 and 4 produce the same tensile 
strength. This implies that we would like to test the hypothesis 


Ho: p3 = p4 
Hi: p3 T M4 


This hypothesis could be tested by using a linear combination of treatment totals, say, 
V3. — Y4 


If we had suspected that the average of hardwood concentrations 1 and 3 did not differ from 
the average of hardwood concentrations 2 and 4, the hypothesis would have been 


Ho: pı + pa = p2 + p4 
Hi: pi + pa T go + ga 


which implies using the linear combination of treatment totals 
Vi. E Ys — Y2 — Y4 


In general, the comparison of treatment means of interest will imply a linear combination 
of treatment totals such as 


a 
C = 2, c9» 
i= 


with the restriction that $7, c; = 0. These linear combinations are called contrasts. The sum 
of squares for any contrast is 





EXAMPLE $13-1 


13-3 


and has a single degree of freedom. If the design is unbalanced, the comparison of treatment 
means requires that $7. , n;c; = 0, and Equation S13-1 becomes 


BS E (S13-2) 
> nc? 
[5l 
A contrast is tested by comparing its sum of squares to the mean square error. The resulting 
statistic is distributed as F, with 1 and N — a degrees of freedom. 


A very important special case of the above procedure is that of orthogonal contrasts. 
Two contrasts with coefficients {c;} and {d;} are orthogonal if 


> cid; = 0 
i-l 


or for an unbalanced design if 


=1 


M. 


For a treatments a set of a — 1 orthogonal contrasts will partition the sum of squares due to 
treatments into a — 1 independent single-degree-of-freedom sums of squares. Thus, tests 
performed on orthogonal contrasts are independent. 

There are many ways to choose the orthogonal contrast coefficients for a set of treat- 
ments. Usually, something in the context of the experiment should suggest which comparisons 
will be of interest. For example, if there are a = 3 treatments, with treatment 1 a control and 
treatments 2 and 3 actual levels of the factor of interest to the experimenter, appropriate 
orthogonal contrasts might be as follows: 


Hg: —-29, + w + p = O 
Ho: M2 — p; = 0 


Note that contrast 1 with c; = —2, 1, 1 compares the average effect of the factor with the con- 
trol, while contrast 2 with d; = 0, 1, — 1 compares the two levels of the factor of interest. 
Contrast coefficients must be chosen prior to running the experiment, because if these 
comparisons are selected after examining the data, most experimenters would construct tests that 
compare large observed differences in means. These large differences could be due to the presence 
of real effects, or they could be due to random error. If experimenters always pick the largest dif- 
ferences to compare, they will inflate the type I error of the test, since it is likely that in an unusu- 
ally high percentage of the comparisons selected, the observed differences will be due to error. 


Consider the hardwood concentration experiment. There are four levels of hardwood concen- 
tration, and possible sets of comparisons between these means and the associated orthogonal 
comparisons are 


Hg: by + Ma = Mot Ma C= Yr — Y» T Y + y4 
Ho: py + Ma = Mas + p4 d= —yi. = ya. + ys. + ya. 
Ho: ja + Ma = Mo + p4 zip Role LI 


® 
| 
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Table S13-1 Analysis of Variance for the Tensile Strength Data 


Sum of Degrees of Mean 
Source of Variation Squares Freedom Square po 
Hardwood concentration 382.79 3 127.60 19.61 
c (1, 4 vs. 2, 3) 3.38 l 3.38 0.52 
d (1, 2 vs. 3, 4) 234.38 1 234.38 36.00 
e (1, 3 vs. 2, 4) 145.04 l 145.04 22:20 
Error 130.17 20 6.51 
Total 512.96 23 


Notice that the contrast constants are orthogonal. Using the data from Table S13-1, we find the 
numerical values of the contrasts and the sums of squares as follows: 


—9 2 

c = 60 — 94 — 102 + 127 = —9 ss, = $2) = 338 
6(4) 
(75)° 

d = —60 — 94 + 102 + 127 = 75 $$, = Tow = 234.38 
6(4) 
(59)° 

e = —60 + 94 — 102 + 127 = 59 SS, = 6(4) = 145.04 


These contrast sums of squares completely partition the treatment sum of squares; that is, 
SStreatments = 99, + SS, + SS.. These tests on the contrasts are usually incorporated in the 
analysis of variance, such as is shown in Table S13-1. From this analysis, we conclude that 
there are significant differences between hardwood concentration 3 and 4, and 1 and 2, but 
that the average of 1 and 4 does not differ from the average of 2 and 3. Also, the average of 1 
and 3 differs from the average of 2 and 4. 


Tukey's Method 
The Tukey procedure for comparing pairs of means makes use of the studentized range 
statistic 


m Ymax — Y min 
: 7 VMS;,/n 


where Y,,,, and Y, are the largest and smallest sample means, respectively, out of a group 
of p sample means. For equal sample sizes, the Tukey procedure would indicate that the 
two means w; and p; are different if the absolute value of the observed difference | y; — y| 


exceeds 
| MSg 
do = g(a, f) n 


where g,(a, f ) is the upper a percentage point of the studentized range statistic, a is the num- 
ber of treatments, and f is the number of even degrees of freedom. Tables of g,(a, f ) are 
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Table $13-2 Minitab Output Illustrating Tukey's Method 
One-way ANOVA: Tensile Str versus Conc 


Analysis of Variance for Tensile 


Source DF SS MS P 
Conc 3 382.79 127.60 0.000 
Error 20 130.17 6.51 
Total 23 512.96 
Individual 95% CIs For Mean 
Based on Pooled StDev 
Level N Mean StDev = 1725 c E "bienes uite a an Ta 
5 6 10.000 2.828 (ses Ama] 
10 6 15.667 2.805 (astaan) 
15 6 17.000 1.789 C. 
20 6 21.167 2.639 (Tn) 
Sea a ale 
Pooled StDev = 2.551 10.0 15.0 20.0 25.0 


Tukey’s pairwise comparisons 


= 0.0500 
Individual error rate = 0.0111 


Family error rate 


Critical value = 3.96 


Intervals for (column level mean) — (row level mean) 


5 10 15 
10 —9.191 
— 1.542 
15 —]1.124 —5.458 
—2.876 2191 
20 —I5.291 — —9.624 =8.291 
—7.042 —1.376 —0.042 


widely available; for example, see Montgomery (2001). Equivalently, we could construct a set 
of 100(1 — a)% confidence intervals for all pairs of mean using 


Vn ec Es gu py. y. 


The Tukey confidence intervals are a set of simultaneous confidence intervals that hold with prob- 
ability 1 — o. Tukey's method is a very conservative procedure relative to Fisher's LSD because 
it requires a larger observed difference in treatment averages to declair the pair of means different. 

Minitab implements the Tukey procedure and reports the results in terms of the confi- 
dence interval. Table S13-2 is the Minitab output for the hardwood concentration experiment 
of Example S13-1. Notice that, like Fisher’s LSD, Tukey's method indicates that all pairs of 
means are different except at 10% and 15% concentrations. 
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13-2.7 Technical Details about the Analysis of Variance (CD Only) 


Derivation of the ANOVA Identity 
The proof of the fundamental ANOVA identity in Equation 13-5 is straightforward. Note that 
we may write 


(vy — Y = [os Y) + Oy —»9T 
i=] j=l i=] j=l 
or 
a n a a n 
> Dye) HH D> Ox) (vy — F) 
= ur i-l i=] j=1 
a n 
+ 2 (Yi — y-Joy — Pp) 


Note that the cross-product term in the previous equation is zero, since 


n 


>, Co =n — JH = å 0 
j= 


Therefore, we have shown that Equation 13-5 is correct. 


Expected Mean Squares 
In the text we state that 





E(MS treatments) = o? + 
We can prove this directly by apply the expected value operator. Since 


_ D Treatments 
MS reatments z a 


we will initially work with the treatment sum of squares. Now 


ES Sqecanenis) =p E > (Y;. n Ey 
i=l 


and from the model Y; = u + 7; + e; we have 
Y. = u +T +E. 
and 
AE |. + €.. 


since Y. 7; = 0. Substituting for Y;. and Y.. in the expression for SStreatments yields 


a 


E(SSTrentments) =£ n > (7; T ej — E 
i-l 


a a a a a 
-E E p P Foe Se MC 
i-l i-l i-l i-l i=] 
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However, since the e;'s are independent random variables with mean zero and variance o’, we 
find that 


Therefore 


As a result, 


SS reatmen 
E( MStreatmenis) — E a 





a—i 

x a— 1 E (SS Treatments) 
nX T 

= g? 4 i=l 
a=] 


Now if the null hypothesis of equal treatment means is true, each 7; is equal to zero and 


E (Sons = g? 
a= 


If the alternative hypothesis is true, 


SS: = 
e( tara ) = g? 4 i=] 


S 





a>] 


A similar approach will show that 
E(SSz) = a(n — 1)o? 


so that 


O.C. Curves 

In the text, we give O.C. curves for the fixed effects ANOVA for the case of a = 4 and 
a = 5 treatments. A collection of additional curves for a = 2, 3, 6, 7, 8, and 9 are on pages 
13-8 through 13-10. In using the curves, remember that v, = the number of numerator 
degrees of freedom and v, = the number of denominator degrees of freedom. The sample 
size calculation routine in Minitab will also determine sample sizes for the single-factor 
ANOVA. 
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Source: These curves are adapted with permission from Biometrika Tables for Statisticians, Vol. 
2, by E. S. Pearson and H. O. Hartley, Cambridge University Press, Cambridge, 1972. 
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13-3.3 Determining Sample Size in the Random Model (CD Only) 


The power of the test for the random-effects model is 


1 — B = P{Reject Ho| Ho is false} 
= PIF ae ET E > 0} 


It can be shown that if H, is true (o? > 0) the power can be computed using the central F 
distribution, with a — 1 and a(n — 1) degrees of freedom. In fact, the ratio 


MO Yreainietits I (0° T no?) 
MSg/oa^ 


has the F-distribution with a — 1 and a(n — 1) degrees of freedom. Then, 


MS Treatments p 
Le p =P | Signe 7 Jaa | O = 0} 


= Pg cesta | 
MSg(1 + no2/o*) (1 + no2/o?) 


Josue 
= P E E > (1 + nol/o?) (813-3) 


This probability statement may be easily evaluated using certain hand-held calculators, or it 
may be evaluated using tables of the F-distribution. 


EXAMPLE $13-2 | Consider a completely randomized design with five treatments selected at random and six 
observations per treatment. Ifa = 0.05, what is the power of the test if o2 = o°? 
From Equation $13-3, we have the power as 


/0.05,425 
MI 


since if o2 = o^ the ratio o2/o^ = 1. Now foos45s = 2.76, so 


= P{ F425 = 0.39} = 0.81 


This probability was evaluated using a calculator that provided F-distribution probabilities. 
Since the power of the test is 0.81, this implies that the null hypothesis Hj: o2 = 0 will be 
rejected with probability 0.81 in this experimental situation. 


It is also possible to evaluate the power of the test using the operating characteristic 
curves on page 13-12 through 13-15. These curves plot the probability of the type II error B 
against A, where 





A=,/1+— 5 (S13-4) 
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Source: Reproduced with permission from Engineering Statistics, 2nd edition, by A. H. Bowker and G. J. 
Lieberman, Prentice-Hall, Englewood Cliffs, N.J., 1972. 
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EXAMPLE $13-3 


In the randomized block design, replace n by b, the number of blocks. Since o? is usually un- 
known, we may either use a prior estimate or define the value of o? that we are interested in 
detecting in terms of the ratio 02/0”. 


Consider the situation described in Example S13-2. Since o = 0.05, a = 5, n = 6, and o? = 
o°, we may find à from Equation S13-4 as 


A = VI + 6(1) = 2.646 


From the operating characteristic curve with v; = a — 1 = 4, v; = a(n — 1) = 25 degrees 
of freedom and a = 0.05, we find that 


B = 0.20 


Therefore, the power is approximately 0.80. This agrees with the results obtained in 
Example S13-2. 


13-4.4 Randomized Complete Block Design with Random Factors (CD Only) 


In the preceding sections, we have assumed that the treatments and blocks are fixed factors. In 
many randomized complete block designs, these assumptions may be too restrictive. For 
example, in the chemical type experiment, Example 13-5, we might like to view the fabric 
samples as a random sample of material to which the chemicals may be applied so that the 
conclusions from the experiment will extend to the entire population of material. 

It turns out that, if either treatments or blocks (or both) are random effects, the F-test in 
the analysis of variance is still formed as 


u MS yeatments 
9 = — 
MS; 


This can be shown by using the methods presented previously to evaluate the expected mean 
squares. If the treatments are random, the treatment effects 7; are considered to be normally 
and independently distributed random variables with mean zero and variance o2. The null 
hypothesis of zero treatment effects is 


Ho: o? — 0 
Hi: o? > 0 
When both treatments and blocks are random, the block effects B; are also assumed to be 


normally and independently distributed random variables with mean zero and variance Op. In 
this case the expected values of the mean squares for treatments, blocks, and error are 


E(MStyeatments) — o? + bo - 
E(MSpiocks) — o? zg aoe 
E(MSg) = o* 


The unbiased estimates of the variance components are 


ô? = MS; 

a2 c MSTreatments — MSz 

GE = — oo —— 
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RESPONSE SURFACE METHODS 
AND DESIGNS (CD ONLY) 


After careful study of this chapter, you should be able to do the following: 


1. Design and conduct engineering experiments involving several factors using the factorial 
design approach 


Ui A U N 


. Assess model adequacy with residual plots 


. Know how to analyze and interpret main effects and interactions 
. Understand how the ANOVA is used to analyze the data from these experiments 


. Know how to use the two-level series of factorial designs 
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6. Understand how two-level factorial designs can be run in blocks 


7. Design and conduct two-level fractional factorial designs 


CD MATERIAL 
8. Incorporate random factors in factorial experiments. 
9. Test for curvature in two-level factorial designs by using center points. 


10. Use response surface methodology for process optimization experiments. 


Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e- Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found in the 
Mind-Expanding Exercises at the end of the chapter. 


14-1 INTRODUCTION 


An experiment is just a test or series of tests. Experiments are performed in all engineering and 
scientific disciplines and are an important part of the way we learn about how systems and 
processes work. The validity of the conclusions that are drawn from an experiment depends to a 
large extent on how the experiment was conducted. Therefore, the design of the experiment 
plays a major role in the eventual solution of the problem that initially motivated the experiment. 

In this chapter we focus on experiments that include two or more factors that the experi- 
menter thinks may be important. The factorial experimental design will be introduced as a 
powerful technique for this type of problem. Generally, in a factorial experimental design, ex- 
perimental trials (or runs) are performed at all combinations of factor levels. For example, if 
a chemical engineer is interested in investigating the effects of reaction time and reaction tem- 
perature on the yield of a process, and if two levels of time (1 and 1.5 hours) and two levels of 
temperature (125 and 150°F) are considered important, a factorial experiment would consist 
of making experimental runs at each of the four possible combinations of these levels of reac- 
tion time and reaction temperature. 

Most of the statistical concepts introduced in Chapter 13 for single-factor experiments 
can be extended to the factorial experiments of this chapter. The analysis of variance 
(ANOVA), in particular, will continue to be used as one of the primary tools for statistical data 
analysis. We will also introduce several graphical methods that are useful in analyzing the data 
from designed experiments. 


14-2 SOME APPLICATIONS OF DESIGNED EXPERIMENTS 


(CD ONLY) 


14-3 FACTORIAL EXPERIMENTS 


When several factors are of interest in an experiment, a factorial experimental design should 
be used. As noted previously, in these experiments factors are varied together. 


By a factorial experiment we mean that in each complete trial or replicate of the 


experiment all possible combinations of the levels of the factors are investigated. 
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Thus, if there are two factors A and B with a levels of factor A and b levels of factor B, each 
replicate contains all ab treatment combinations. 

The effect of a factor is defined as the change in response produced by a change in the level 
of the factor. It is called a main effect because it refers to the primary factors in the study. For ex- 
ample, consider the data in Table 14-1. This is a factorial experiment with two factors, A and B, 
each at two levels (Aj, Anighs and Bi, Bhign). The main effect of factor A is the difference between 
the average response at the high level of A and the average response at the low level of A, or 


30-40 10*20 ,. 
= - = 


That is, changing factor A from the low level to the high level causes an average response in- 
crease of 20 units. Similarly, the main effect of B is 


= 20+40 10+ 30 _ 


10 
2 2 


In some experiments, the difference in response between the levels of one factor is not the 
same at all levels of the other factors. When this occurs, there is an interaction between the fac- 
tors. For example, consider the data in Table 14-2. At the low level of factor B, the A effect is 


A = 30 — 10 = 20 


and at the high level of factor B, the A effect is 


Since the effect of A depends on the level chosen for factor B, there is interaction between A and B. 
When an interaction is large, the corresponding main effects have very little practical 
meaning. For example, by using the data in Table 14-2, we find the main effect of A as 


30-0 10-420 , 


A= 
2 2 





and we would be tempted to conclude that there is no factor A effect. However, when we ex- 
amined the effects of A at different levels of factor B, we saw that this was not the case. The 
effect of factor A depends on the levels of factor B. Thus, knowledge of the AB interaction is 
more useful than knowledge of the main effect. A significant interaction can mask the signif- 
icance of main effects. Consequently, when interaction is present, the main effects of the 
factors involved in the interaction may not have much meaning. 


Table 14-1 A Factorial Experiment with Table 14-2 A Factorial Experiment with 


‘Two Factors Interaction 
Factor B Factor B 
Factor A Bi Jb ra Factor A Brow D rieh 
Aiow 10 20 Aiow 10 20 


M 30 40 dios 30 0 
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It is easy to estimate the interaction effect in factorial experiments such as those illus- 
trated in Tables 14-1 and 14-2. In this type of experiment, when both factors have two 
levels, the AB interaction effect is the difference in the diagonal averages. This represents 
one-half the difference between the 4 effects at the two levels of B. For example, in 
Table 14-1, we find the AB interaction effect to be 


20-30 10-40 , 


AB 
2 2 


Thus, there is no interaction between A and P. In Table 14-2, the AB interaction effect 1s 


_ 20+30 10+0 


= 20 
2 2 


AB 





As we noted before, the interaction effect in these data is very large. 

The concept of interaction can be illustrated graphically in several ways. Figure 14-1 
plots the data in Table 14-1 against the levels of A for both levels of B. Note that the B,,,, and 
Brigh lines are approximately parallel, indicating that factors A and B do not interact signifi- 
cantly. Figure 14-2 presents a similar plot for the data in Table 14-2. In this graph, the B,,,, and 
Brign lines are not parallel, indicating the interaction between factors A and B. Such graphical 
displays are called two-factor interaction plots. They are often useful in presenting the re- 
sults of experiments, and many computer software programs used for analyzing data from de- 
signed experiments will construct these graphs automatically. 

Figures 14-3 and 14-4 present another graphical illustration of the data from Tables 14-1 
and 14-2. In Fig. 14-3 we have shown a three-dimensional surface plot of the data from 
Table 14-1. These data contain no interaction, and the surface plot is a plane lying above the 
A-B space. The slope of the plane in the A and B directions is proportional to the main effects 
of factors A and B, respectively. Figure 14-4 is a surface plot of the data from Table 14-2. 
Notice that the effect of the interaction in these data 1s to "twist" the plane, so that there is 
curvature in the response function. Factorial experiments are the only way to discover 
interactions between variables. 

An alternative to the factorial design that is (unfortunately) used in practice is to change 
the factors one at a time rather than to vary them simultaneously. To illustrate this one-factor- 
at-a-time procedure, suppose that an engineer is interested in finding the values of temperature 
and pressure that maximize yield in a chemical process. Suppose that we fix temperature 
at 155°F (the current operating level) and perform five runs at different levels of time, say, 


50 50 
40 e Phigh 40 
B me po Biow 5 30 © Biow 
E i: ° et E 20 e 
6 10 e S 10 e 
0 O e Bhigh 
Aiow Ahigh Aw Ahigh 
Factor A Factor A 
Figure 14-1 Factorial experiment, no Figure 14-2 Factorial experiment, with 


interaction. interaction. 
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Figure 14-3  Three-dimensional surface plot of the data from Figure 14-4 Three-dimensional surface plot of the data from 
Table 14-1, showing main effects of the two factors A and B. Table 14-2 showing the effect of the A and B interaction. 





0.5, 1.0, 1.5, 2.0, and 2.5 hours. The results of this series of runs are shown in Fig. 14-5. This 
figure indicates that maximum yield 1s achieved at about 1.7 hours of reaction time. To opti- 
mize temperature, the engineer then fixes time at 1.7 hours (the apparent optimum) and per- 
forms five runs at different temperatures, say, 140, 150, 160, 170, and 180°F The results of this 
set of runs are plotted in Fig. 14-6. Maximum yield occurs at about 155°F. Therefore, we would 
conclude that running the process at 155°F and 1.7 hours is the best set of operating conditions, 
resulting in yields of around 75%. 

Figure 14-7 displays the contour plot of actual process yield as a function of temperature 
and time with the one-factor-at-a-time experiments superimposed on the contours. Clearly, 
this one-factor-at-a-time approach has failed dramatically here, as the true optimum is at least 
20 yield points higher and occurs at much lower reaction times and higher temperatures. The 
failure to discover the importance of the shorter reaction times is particularly important be- 
cause this could have significant impact on production volume or capacity, production plan- 
ning, manufacturing cost, and total productivity. 

The one-factor-at-a-time approach has failed here because it cannot detect the interac- 
tion between temperature and time. Factorial experiments are the only way to detect inter- 
actions. Furthermore, the one-factor-at-a-time method 1s inefficient. It will require more 


80 80 
e 

70 70 
© 60 $ 60 
> > e 

50 50 

e 
0.5 1.0 1.5 2.0 2.5 140 150 160 170 180 
Time (hr) Temperature (°F) 

Figure 14-5 Yield versus reaction time with Figure 14-6 Yield versus temperature with reaction 
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experimentation than a factorial, and as we have just seen, there is no assurance that it will 
produce the correct results. 


14-4 TWO-FACTOR FACTORIAL EXPERIMENTS 


The simplest type of factorial experiment involves only two factors, say, A and B. There are 
a levels of factor A and b levels of factor B. This two-factor factorial is shown in Table 14-3. 
The experiment has n replicates, and each replicate contains all ab treatment combinations. 


Table 14-3 Data Arrangement for a Two-Factor Factorial Design 


Factor B 
1 2 se b Totals Averages 
Yii Vii X121: Y1225 Vibl> Vip2» 
++» Vln i» Vi2n ++» Vibn Ve yi. 
X211: V212 221: Y2225 Y2b1» V2p2» 
Factor A i Y21n +9 Y22n "+s Y2bn Vae Jae 
Vall» Yal2» Ya21» Ya22» Yabl» Vab2» 
<-> Yarn ++ 5 Yazn ++» Yabn Ya- Ya- 
Totals Fj: Y.2. Ve: dus 
Averages y. ya. Vip: Ju 
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The observation in the ith cell for the Ath replicate is denoted by y;;. In performing the 
experiment, the abn observations would be run in random order. Thus, like the single- 
factor experiment studied in Chapter 13, the two-factor factorial 1s a completely random- 
ized design. 

The observations may be described by the linear statistical model 


Il. 
Vag fee a By ees = 12.25 (14-1) 
k —1,2,...,n 


where p is the overall mean effect, 7; is the effect of the ith level of factor A, B; is the ef- 
fect of the jth level of factor B, (TB); is the effect of the interaction between A and B, and 
E; is a random error component having a normal distribution with mean zero and variance 
o°. We are interested in testing the hypotheses of no main effect for factor A, no main effect 
for B, and no AB interaction effect. As with the single-factor experiments of Chapter 13, 
the analysis of variance (ANOVA) will be used to test these hypotheses. Since there are 
two factors in the experiment, the test procedure is sometimes called the two-way analysis 
of variance. 


14-4.1 Statistical Analysis of the Fixed-Effects Model 


Suppose that A and B are fixed factors. That is, the a levels of factor A and the b levels of fac- 
tor B are specifically chosen by the experimenter, and inferences are confined to these levels 
only. In this model, it is customary to define the effects 7;, Bj, and (TB); as deviations from the 
mean, so that Y; 7; = 0, > ab = 0, Z4 (TB); = 0, and X a(1B); = 0. 

The analysis of variance can be used to test hypotheses about the main factor effects of 
A and B and the AB interaction. To present the ANOVA, we will need some symbols, some of 
which are illustrated in Table 14-3. Let y,.. denote the total of the observations taken at the ith 
level of factor A; y.;. denote the total of the observations taken at the jth level of factor B; yy. 
denote the total of the observations in the ijth cell of Table 14-3; and y... denote the grand total 
of all the observations. Define y;.., y.;., y;j., and y... as the corresponding row, column, cell, 
and grand averages. That is, 
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The hypotheses that we will test are as follows: 


1. Hy, = m=" =7, =9 (no main effect of factor A) 
H: at least one 7; # 0 

2. Hy: Bi = B= = B, = 0 (no main effect of factor B) (14-2) 
Hi: at least one p; # 0 

3. Hy (0B) = (0B) = °°: = (TB) = 0 (no interaction) 


Hi: at least one (TB); # 0 


As before, the ANOVA tests these hypotheses by decomposing the total variability in the data 
into component parts and then comparing the various elements in this decomposition. Total 
variability is measured by the total sum of squares of the observations 


a b n 


SSp = X >) D> Oe IF 


i=] j=1 k=1 





and the sum of squares decomposition is defined below. 


The sum of squares identity for a two-factor ANOVA is 


a b n a 
x Qi — y = bn 2, (Tal = aah 


CD = = 








or symbolically, 


SST = SS 4 ar SS s EIE SS 4p lE SS 





Equations 14-3 and 14-4 state that the total sum of squares $.1s partitioned into a sum of squares 
for the row factor A (SS ,), a sum of squares for the column factor B (SS), a sum of squares for the 
interaction between A and B (SS ,5), and an error sum of squares (SS+). There are abn — 1 total 
degrees of freedom. The main effects 4 and B have a — 1 and b — 1 degrees of freedom, while 
the interaction effect AB has (a — 1)(b — 1) degrees of freedom. Within each of the ab cells in 
Table 14-3, there are n — 1 degrees of freedom between the n replicates, and observations in the 
same cell can differ only because of random error. Therefore, there are ab(n — 1) degrees of free- 
dom for error. Therefore, the degrees of freedom are partitioned according to 


abn — 1 = (a — 1) t (b — 1) * (a — 1)(6 — 1) + ab(n — 1) 


If we divide each of the sum of squares on the right-hand side of Equation 14-4 by the 
corresponding number of degrees of freedom, we obtain the mean squares for A, B, the 
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interaction, and error: 


MS, = 55, MS, = SSB MS — 994p MS _ SSE 
^ a-l B b-] 4B (a — 1)(b — 1) P — ab(n — 1) 








Assuming that factors A and B are fixed factors, it is not difficult to show that the expected 
values of these mean squares are 


bn & T? 


SS 2 SS 
E(MS,) -r(e F(MS;) -«( - es^ 


b 
an > B; 
j=l 





From examining these expected mean squares, it is clear that if the null hypotheses about main 
effects Ho: 7; = 0, Ho: B; = 0, and the interaction hypothesis Ho: (TB); = 0 are all true, all four 
mean squares are unbiased estimates of o”. 

To test that the row factor effects are all equal to zero (75: 7; = 0), we would use the ratio 





which has an F-distribution with a — 1 and ab(n — 1) degrees of freedom if Hp: 7; = 0 is true. 
This null hypothesis is rejected at the a level of significance if fo > foa—1abn—1): Similarly, to 
test the hypothesis that all the column factor effects are equal to zero (Hp: B; = 0), we would 
use the ratio 





which has an F-distribution with b — 1 and ab(n — 1) degrees of freedom if Hp: B; = 0 is true. 
This null hypothesis is rejected at the a level of significance if fy > fab-1,ab{n-1) Finally, to test 
the hypothesis Hp: (TB); = 0, which is the hypothesis that all interaction effects are zero, we use 
the ratio 
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which has an F-distribution with (a — 1)(6 — 1) and ab(n — 1) degrees of freedom if the null 
hypothesis Ho: (TB); = 0. This hypothesis is rejected at the a level of significance if 
oe? Jao 6D: 

It is usually best to conduct the test for interaction first and then to evaluate the main 
effects. If interaction is not significant, interpretation of the tests on the main effects is 
straightforward. However, as noted in Section 14-4, when interaction is significant, the main 
effects of the factors involved in the interaction may not have much practical interpretative 
value. Knowledge of the interaction is usually more important than knowledge about the 
main effects. 

Computational formulas for the sums of squares are easily obtained. 


Computing formulas for the sums of squares in a two-factor analysis of variance. 


SS (14-5) 





SS, (14-6) 


SS (14-7) 


SS 4p = (14-8) 


(14-9) 





The computations are usually displayed in an ANOVA table, such as Table 14-4. 


Aircraft primer paints are applied to aluminum surfaces by two methods: dipping and spray- 
ing. The purpose of the primer is to improve paint adhesion, and some parts can be primed 
using either application method. The process engineering group responsible for this operation 
is interested in learning whether three different primers differ in their adhesion properties. 


Table 14-4 ANOVA Table for a Two-Factor Factorial, Fixed-Effects Model 











Source of Sum of Degrees of 
Variation Squares Freedom Mean Square Fo 
A treatment SS 1 EE La 
reatments 4 a a | MS; 
B treatment SS b-1 MS, — 28 M3s 
reatments Bp Ba MS; 
SS 4p MS 4p 
Interaction SS Ap (a — Db — 1) MS 4p = (a — D(b — 1) MS, 
Error SS, ab(n — 1) 
SSg 
Total SS, abn — | MS; = 
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Table 14-5 Adhesion Force Data for Example 14-1 


Primer Type Dipping Spraying ie 
l 4.0, 4.5, 4.3 5.4, 4.9, 5.6 28.7 
2 5.6, 4.9, 5.4 5.8, 6.1, 6.3 34.1 
3 3.8, 3.7, 4.0 (13 0.9, 9.0, 5.0 (53 27.0 
Y 40.2 49.6 89.8 = y... 


A factorial experiment was performed to investigate the effect of paint primer type and ap- 
plication method on paint adhesion. For each combination of primer type and application 
method, three specimens were painted, then a finish paint was applied, and the adhesion 
force was measured. The data from the experiment are shown in Table 14-5. The circled 
numbers in the cells are the cell totals y;. The sums of squares required to perform the 
ANOVA are computed as follows: 




















a b n y? 
SS, = jene 
l i-l p k=1 abn 
89.8) 
= (4.0) + (4.5)? +--+ + (5.0 — f i = 10.72 
a 2 2 
yi y- 
SS — — 
‘ypes 2, bn abn 
 Q87y + (34.1) + (27.0)? (898) — m: 
7 6 18 
2 
EE PS 
methods m = an abn 
40.2)? + (49.6 (89.8% 
— (40.2) + (49.6) = ( ) — 4.9] 
9 18 
yy y 
SO interan — — b 57 no mE DIOS ^ SS methods 
j=1 j= aon 
(RIPO LS ae Cl ABD 5 
7 3 
(89.8) 
— — 4.58 — 4.91 = 0.24 
18 
and 
SSg = SSp — Dunes ^7 D methods a DO isiteracton 


— [0.72 — 4.58 — 4.91 — 0.24 — 0.99 


The ANOVA is summarized in Table 14-6. The experimenter has decided to use a = 0.05. 
Since fo 95515 = 3.89 and fos, 1» = 4.75, we conclude that the main effects of primer type and 
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Figure 14-8 Graph of 
average adhesion force 
versus primer types for 
both application 
methods. 


Table 14-6 ANOVA for Example 14-1 


Source of Sum of Degrees of Mean 

Variation Squares Freedom Square fo P-Value 
Primer types 4.58 2 2.29 28.63 2.7 X E-5 
Application methods 4.9] l 4.9] 61.38 5.0 X E-7 
Interaction 0.24 2 0.12 1.50 0.2621 
Error 0.99 12 0.08 
Total 10.72 17 


application method affect adhesion force. Furthermore, since 1.5 < fo.os.2,12, there is no 
indication of interaction between these factors. The last column of Table 14-6 shows the 
P-value for each F-ratio. Notice that the P-values for the two test statistics for the main effects 
are considerably less than 0.05, while the P-value for the test statistic for the interaction is 
greater than 0.05. 

A graph of the cell adhesion force averages 1 y;.) versus levels of primer type for each ap- 
plication method is shown in Fig. 14-8. The no-interaction conclusion is obvious in this graph, 
because the two lines are nearly parallel. Furthermore, since a large response indicates greater 
adhesion force, we conclude that spraying is the best application method and that primer 
type 2 is most effective. 


Tests on Individual Means 

When both factors are fixed, comparisons between the individual means of either factor may 
be made using any multiple comparison technique such as Fisher's LSD method (described in 
Chapter 13). When there is no interaction, these comparisons may be made using either the 
row averages y;.. or the column averages y;. However, when interaction is significant, com- 
parisons between the means of one factor (say, 4) may be obscured by the AB interaction. In 
this case, we could apply a procedure such as Fisher's LSD method to the means of factor A, 
with factor B set at a particular level. 


Minitab Output 

Table 14-7 shows some of the output from the Minitab analysis of variance procedure for the 
aircraft primer paint experiment in Example 14-1. The upper portion of the table gives factor 
name and level information, and the lower portion of the table presents the analysis of vari- 
ance for the adhesion force response. The results are identical to the manual calculations dis- 
played in Table 14-6 apart from rounding. 


7.0 


e. 
"m adi Spraying 
e o 
= 5.0 : 
Yi. 
Dipping 
e 


1 2 3 
Primer type 
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Table 14-7 Analysis of Variance From Minitab for Example 14-1 


ANOVA (Balanced Designs) 


Factor Type Levels Values 
Primer fixed 3 l 2 3 
Method fixed 2 Dip Spray 


Analysis of Variance for Adhesion 


Source DF SS MS F P 
Primer 2 4.5811 2.2906 27.86 0.000 
Method 1 4.9089 4.9089 59.70 0.000 
Primer *Method 2 0.2411 0.1206 1.47 0.269 
Error 12 0.9867 0.0822 

Total 17 10.7178 


14-4.2 Model Adequacy Checking 


Just as in the single-factor experiments discussed in Chapter 13, the residuals from a factorial 
experiment play an important role in assessing model adequacy. The residuals from a 
two-factor factorial are 


€ük — Yük -. Ji 


That is, the residuals are just the difference between the observations and the corresponding 
cell averages. 

Table 14-8 presents the residuals for the aircraft primer paint data in Example 14-1. The 
normal probability plot of these residuals is shown in Fig. 14-9. This plot has tails that do not 
fall exactly along a straight line passing through the center of the plot, indicating some poten- 
tial problems with the normality assumption, but the deviation from normality does not appear 
severe. Figures 14-10 and, 14-11 plot the residuals versus the levels of primer types and ap- 
plication methods, respectively. There is some indication that primer type 3 results in slightly 
lower variability in adhesion force than the other two primers. The graph of residuals versus 
fitted values in Fig. 14-12 does not reveal any unusual or diagnostic pattern. 


14-4.3 One Observation per Cell 


In some cases involving a two-factor factorial experiment, we may have only one replicate— 
that is, only one observation per cell. In this situation, there are exactly as many parameters in 
the analysis of variance model as observations, and the error degrees of freedom are zero. Thus, 
we cannot test hypotheses about the main effects and interactions unless some additional 


Table 14-8 Residuals for the Aircraft Primer Experiment in Example 14-1 


Application Method 
Primer Type Dipping Spraying 
l —0.27, 0.23, 0.03 0.10, —0.40, 0.30 
2 0.30, —0.40, 0.10 —0.27, 0.03, 0.23 


3 =0.03, «ES 0.17 0:33, m07 =0.17 
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Figure 14-9 Normal probability plot of the Figure 14-10 Plot of residuals versus primer type. 


residuals from Example 14-1. 


14-4.4 


+0.5 


eijk 


-0.5 


assumptions are made. One possible assumption is to assume the interaction effect is negligi- 
ble and use the interaction mean square as an error mean square. Thus, the analysis is equiva- 
lent to the analysis used in the randomized block design. This no-interaction assumption can be 
dangerous, and the experimenter should carefully examine the data and the residuals for indi- 
cations as to whether or not interaction is present. For more details, see Montgomery (2001). 


Factorial Experiments with Random Factors: Overview 


In Section 13-3 we introduced the concept of a random factor. This is, of course, a situation 
in which the factor of interest has a large number of possible levels and the experimenter 
chooses a subset of these levels at random from this population. Conclusions are then drawn 
about the population of factor levels. 

Random factors can occur in factorial experiments. If all the factors are random, the analysis 
of variance model is called a random-effects model. If some factors are fixed and other factors are 
random, the analysis of variance model is called a mixed model. The statistical analysis of random 
and mixed models is very similar to that of the standard fixed-effects models that are the primary 
focus of this chapter. The primary differences are in the types of hypotheses that are tested, the con- 
struction of test statistics for these hypotheses, and the estimation of model parameters. Some 
additional details on these topics are presented in Section 14-6 on the CD. For a more in-depth pres- 
entation, refer to Montgomery (2001) and Neter, Wasserman, Nachtsheim, and Kutner (1996). 
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Figure 14-11 Plot of residuals versus application Figure 14-12 Plot of residuals versus predicted 
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EXERCISES FOR SECTION 14-4 


14-1. In his book (Design and Analysis of Experiments, 
Sth edition, 2001 John Wiley & Sons), D. C. Montgomery 
presents the results of an experiment involving a storage 
battery used in the launching mechanism of a shoulder- 
fired ground-to-air missile. Three material types can be 
used to make the battery plates. The objective is to design a 
battery that is relatively unaffected by the ambient temper- 
ature. The output response from the battery is effective life 
in hours. Three temperature levels are selected, and a fac- 
torial experiment with four replicates is run. The data are as 
follows: 


Temperature (°F) 


Material Low Medium High 
l 130 155 34 40 20 70 
74 180 80 75 82 58 
2 150 188 136 127 25 70 
159 126 106 115 58 45 
3 138 110 174 120 96 104 
168 160 150 139 82 60 


(a) Test the appropriate hypotheses and draw conclusions 
using the analysis of variance with a = 0.05. 

(b) Graphically analyze the interaction. 

(c) Analyze the residuals from this experiment. 


14-2. An engineer who suspects that the surface finish of 
metal parts is influenced by the type of paint used and the drying 


— time. He selected three drying times—20, 25, and 30 minutes— 


CY 


and used two types of paint. Three parts are tested with 
each combination of paint type and drying time. The data are 
as follows: 


Drying Time (min) 

Paint 20 25 30 
] 74 73 78 
64 61 85 

50 44 92 

2 92 98 66 
86 73 45 

68 88 85 


(a) State and test the appropriate hypotheses using the analy- 
sis of variance with a = 0.05. 
(b) Analyze the residuals from this experiment. 
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14-3. An article in /ndustrial Quality Control (1956, pp. 
5—8) describes an experiment to investigate the effect of two 
factors (glass type and phosphor type) on the brightness of a 
television tube. The response variable measured is the current 
(in microamps) necessary to obtain a specified brightness 
level. The data are shown in the following table: 


Glass Phosphor Type 
Type I 2 3 
l 280 300 290 
290 310 285 
285 295 290 
2 230 260 220 
235 240 222 
240 259 230 


(a) State the hypotheses of interest in this experiment. 

(b) Test the above hypotheses and draw conclusions using the 
analysis of variance with a = 0.05. 

(c) Analyze the residuals from this experiment. 


14-4. An experiment was conducted to determine whether ei- 
ther firing temperature or furnace position affects the baked 
density of a carbon anode. The data are as follows: 


Temperature (°C) 


Position 800 825 850 
1 570 1063 565 
565 1080 510 

583 1043 590 

2 528 988 526 
547 1026 538 

521 1004 532 


(a) State the hypotheses of interest. 

(b) Test the above hypotheses using the analysis of variance 
with a = 0.05. What are your conclusions? 

(c) Analyze the residuals from this experiment. 


14-5. Continuation of Exercise 14-4. Using Fisher’s LSD 
method, investigate the differences between the mean baked 
anode density at the three different levels of temperature in 
Exercise 14-4. Use a = 0.05. 

14-6. Johnson and Leone (Statistics and Experimental 
Design in Engineering and the Physical Sciences, John Wiley, 
1977) describe an experiment conducted to investigate 


520 


warping of copper plates. The two factors studied were 
temperature and the copper content of the plates. The re- 
sponse variable is the amount of warping. The data are as 
follows: 


Temperature Copper Content (%) 


CC) 40 60 80 100 A 
50 17,220 16,21 2422 28,27 
75 12,9 18,13 17,12 27,31 
100 16,12 18,21 25,23 30,23 Frequency 1 
125 WG 93-91. 9322 59931 


(a) Is there any indication that either factor affects the amount 
of warping? Is there any interaction between the factors? 
Use a = 0.05. 

(b) Analyze the residuals from this experiment. 

(c) Plot the average warping at each level of copper content 
and compare the levels using Fisher’s LSD method. 
Describe the differences in the effects of the different 
levels of copper content on warping. If low warping is 
desirable, what level of copper content would you 
specify? 

(d) Suppose that temperature cannot be easily controlled in 
the environment in which the copper plates are to be used. 
Does this change your answer for part (c)? 


14-7. Consider a two-factor factorial experiment. Develop a 
formula for finding a 100 (1 — a)% confidence interval on 
the difference between any two means for either a row or 
column factor. Apply this formula to find a 95% CI on the dif- 
ference in mean warping at the levels of copper content 60 and 
80% in Exercise 14-6. 

14-8. An article in the Journal of Testing and Evaluation 
(Vol. 16, no. 6, 1988, pp. 508—515) investigated the effects of 
cyclic loading frequency and environment conditions on fa- 
tigue crack growth at a constant 22 MPa stress for a particular 
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material. The data from the experiment follow. The response 
variable is fatigue crack growth rate. 


Environment 
H,O 


Air Salt H,O 


0.1 





(a) Is there indication that either factor affects crack growth 
rate? Is there any indication of interaction? Use a = 0.05. 

(b) Analyze the residuals from this experiment. 

(c) Repeat the analysis in part (a) using /n(y) as the response. 
Analyze the residuals from this new response variable and 
comment on the results. 


14-9. An article in the ZEEE Transactions on Electron Devices 

(November 1986, p. 1754) describes a study on the effects of 

two variables—polysilicon doping and anneal conditions (time 

and temperature)—on the base current of a bipolar transistor. 

The data from this experiment follows below Exercise 14-10. 

(a) Is there any evidence to support the claim that either poly- 
silicon doping level or anneal conditions affect base 
current? Do these variables interact? Use a = 0.05. 

(b) Graphically analyze the interaction. 

(c) Analyze the residuals from this experiment. 

14-10. Consider the experiment described in Exercise 14-9. 

Use Fisher's LSD method to isolate the effects of anneal con- 

ditions on base current, with a = 0.05. 


Anneal (temperature/time) 


900/60 900/180 950/60 1000/15 1000/30 
px 1920 | 440 830 | 10.15 | 1029 | 11.01 
Polysilicon 4.60 890 | 1020 | 10.30 | 10.58 
doping seque | 220 7.81 938 | 10.19 | 10.81 
3.50 7.15 10.02 | 10.10 | 10.60 





14-5 GENERAL FACTORIAL EXPERIMENTS 
Many experiments involve more than two factors. In this section we introduce the case where 
there are a levels of factor A, b levels of factor B, c levels of factor C, and so on, arranged 
in a factorial experiment. In general, there will be abc ... n total observations, if there are 
n replicates of the complete experiment. 
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Table 14-9 Analysis of Variance Table for the Three-Factor Fixed Effects Model 


























Source of Sum of Degrees of Expected 
Variation Squares Freedom Mean Square Mean Squares Fo 
, Den X MS, 
A SS, g=] MS, | r a 
da — 1 MS; 
B Ss b-1 MS 24 aa Món 
B i d b EN ] MS; 
bn v; MS 
C SS. peni MS. gp e dius 
(nes MS; 
nEph MS yp 
AB SS —1)\(b-1 MS 2 4 —M—————— 
AB (a X ) AB o (a m 1)(b m 1) MS; 
bn È (Ty) MSac 
AC SS = ie I MS 2 y aall 
AC (a Xc ) AC o (a — le — 1) MS; 
an 2. X(By)ii MSgc 
BC. SS p— liie= 1 MS + 
BC ( Ie ) BC o (b m 1)(c _ 1) MS; 
n ZXX(vBy)j, MS spc 
ABC SS — Db — De — 1 MS ? 4 
ABC (a X Ke ) ABC uU (a Em 1)(b _ 1)(c m 1) MS; 
Error SSr abc(n — 1) MS; o? 
Total SS abcn — | 


EXAMPLE 14-2 


For example, consider the three-factor-factorial experiment, with underlying model 


Yigg = w+ 7; + B+ Ye + (0B)g + (ry)a + (BY) 


+ (By); t €ikl (14-10) 


-— p V €. 
| 

— ER = m 

NNNN 

3 o de ES 


Notice that the model contains three main effects, three two-factor interactions, a three-factor 
interaction, and an error term. Assuming that A, B, and C are fixed factors, the analysis of vari- 
ance is shown in Table 14-9. Note that there must be at least two replicates (n = 2) to compute 
an error sum of squares. The F-test on main effects and interactions follows directly from 
the expected mean squares. These ratios follow F distributions under the respective null 
hypotheses. 


A mechanical engineer is studying the surface roughness of a part produced in a metal-cutting 
operation. Three factors, feed rate (A), depth of cut (B), and tool angle (C ), are of interest. All 
three factors have been assigned two levels, and two replicates of a factorial design are run. 
The coded data are shown in Table 14-10. 

The ANOVA is summarized in Table 14-11. Since manual ANOVA computions are 
tedious for three-factor experiments, we have used Minitab for the solution of this problem. 
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Table 14-10 Coded Surface Roughness Data for Example 14-2 


Depth of Cut (B) 
0.025 inch 0.040 inch 
Feed Rate Tool EC A = ) Tool = = ) 
(4) 15° yi. 


9 
20 inches per minute 7 HER 75 
10 
30 inches per minute 12 102 


The F-ratios for all three main effects and the interactions are formed by dividing the mean 
square for the effect of interest by the error mean square. Since the experimenter has selected 
a = 0.05, the critical value for each of these F-ratios is f; o4, 4 = 5.32. Alternately, we could 
use the P-value approach. The P-values for all the test statistics are shown in the last column 
of Table 14-11. Inspection of these P-values is revealing. There is a strong main effect of feed 
rate, since the F-ratio is well into the critical region. However, there is some indication of an 
effect due to the depth of cut, since P = 0.0710 is not much greater than a = 0.05. The next 
largest effect is the AB or feed rate X depth of cut interaction. Most likely, both feed rate and 
depth of cut are important process variables. 


Obviously, factorial experiments with three or more factors can require many runs, par- 
ticularly if some of the factors have several (more than two) levels. This point of view leads 
us to the class of factorial designs considered in Section 14-7 with all factors at two levels. 
These designs are easy to set up and analyze, and they may be used as the basis of many other 
useful experimental designs. 


Table 14-11 Minitab ANOVA for Example 14-2 


ANOVA (Balanced Designs) 


Factor Type Levels Values 

Feed fixed 2 20 30 
Depth fixed 2 0.025 0.040 
Angle fixed 2 15 22 


Analysis of Variance for Roughness 


Source DF SS MS F P 
Feed 1 45.563 45.563 18.69 0.003 
Depth 1 10.563 10.563 4.33 0.071 
Angle 1 3.063 3.063 1.26 0.295 
Feed* Depth 1 7.563 7.563 3.10 0.116 
Feed* Angle 1 0.062 0.062 0.03 0.877 
Depth* Angle 1 1.563 1.563 0.64 0.446 
Feed*Depth* Angle 1 5.062 5.062 2.08 0.188 
Error 8 19.500 2.437 

Total 15 92.938 


EXERCISES FOR SECTION 14-5 


14-11. The percentage of hardwood concentration in raw 
pulp, the freeness, and the cooking time of the pulp are being 
investigated for their effects on the strength of paper. The data 
from a three-factor factorial experiment are shown in the fol- 
lowing table. 


Cooking Time 1.5 hours 
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(a) Analyze the data using the analysis of variance assuming 
that all factors are fixed. Use a = 0.05. 

(b) Find P-values for the F-ratios in part (a). 

(c) The residuals are found by ej;; = yj; — Yi- Graphically 
analyze the residuals from this experiment. 


Cooking Time 2.0 hours 


Percentage 
of Hardwood Freeness Freeness 
Concentration 350 500 650 350 500 650 
10 96.6 97-1 99.4 98.4 99.6 100.6 
96.0 96.0 99.8 98.6 100.4 100.9 
15 98.5 96.0 98.4 97.5 98.7 99.6 
LB, 96.9 97.6 98.1 96.0 99.0 
20 97.9 95.6 97.4 97.6 97.0 98.5 
96.6 96.2 98.1 98.4 97.8 99.8 


14-12. The quality control department of a fabric finishing 
plant is studying the effects of several factors on dyeing for a 
blended cotton/synthetic cloth used to manufacture shirts. 
Three operators, three cycle times, and two temperatures were 
selected, and three small specimens of cloth were dyed under 
each set of conditions. The finished cloth was compared to a 


standard, and a numerical score was assigned. The results are 

shown in the following table. 

(a) State and test the appropriate hypotheses using the analy- 
sis of variance with a = 0.05. 

(b) The residuals may be obtained from ej; = Vij — Yi. 
Graphically analyze the residuals from this experiment. 


Temperature 
300* 350° 
Operator Operator 
Cycle Time l 2 3 1 2 3 
23 27 31 24 38 34 
40 24 28 32 23 36 36 
25 26 28 28 35 39 
36 34 9o 37 34 34 
50 35 38 34 39 38 36 
36 39 35 35 36 31 
28 35 26 26 36 28 
60 24 35 21 29 37 26 
PA 34 25 25 34 34 


14-6 FACTORIAL EXPERIMENTS WITH RANDOM 


FACTORS (CD ONLY) 


14-7 2‘ FACTORIAL DESIGNS 


Factorial designs are frequently used in experiments involving several factors where it is 
necessary to study the joint effect of the factors on a response. However, several special 
cases of the general factorial design are important because they are widely employed 
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14-7.1 2? Design 


Figure 14-13 The 2? 
factorial design. 


CHAPTER 14 DESIGN OF EXPERIMENTS WITH SEVERAL FACTORS 


in research work and because they form the basis of other designs of considerable practical 
value. 

The most important of these special cases is that of k factors, each at only two levels. 
These levels may be quantitative, such as two values of temperature, pressure, or time; or they 
may be qualitative, such as two machines, two operators, the “high” and “low” levels of a fac- 
tor, or perhaps the presence and absence of a factor. A complete replicate of such a design 
requires 2 X 2 X * * - X 2 = 2* observations and is called a 2“ factorial design. 

The 2^ design is particularly useful in the early stages of experimental work, when many 
factors are likely to be investigated. It provides the smallest number of runs for which k fac- 
tors can be studied in a complete factorial design. Because there are only two levels for each 
factor, we must assume that the response is approximately linear over the range of the factor 
levels chosen. 


The simplest type of 2^ design is the 2/—that is, two factors A and B, each at two levels. We 
usually think of these levels as the low and high levels of the factor. The 2? design is shown in 
Fig. 14-13. Note that the design can be represented geometrically as a square with the 2? — 4 
runs, or treatment combinations, forming the corners of the square. In the 2? design it is cus- 
tomary to denote the low and high levels of the factors A and B by the signs — and +, respec- 
tively. This is sometimes called the geometric notation for the design. 

A special notation is used to label the treatment combinations. In general, a treatment 
combination is represented by a series of lowercase letters. If a letter is present, the corre- 
sponding factor is run at the high level in that treatment combination; if it is absent, the factor 
is run at its low level. For example, treatment combination a indicates that factor A is at the 
high level and factor B 1s at the low level. The treatment combination with both factors at the 
low level is represented by (1). This notation is used throughout the 2^ design series. For ex- 
ample, the treatment combination in a 2^ with A and C at the high level and B and D at the low 
level is denoted by ac. 

The effects of interest in the 2? design are the main effects A and B and the two-factor in- 
teraction AB. Let the letters (1), a, b, and ab also represent the totals of all n observations taken 
at these design points. It 1s easy to estimate the effects of these factors. To estimate the main 
effect of 4, we would average the observations on the right side of the square in Fig. 14-13 
where A is at the high level, and subtract from this the average of the observations on the left 
side of the square, where A 1s at the low level, or 





High 
(+) b ab 
Treatment A B 
(1) — — 
B a TS 
b — + 
ab + + 
Low |(1) a 
omer A High 


(-) (+) 
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Ip Ve 
_atab b-t(l) 


2n 2n 


= [a + ab — b — (1)] (14-11) 





similarly, the main effect of B 1s found by averaging the observations on the top of the square, 
where B is at the high level, and subtracting the average of the observations on the bottom of 
the square, where B is at the low level: 


(14-12) 





Finally, the AB interaction is estimated by taking the difference in the diagonal averages 
in Fig. 14-12, or 


(14-13) 





The quantities in brackets in Equations 14-11, 14-12, and 14-13 are called contrasts. For 
example, the A contrast is 


Contrast, = a + ab — b — (1) 
In these equations, the contrast coefficients are always either +1 or — 1. A table of plus and 


minus signs, such as Table 14-12, can be used to determine the sign on each treatment 


Table 14-12 Signs for Effects in the 2^ Design 


Treatment Factorial Effect 
Combination I A B AB 
(1) + - = 1 
a + + - = 
b 3 - n E 
ab + + 4 a 
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EXAMPLE 14-3 





combination for a particular contrast. The column headings for Table 14-12 are the main ef- 
fects A and B, the AB interaction, and J, which represents the total. The row headings are the 
treatment combinations. Note that the signs in the AB column are the product of signs from 
columns A and B. To generate a contrast from this table, multiply the signs in the appropriate 
column of Table 14-12 by the treatment combinations listed in the rows and add. For example, 
contrasty, = [(1)] + [-a] + [-b] + [ab] = ab + (1) — a — b. 

Contrasts are used in calculating both the effect estimates and the sums of squares for A, 
B, and the AB interaction. The sums of squares formulas are 


(14-14) 


The analysis of variance is completed by computing the total sum of squares SS; (with 4n — 1 
degrees of freedom) as usual, and obtaining the error sum of squares SS, [with 4(n — 1) 
degrees of freedom] by subtraction. 


An article in the AT&T Technical Journal (Vol. 65, March/April 1986, pp. 39-50) describes 
the application of two-level factorial designs to integrated circuit manufacturing. A basic pro- 
cessing step in this industry is to grow an epitaxial layer on polished silicon wafers. The 
wafers are mounted on a susceptor and positioned inside a bell jar. Chemical vapors are intro- 
duced through nozzles near the top of the jar. The susceptor is rotated, and heat is applied. 
These conditions are maintained until the epitaxial layer is thick enough. 

Table 14-13 presents the results of a 2? factorial design with n = 4 replicates using the 
factors A = deposition time and B = arsenic flow rate. The two levels of deposition time 
are — =short and + =long, and the two levels of arsenic flow rate are — =55% and + 
=59%. The response variable is epitaxial layer thickness (um). We may find the estimates of 
the effects using Equations 14-11, 14-12, and 14-13 as follows: 


l 
A — z-[a * ab — b — (1 
2n |a "bb — b -— t1) 
] 
= X4) [59.299 + 59.156 — 55.686 — 56.081] = 0.836 
Bede b — a — (1)] 
= P a a 
l 
— (4) [55.686 + 59.156 — 59.299 — 56.081] = 0.067 
Aas bth) ap) 
= n a a 
l 
AB = —— [59.156 + 56.081 — 59.299 — 55.686] = 0.032 


Table 14-13 The 2? Design for the Epitaxial Process Experiment 


Treatment 
Combination 


(1) 


b 
ab 


Design Factors 


B AB 
— -+ 
+ == 
+ -+ 


14.037 
14.821 
13.880 
14.888 


Thickness ( um) 
14.165 13.972 
14.757 14.843 
13.860 14.032 
14.921 14.415 


13.907 
14.878 
13.914 
14.932 
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Thickness (p.m) 
Total Average 
56.081 14.020 
59.299 14.825 
55.686 15.927 
59.156 14.789 


The numerical estimates of the effects indicate that the effect of deposition time is large and 
has a positive direction (increasing deposition time increases thickness), since changing dep- 
osition time from low to high changes the mean epitaxial layer thickness by 0.836 wm. The 
effects of arsenic flow rate (5) and the AB interaction appear small. 

The importance of these effects may be confirmed with the analysis of variance. The 
sums of squares for A, B, and AB are computed as follows: 








OM ud da ECT RN 

16 16 

2 2 

m [b + ab = n [0.538] -— 
SS45 = b tU) Zamo] [02527 0.0040 

16 16 

; , (56.081 + 4 59.156) 

SS; = 14.037? + = + 14.932? — T 


= 3.0672 


The analysis of variance is summarized in Table 14-14 and confirms our conclusions obtained 
by examining the magnitude and direction of the effects. Deposition time is the only factor 
that significantly affects epitaxial layer thickness, and from the direction of the effect esti- 
mates we know that longer deposition times lead to thicker epitaxial layers. 


Residual Analysis 
It is easy to obtain the residuals from a 2^ design by fitting a regression model to the data. For 
the epitaxial process experiment, the regression model is 


Y = Bo + Bix e 


Table 14-14 Analysis of Variance for the Epitaxial Process Experiment 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square fo P-Value 
A (deposition time) 2.7956 l 2.7956 134.40 7.07 E-8 
B (arsenic flow) 0.0181 l 0.0181 0.87 0.38 
AB 0.0040 1 0.0040 0.19 0.67 
Error 0.2495 12 0.0208 
Total 3.0672 15 
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since the only active variable is deposition time, which is represented by a coded variable x. 
The low and high levels of deposition time are assigned values x, = —1 and x, = +1, 
respectively. The least squares fitted model is 


. 0.836 
9 = 14.389 *| —— ] x 


where the intercept B, is the grand average of all 16 observations (y) and the slope Bis one- 
half the effect estimate for deposition time. (The regression coefficient is one-half the effect 
estimate because regression coefficients measure the effect of a unit change in x, on the mean 
of Y, and the effect estimate is based on a two-unit change from —1 to +1.) 

This model can be used to obtain the predicted values at the four points that form the cor- 
ners of the square in the design. For example, consider the point with low deposition time 


(x, = —1) and low arsenic flow rate. The predicted value is 
. 0.836 
$ = 14.389 + 3 (—1) = 13.971 pm 


and the residuals for the four runs at that design point are 


e, — 14.037 — 13.971 — 0.066 
e, = 14.165 — 13.971 = 0.194 
e, = 13.972 — 13.971 = 0.001 
e, = 13.907 — 13.971 = —0.064 


The remaining predicted values and residuals at the other three design points are calculated in 
a similar manner. 

A normal probability plot of these residuals is shown in Fig. 14-14. This plot indicates 
that one residual e,; = —0.392 is an outlier. Examining the four runs with high deposition 
time and high arsenic flow rate reveals that observation y,; = 14.415 is considerably smaller 
than the other three observations at that treatment combination. This adds some additional 
evidence to the tentative conclusion that observation 15 is an outlier. Another possibility is 
that some process variables affect the variability 1n epitaxial layer thickness. If we could dis- 
cover which variables produce this effect, we could perhaps adjust these variables to levels 
that would minimize the variability in epitaxial layer thickness. This could have important im- 
plications in subsequent manufacturing stages. Figures 14-15 and 14-16 are plots of residuals 
versus deposition time and arsenic flow rate, respectively. Apart from that unusually large 
residual associated with y,;, there is no strong evidence that either deposition time or arsenic 
flow rate influences the variability in epitaxial layer thickness. 

Figure 14-17 shows the standard deviation of epitaxial layer thickness at all four runs in 
the 2? design. These standard deviations were calculated using the data in Table 14-13. Notice 
that the standard deviation of the four observations with A and B at the high level 1s consider- 
ably larger than the standard deviations at any of the other three design points. Most of this 
difference is attributable to the unusually low thickness measurement associated with y,;. The 
standard deviation of the four observations with A and B at the low level is also somewhat 
larger than the standard deviations at the remaining two runs. This could indicate that other 
process variables not included in this experiment may affect the variability in epitaxial layer 
thickness. Another experiment to study this possibility, involving other process variables, 
could be designed and conducted. (The original paper in the AT&T Technical Journal shows 
that two additional factors, not considered in this example, affect process variability. ) 
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14-7.2. 2* Design for k = 3 Factors 


Figure 14-17 The 
standard deviation of 
epitaxial layer thick- 
ness at the four runs in 
the 2? design. 


The methods presented in the previous section for factorial designs with k = 2 factors each at 
two levels can be easily extended to more than two factors. For example, consider k = 3 fac- 
tors, each at two levels. This design is a 2? factorial design, and it has eight runs or treatment 
combinations. Geometrically, the design is a cube as shown in Fig. 14-18(a), with the eight 
runs forming the corners of the cube. Figure 14-18(b) lists the eight runs in a table, with each 
row representing one of the runs are the — and + settings indicating the low and high levels 


0.077 0.251 
T 
b ab 
B 
7 (1) a 
0.110 A 0.055 
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Figure 14-18 The 2? 
design. 


Figure 14-19 
Geometric presenta- 
tion of contrasts corre- 
sponding to the main 
effects and interaction 
in the 2° design. (a) 
Main effects. (b) Two- 
factor interactions. 

(c) Three-factor 
interaction. 


abc 
Run A B C 
+ 
1 — " i 
2 + — 
C 3 - + - 
ab 4 + + 
5 - - + 
= + 6 + - + 
7 — + 
8 + + 





(a) Geometric view (b) The 2? design matrix 


for each of the three factors. This table is sometimes called the design matrix. This design al- 
lows three main effects to be estimated (A, B, and C) along with three two-factor interactions 
(AB, AC, and BC) and a three-factor interaction (ABC). 

The main effects can easily be estimated. Remember that the lowercase letters (1), a, b, 
ab, c, ac, bc, and abc represent the total of all n replicates at each of the eight runs in the de- 
sign. As seen in Fig. 14-19(a), the main effect of A can be estimated by averaging the four 
treatment combinations on the right-hand side of the cube, where A 1s at the high level, and by 





Two-factor interactions 


(b) 


C 
e — + runs 
B o =- runs 
A 
ABC 


Three-factor interaction 


(c) 
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subtracting from this quantity the average of the four treatment combinations on the left-hand 
side of the cube where A 1s at the low level. This gives 


A = Y44— Ya- 
atab+act+abe (1)+b+c+ bc 


4n 4n 


This equation can be rearranged as 


l 
a; (a + ab + ac + abe — (1) - b — c — bc] (14-15) 





In a similar manner, the effect of B is the difference in averages between the four treatment 
combinations in the back face of the cube (Fig. 14-192), and the four in the front. This yields 


B- m Ms 


l 
= 7, |b + ab + be + abc — (1) - a — c — ac] (14-16) 





The effect of C is the difference in average response between the four treatment combinations 
in the top face of the cube in Figure 14-19(a) and the four in the bottom, that is, 


Caa ec 


1 
= ag le + ae + be + abe — (1) - a - b — ab] (14-17) 





The two-factor interaction effects may be computed easily. A measure of the AB interaction 
is the difference between the average A effects at the two levels of B. By convention, one-half 
of this difference is called the AB interaction. Symbolically, 


High (+) [(abc — bc) + (ab — b)| 
| 2n 
Low) dac le CDI 
2n 


Difference Co = e = Des qe eee SEDE] 
2n 
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Since the AB interaction 1s one-half of this difference, 


I 
dn Inpecohosbsabcbaesbeevekarseq b (14-18) 


4n 





We could write Equation 14-18 as follows: 


abc +ab+c+(1) be+bt+act+a 
Po uade NRI Tu NE LONE UM Mc 
4n 4n 
In this form, the AB interaction is easily seen to be the difference in averages between runs 


on two diagonal planes in the cube in Fig. 14-19(b). Using similar logic and referring to 
Fig. 14-19(b), we find that the AC and BC interactions are 


(14-19) 


(14-20) 





The ABC interaction is defined as the average difference between the AB interaction for 
the two different levels of C. Thus, 


ABC — = (abc ~be = ue] = ap =b] Fa =A 


or 


| 
ABC = 7 (Abe — be — ac + c — ab + b + a ~ (1)] (14-21) 





As before, we can think of the ABC interaction as the difference in two averages. If the runs 
in the two averages are isolated, they define the vertices of the two tetrahedra that comprise 
the cube in Fig. 14-19(c). 

In Equations 14-15 through 14-21, the quantities in brackets are contrasts in the treat- 
ment combinations. A table of plus and minus signs can be developed from the contrasts 
and is shown in Table 14-15. Signs for the main effects are determined directly from the test 
matrix in Figure 14-18(b). Once the signs for the main effect columns have been estab- 
lished, the signs for the remaining columns can be obtained by multiplying the appropriate 
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Table 14-15 Algebraic Signs for Calculating Effects in the 2? Design 


Treatment Factorial Effect 
Combination I A B AB C AG BC ABC 

(1) F » = J — + + — 

a T + — E = — Ju 4 

b + — 4 = — 4 _ n 
ab + + a ES E E _ 2 

p iE m = + + — = A 
ac + + — E 4 d _ _ 
bc + — = 4 _ 4 B 
abc + + + + d 4 i 4 


main effect row by row. For example, the signs in the AB column are the products of the A 
and B column signs in each row. The contrast for any effect can easily be obtained from this 
table. 

Table 14-15 has several interesting properties: 


1. Except for the identity column /, each column has an equal number of plus and minus 
signs. 


2. The sum of products of signs in any two columns is zero; that is, the columns in the 
table are orthogonal. 


3. Multiplying any column by column / leaves the column unchanged; that is, Z is an 
identity element. 


4. The product of any two columns yields a column in the table, for example A X B = 
AB, and AB X ABC = A?B?C = C, since any column multiplied by itself is the 
identity column. 


The estimate of any main effect or interaction in a 2* design is determined by multiplying 
the treatment combinations in the first column of the table by the signs in the corresponding 
main effect or interaction column, by adding the result to produce a contrast, and then by di- 
viding the contrast by one-half the total number of runs in the experiment. For any 2^ design 
with n replicates, the effect estimates are computed from 


Contrast 
Effect = ES (14-22) 





and the sum of squares for any effect 1s 


(Contrast)? 


SE (14-23) 
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EXAMPLE 14-4 Consider the surface roughness experiment originally described in Example 14-2. This is a 2? 
factorial design in the factors feed rate (A), depth of cut (B), and tool angle (C), with n = 2 
replicates. Table 14-16 presents the observed surface roughness data. 

The main effects may be estimated using Equations 14-15 through 14-21. The effect of A, 
for example, is 


] 
A=7 4 + ab + ac + abe — (1) - b — c ~ be! 
1 
—— |2 F27 + 23 F40 — 16 —20 — 21 — 18] 
4(2) 
1 


[27] = 3.375 


oo | 


and the sum of squares for A 1s found using Equation 14-23: 


Contrast ,) 27% 
SS, = HE — ter — 45.5625 
n2 2(8) 


It is easy to verify that the other effects are 


B = 1.625 
C = 0.875 
AB = 1.375 
AC = 0.125 
BC = —0.625 
ABC = 1.125 


Examining the magnitude of the effects clearly shows that feed rate (factor A) is dominant, 
followed by depth of cut (B) and the AB interaction, although the interaction effect is rela- 
tively small. The analysis of variance, summarized in Table 14-17, confirms our interpretation 
of the effect estimates. 

Minitab will analyze 2^ factorial designs. The output from the Minitab DOE (Design of 
Experiments) module for this experiment is shown in Table 14-18. The upper portion of the table 
displays the effect estimates and regression coefficients for each factorial effect. However, a 


Table 14-16 Surface Roughness Data for Example 14-4 


Treatment Design Factors — — Surface 
Combinations A B C Roughness Totals 

(1) =] =] ex] 9, 7 16 

a 1 ex] e] 10, 12 22 

b =] 1 ex] 9,11 20 
ab 1 i| =] 12-15 27 

C =] =] 1 11, 10 21 
ac 1 es] 1 10, 13 23 
bc e] 1 l 10, 8 18 
abc 1 l 1 16, 14 30 
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Table 14-17 Analysis of Variance for the Surface Finish Experiment 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square fo P-Value 
A 45.5625 l 45.5625 18.69 0.0025 
B 10.5625 1 10.5625 4.33 0.0709 
C 3.0625 l 3.0625 1.26 0.2948 
AB 7.5625 1 7.5625 3.10 0.1162 
AC 0.0625 1 0.0625 0.03 0.8784 
BC 1.5625 ] 1.5625 0.64 0.4548 
ABC 5.0625 1 5.0625 2.08 0.1875 
Error 19.5000 8 2.4375 

Total 92.9375 15 


t-statistic is reported for each effect instead of the F-statistic used in Table 14-17. Now the square 
of a t random variable with d degrees of freedom is an F random variable with 1 numerator and 
d denominator degrees of freedom. Thus the square of the t-statistic reported by Minitab will be 
equal (apart from rounding errors) to the F-statistic in Table 14-17. To illustrate, for the main ef- 
fect of feed Minitab reports t = 4.32 (with eight degrees of freedom), and £ = (4.327 = 18.66, 
which is approximately equal to the F-ratio for feed reported in Table 14-17 (F = 18.69). This 
F-ratio has one numerator and eight denominator degrees of freedom. 

The lower panel of the Minitab output in Table 14-18 is an analysis of variance summary 
focusing on the types of terms in the model. A regression model approach is used in the pres- 
entation. You might find it helpful to review Section 12-2.2, particularly the material on the 
partial F-test. The row entitled “main effects” under source refers to the three main effects 


Table 14-18 Minitab Analysis for Example 14-4 


Estimated Effects and Coefficients for Roughness 


Term Effect Coef StDev Coef T P 
Constant 11.0625 0.3903 28.34 0.000 
Feed 3.3750 1.6875 0.3903 4.32 0.003 
Depth 1.6250 0.8125 0.3903 2.08 0.071 
Angle 0.8750 0.4375 0.3903 [12 0.295 
Feed* Depth 1.3750 0.6875 0.3903 1.76 0.116 
Feed* Angle 0.1250 0.0625 0.3903 0.16 0.877 
Depth* Angle —0.6250 —0.3125 0.3903 —0.80 0.446 
Feed*Depth* Angle 1.1250 0.5625 0.3903 1.44 0.188 
Analysis of Variance for Roughness 
Source DF Seq SS Adj SS Adj MS F P 
Main Effects 3 59.188 59.188 19.729 8.09 0.008 
2-Way Interactions > 9.187 9.187 3.062 1.26 0.352 
3-Way Interactions l 5.062 5.062 5.062 2.08 0.188 
Residual Error 8 19.500 19.500 2.437 

Pure Error 8 19.500 19.500 2.437 
Total 15 92.938 
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feed, depth, and angle, each having a single degree of freedom, giving the total 3 in the col- 
umn headed *DE" The column headed *Seq SS" (an abbreviation for sequential sum of 
squares) reports how much the model sum of squares increases when each group of terms is 
added to a model that contains the terms listed above the groups. The first number in the “Seq 
SS" column presents the model sum of squares for fitting a model having only the three main 
effects. The row labeled *2-Way Interactions" refers to AB, AC, and BC, and the sequential 
sum of squares reported here is the increase in the model sum of squares if the interaction 
terms are added to a model containing only the main effects. Similarly, the sequential sum of 
squares for the three-way interaction 1s the increase in the model sum of squares that results 
from adding the term ABC to a model containing all other effects. The column headed “Adj 
SS" (an abbreviation for adjusted sum of squares) reports how much the model sum of squares 
increases when each group of terms is added to a model that contains a// the other terms. Now 
since any 2* design with an equal number of replicates in each cell is an orthogonal design, the 
adjusted sum of squares will equal the sequential sum of squares. Therefore, the F-tests for 
each row in the Minitab analysis of variance table are testing the significance of each group of 
terms (main effects, two-factor interactions, and three-factor interactions) as 1f they were the 
last terms to be included in the model. Clearly, only the main effect terms are significant. The 
t-tests on the individual factor effects indicate that feed rate and depth of cut have large main 
effects, and there may be some mild interaction between these two factors. Therefore, the 
Minitab output is in agreement with the results given previously. 


Residual Analysis 

We may obtain the residuals from a 2‘ design by using the method demonstrated earlier for the 2? 
design. As an example, consider the surface roughness experiment. The three largest effects are 
A, B, and the AB interaction. The regression model used to obtain the predicted values is 


Y = Bo + Bix, + Box. + Bioxio + € 


where x, represents factor A, x, represents factor B, and x,x, represents the AB interaction. The 
regression coefficients B,, B2, and B,, are estimated by one-half the corresponding effect esti- 
mates, and Bọ is the grand average. Thus 


` 2345 1.625 1.375 
y = 11.0625 + > Xr E Xo + Eu X1X2 


= 11.0625 + 1.6875x, + 0.8125x, + 0.6875x)x> 


Note that the regression coefficients are presented by Minitab in the upper panel of Table 14-18. 
The predicted values would be obtained by substituting the low and high levels of A and B into 
this equation. To illustrate this, at the treatment combination where A, B, and C are all at the low 
level, the predicted value is 


y = 11.065 + 1.6875(—1) + 0.8125(—1) + 0.6875(—1)(—1) = 9.25 


Since the observed values at this run are 9 and 7, the residuals are 9 — 9.25 = —0.25 and 
7 — 9.25 = —2.25. Residuals for the other 14 runs are obtained similarly. 

A normal probability plot of the residuals is shown in Fig. 14-20. Since the residuals lie 
approximately along a straight line, we do not suspect any problem with normality in the data. 
There are no indications of severe outliers. It would also be helpful to plot the residuals ver- 
sus the predicted values and against each of the factors A, B, and C. 


Figure 14-20 
Normal probability 
plot of residuals from 
the surface roughness 
experiment. 
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Normal probability 





-2.250 -1.653 -0.917 -0.250 0.417 1.083 1.750 
Residual 


Projection of 2* Designs 

Any 2* design will collapse or project into another 2* design in fewer variables if one or more of 
the original factors are dropped. Sometimes this can provide additional insight into the 
remaining factors. For example, consider the surface roughness experiment. Since factor C and 
all its interactions are negligible, we could eliminate factor C from the design. The result is to 
collapse the cube in Fig. 14-18 into a square in the A — B plane; therefore, each of the four runs 
in the new design has four replicates. In general, if we delete / factors so that r = k — h factors 
remain, the original 2^ design with n replicates will project into a 2” design with n2" replicates. 


14-7.3 Single Replicate of the 2* Design 


As the number of factors in a factorial experiment grows, the number of effects that can be 
estimated also grows. For example, a 2^ experiment has 4 main effects, 6 two-factor interac- 
tions, 4 three-factor interactions, and 1 four-factor interaction, while a 2° experiment has 6 
main effects, 15 two-factor interactions, 20 three-factor interactions, 15 four-factor interac- 
tions, 6 five-factor interactions, and 1 six-factor interaction. In most situations the sparsity of 
effects principle applies; that is, the system is usually dominated by the main effects and low- 
order interactions. The three-factor and higher order interactions are usually negligible. 
Therefore, when the number of factors is moderately large, say, k = 4 or 5, a common prac- 
tice is to run only a single replicate of the 2 design and then pool or combine the higher order 
interactions as an estimate of error. Sometimes a single replicate of a 2^ design is called an 
unreplicated 2^ factorial design. 

When analyzing data from unreplicated factorial designs, occasionally real high-order 
interactions occur. The use of an error mean square obtained by pooling high-order interactions 
is inappropriate in these cases. A simple method of analysis can be used to overcome this prob- 
lem. Construct a plot of the estimates of the effects on a normal probability scale. The effects 
that are negligible are normally distributed, with mean zero and variance o? and will tend to fall 
along a straight line on this plot, whereas significant effects will have nonzero means and will 
not lie along the straight line. We will illustrate this method in the next example. 
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EXAMPLE 14-5 


An article in Solid State Technology (“Orthogonal Design for Process Optimization and Its 
Application in Plasma Etching,’ May 1987, pp. 127—132) describes the application of facto- 
rial designs in developing a nitride etch process on a single-wafer plasma etcher. The process 
uses C,F, as the reactant gas. It is possible to vary the gas flow, the power applied to the cath- 
ode, the pressure in the reactor chamber, and the spacing between the anode and the cathode 
(gap). Several response variables would usually be of interest in this process, but in this 
example we will concentrate on etch rate for silicon nitride. 

We will use a single replicate of a 2^ design to investigate this process. Since it is unlikely 
that the three- and four-factor interactions are significant, we will tentatively plan to combine 
them as an estimate of error. The factor levels used in the design are shown below: 


Level 
Low (—) 
High (+) 


Gap 
(cm) 
0.80 
1.20 


Design Factor 


Pressure 
(mTorr) 


450 
550 


C5F, Flow 
(SCCM) 


125 
200 


Power 
(w) 
275 
329 


Table 14-19 presents the data from the 16 runs of the 2^ design. Table 14-20 is the table of plus 
and minus signs for the 2^ design. The signs in the columns of this table can be used to esti- 


mate the factor effects. For example, the estimate of factor A 1s 


l 
A — z [a + ab + ac + abe + ad + abd + acd + abcd — (1) — b 


=p = þa =d = pg ca = bed] 
1 
g (069 + 650 + 642 + 635 + 749 + 868 + 860 + 729 


— 550 = 604 = 633 — 601 = 1037 — 1052 — 1075 — 1063 | 


—101.625 


Table 14-19 The 2f Design for the Plasma Etch Experiment 


A 


(Gap) (Pressure) 


=l 
l 
E. 
l 
=l 
l 
=] 


B 


=] 
= 


C 


(C;F, Flow) 


D 


(Power) 


-o 


Etch Rate 
(À/min) 


550 
669 
604 
650 
633 
642 
601 
635 
1037 
749 
1057 
868 
1075 
860 
1063 
729 
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Table 14-20 Contrast Constants for the 2^ Design 
A B AB C AC BC ABC D AD BD ABD CD ACD BCD ABCD 


(1) = = de s a — gp = = gw F = + = = + 
a ee = I = FP ës s F 3 + = — 
b BUR =- s go = de c 3B = F F = + = 
ab Po d = — = — á = s č S = 4 F F + 
C = = de B s = qe e BO s = = + F = 
ac = 2 AP = = ndi T Ds = E T + 
bc = dé c» a == we = = Æ = us = =f = + 
abc Rode cde + #4 T T = = = = = = = = 
d = = db oes gd c = á $ S 9 F = F + = 
ad eS = £e = FP ë X p ion = = = + F 
bd = p SS NL. de owe S Gk = = + = + 
abd PP p = = = = we ce $ + = = = = 
cd =< = f F = = Po we s ž s F F = = + 
a = = = F a = = po p s = Y Lc = = 
bea = p = Hé = = = we c Se = F = + = 
abcd + od d + + c qo d p $ + + + T + 


Thus, the effect of increasing the gap between the anode and the cathode from 0.80 to 
1.20 centimeters is to decrease the etch rate by 101.625 angstroms per minute. 
It is easy to verify (using Minitab, for example) that the complete set of effect estimates is 


A = —101.625 AD = —153.625 
B = —1.625 BD = —0.625 
AB = —7.875 ABD = 4.125 
C = 1,375 CD = —2.125 
AC = —24.875 ACD = 5.625 
BC = —43.875 BCD = —25.75 
ABC = —15.625 ABCD = —40.125 
D = 306.125 


The normal probability plot of these effects from the plasma etch experiment is shown in 
Fig. 14-21. Clearly, the main effects of A and D and the AD interaction are significant, because 
they fall far from the line passing through the other points. The analysis of variance summa- 
rized in Table 14-21 confirms these findings. Notice that in the analysis of variance we have 
pooled the three- and four-factor interactions to form the error mean square. If the normal 
probability plot had indicated that any of these interactions were important, they would not 
have been included in the error term. 

Since A = —101.625, the effect of increasing the gap between the cathode and anode is 
to decrease the etch rate. However, D — 306.125; thus, applying higher power levels will in- 
crease the etch rate. Figure 14-22 is a plot of the AD interaction. This plot indicates that the 
effect of changing the gap width at low power settings is small, but that increasing the gap at 
high power settings dramatically reduces the etch rate. High etch rates are obtained at high 
power settings and narrow gap widths. 

The residuals from the experiment can be obtained from the regression model 


. 101.625 306.125 153.625 
$ = 776.0625 — ; A t 5 Ip ; X1X4 
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For example, when both A and D are at the low level, the predicted value is 


$ = 716.0625 — (9:851 + (25-1) - (S595) yr 


and the four residuals at this treatment combination are 
e, = 550 — 597 = —47 e, = 604 — 597 
e, = 633 — 597 = 36 e, = 601 — 597 = 4 


| 
- 


The residuals at the other three treatment combinations (A high, D low), (A low, D high), and 
(A high, D high) are obtained similarly. A normal probability plot of the residuals is shown in 
Fig. 14-23. The plot is satisfactory. 


Table 14-21 Analysis of Variance for the Plasma Etch Experiment 


Source of Degrees of 

Variation Sum of Squares Freedom Mean Square fo P-Value 
A 41,310.563 I 41,310.563 20.28 0.0064 
B 10.563 1 10.563 «1 — 
C 217.563 l 217.563 <] — 
D 374,850.063 l 374,850.063 183.99 0.0000 
AB 248.063 l 248.063 «] — 
AC 2,475.063 1 2,475.063 1.21 0.3206 
AD 94.402.563 l 94,402.563 46.34 0.0010 
BG 7,700.063 1 7,700.063 3.78 0.1095 
BD 1.563 i 1.563 <] — 
CD 18.063 1 18.063 el — 
Error 10,186.813 5 2,037.363 


Total 531,420.938 


— 
Un 


Normal probability 


Figure 14-23 Normal 
probability plot of 
residuals from the 
plasma etch 
experiment. 
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14-7.4 Addition of Center Points to a 2" Design (CD Only) 


EXERCISES FOR SECTION 14-7 


14-13. An engineer is interested in the effect of cutting 
speed (A), metal hardness (B), and cutting angle (C) on the life 
of a cutting tool. Two levels of each factor are chosen, and two 
replicates of a 2° factorial design are run. The tool life data (in 
hours) are shown in the following table: 


Treatment Replicate 
Combination I II 
(1) 221 311 
a 325 435 
b 354 348 
ab 552 472 
c 440 453 
ac 406 377 
bc 605 500 
abc 392 419 


(a) Analyze the data from this experiment. 

(b) Find an appropriate regression model that explains tool 
life in terms of the variables used in the experiment. 

(c) Analyze the residuals from this experiment. 

14-14. Four factors are thought to influence the taste of a soft- 

drink beverage: type of sweetener (A), ratio of syrup to water (B), 

carbonation level (C), and temperature (D). Each factor can be 


—2 runattwo levels, producing a 2^ design. At each run in the design, 


samples of the beverage are given to a test panel consisting of 20 
people. Each tester assigns the beverage a point score from 1 to 
10. Total score is the response variable, and the objective is to find 
a formulation that maximizes total score. Two replicates of this 
design are run, and the results are shown in the table. Analyze the 
data and draw conclusions. Use o = 0.05 in the statistical tests. 


Treatment _ Replicate — 
Combination I II 
(1) 159 163 
a 168 175 
b 158 163 
ab 166 168 
C 175 178 
ac 179 183 
bc 173 168 
abc 179 182 
d 164 159 
ad 187 189 
bd 163 159 
abd 185 191 
cd 168 174 
acd 197 199 
bcd 170 174 
abcd 194 198 


@ 


(_) 


Ss 
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14-15. Consider the experiment in Exercise 14-14. 
Determine an appropriate model and plot the residuals against 
the levels of factors 4, B, C, and D. Also construct a normal 
probability plot of the residuals. Comment on these plots. 


14-16. The data shown here represent a single replicate of a 
2? design that is used in an experiment to study the compres- 
sive strength of concrete. The factors are mix (A), time (B), 
laboratory (C), temperature (D), and drying time (£). 


(1) = 700 e = 800 
a = 900 ae = 1200 
b = 3400 be = 3500 
ab = 5500 abe = 6200 
č = 600 ce = 600 
ac = 1000 ace = 1200 
bc = 3000 bce = 3006 
abe = 5300 abce = 5500 
d = 1000 de = 1900 
ad = 1100 ade = 1500 
bd = 3000 bde | — 4000 
abd = 6100 abde = 6500 
cd = 800 cde = 1500 
acd = 1100 acde = 2000 
bcd = 3300 bcde = 3400 
abcd = 6000 abcde = 6800 


(a) Estimate the factor effects. 

(b) Which effects appear important? Use a normal probability 
plot. 

(c) If it is desirable to maximize the strength, in which direc- 
tion would you adjust the process variables? 

(d) Analyze the residuals from this experiment. 


14-17. An article in the ZEEE Transactions on Semiconduc- 
tor Manufacturing (Vol. 5, no. 3, 1992, pp. 214—222) de- 


em scribes an experiment to investigate the surface charge on a 


= 


silicon wafer. The factors thought to influence induced surface 
charge are cleaning method (spin rinse dry or SRD and spin 
dry or SD) and the position on the wafer where the charge was 
measured. The surface charge (X 10!! q/cm°) response data are 
as shown. 


Test Position 


Cleaning 
Method 





(a) Estimate the factor effects. 
(b) Which factors appear important? Use a = 0.05. 
(c) Analyze the residuals from this experiment. 
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14-18. An experiment described by M. G. Natrella in the 
National Bureau of Standards Handbook of Experimental 
Statistics (No. 91, 1963) involves flame testing fabrics after 
applying fire-retardant treatments. The four factors considered 
are type of fabric (4), type of fire-retardant treatment (B), 
laundering condition (C—the low level is no laundering, the 
high level is after one laundering), and method of conducting 
the flame test (D). All factors are run at two levels, and the re- 
sponse variable is the inches of fabric burned on a standard 
size test sample. The data are: 


(1) 242 d — 40 
a =3l ad = 30 
b -—435 bd = 50 
ab = 29 abd = 25 
c = 39 cd = 40 
ac = 28 acd = 25 
bc = 46 bcd = 50 
abc = 32 abcd — 23 


(a) Estimate the effects and prepare a normal plot of the 
effects. 

(b) Construct an analysis of variance table based on the model 
tentatively identified in part (a). 

(c) Construct a normal probability plot of the residuals and 
comment on the results. 


14-19. An experiment was run in a semiconductor fabrica- 
tion plant in an effort to increase yield. Five factors, each at 
two levels, were studied. The factors (and levels) were 
A = aperture setting (small, large), B = exposure time (20% 
below nominal, 20% above nominal), C = development time 
(30 and 45 seconds), D — mask dimension (small, large), and 
E = etch time (14.5 and 15.5 minutes). The following un- 
replicated 2? design was run: 


(1 = 7 e = 8 
a = 9 ae = 12 
b = 34 be = 35 
ab = 55 abe = 52 
C = 16 ce = 15 
ac = 20 ace | — 22 
bc = 40 bce = 45 
abc = 60 abce 65 
d = 8 de = 6 
ad = 10 ade = 10 
bd = 32 bde = 30 
abd = 50 abde = 53 
cd = 18 cde = 15 
acd = 21 acde = 20 
bcd = 44 bcde = 41 
abcd = 61 abcde = 63 


(a) Construct a normal probability plot of the effect estimates. 
Which effects appear to be large? 
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(b) Conduct an analysis of variance to confirm your findings 
for part (a). 

(c) Construct a normal probability plot of the residuals. Is the 
plot satisfactory? 

(d) Plot the residuals versus the predicted yields and versus 
each of the five factors. Comment on the plots. 

(e) Interpret any significant interactions. 

(f) What are your recommendations regarding process oper- 
ating conditions? 

(g) Project the 2? design in this problem into a 2” for r < 5 
design in the important factors. Sketch the design and 
show the average and range of yields at each run. Does 
this sketch aid in data interpretation? 


14-20. Consider the data from Exercise 14-13. I suppose 
that the data from the second replicate was not available. 
Analyze the data from replicate I only and comment on your 
findings. 

14-21. An experiment has run a single replicate 
of a 2^ design and calculated the following factor 
effects: 


A 80.25 AB = 53.25 ABC = -2.95 

B = —65.50 AC = 11.00 ABD = —8.00 

C= -9.25 AD = 9.75 ACD = 10.25 

D = —20.50 BC = 18.36 BCD = -7.95 
BD = 15.10 ABCD = —6.25 
CD = -1.25 


(a) Construct a normal probability plot of the effects. 

(b) Identify a tentative model, based on the plot of effects in 
part (a). 

(c) Estimate the regression coefficients in this model, assum- 
ing that y = 400. 

14-22. A 2* factorial design was run in a chemical 

process. The design factors are 4 — time, B — concentration, 

C = pressure, and D = temperature. The response variable is 


yield. The data follows: 


Micld a; geet Chl Reve Sy 


Run A B C D (pounds) = F 


l ES = v = 12 A (hours) 25 3 
2 AEN 18 B(%) 14 18 
3 = a m m9 13 C (psi) 60 80 
4 F We s s 16 D(C) 200 250 
5 = á s = = I7 
6 | MEL 4 = 15 
7 = ade q m 20 
8 | a = 15 
9 = SE 10 
10 WELL = $ d 
11 HE ML. 13 
12 de es m f 24 
13 te =| d e 19 
14 MELLE = 21 
15 = a we p 17 
16 Te o P f 23 


(a) Estimate the factor effects. Based on a normal probability 
plot of the effect estimates, identify a model for the data 
from this experiment. 

(b) Conduct an ANOVA based on the model identified in part 
(a). What are your conclusions? 

(c) Analyze the residuals and comment on model adequacy. 

(d) Find a regression model to predict yield in terms of the ac- 
tual factor levels. 

(e) Can this design be projected into a 2? design with two 
replicates? If so, sketch the design and show the average 
and range of the two yield values at each cube corner. 
Discuss the practical value of this plot. 
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It is often impossible to run all the observations in a 2^ factorial design under homogeneous 
conditions. Blocking is the design technique that is appropriate for this general situation. 
However, in many situations the block size is smaller than the number of runs in the complete 
replicate. In these cases, confounding is a useful procedure for running the 2^ design in 2? 
blocks where the number of runs in a block is less than the number of treatment combinations 
in one complete replicate. The technique causes certain interaction effects to be indistinguish- 
able from blocks or confounded with blocks. We will illustrate confounding in the 2“ factorial 


design in 2? blocks, where p < k. 


Consider a 2? design. Suppose that each of the 2? — 4 treatment combinations requires 
four hours of laboratory analysis. Thus, two days are required to perform the experiment. If 
days are considered as blocks, we must assign two of the four treatment combinations to 


each day. 
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Figure 14-24 A2? 
design in two blocks. 
(a) Geometric view. (b) 
Assignment of the four 
runs to two blocks. 


Figure 14-25. The 2? 
design in two blocks 
with ABC confounded. 
(a) Geometric View. 
(b) Assignment of the 
eight runs to two 
blocks. 


CHAPTER 14 DESIGN OF EXPERIMENTS WITH SEVERAL FACTORS 


b ab 
+O 
@ = Run in block 1 Pek. Block 2 
o = Run in block 2 (1) 
b 
(1) s 
NE Od 
— + 
A Assignment of the four 
Geometric view runs to two blocks 


(a) (b) 


This design is shown in Fig. 14-24. Notice that block 1 contains the treatment combina- 
tions (1) and ab and that block 2 contains a and b. The contrasts for estimating the main 
effects of factors A and B are 


— ab * a — b — (1) 
— ab * b — a — (1) 


Contrast 4 


Contrast, 


Note that these contrasts are unaffected by blocking since in each contrast there is one plus 
and one minus treatment combination from each block. That is, any difference between block 
1 and block 2 that increases the readings in one block by an additive constant cancels out. The 
contrast for the AB interaction is 


Contrastyg = ab + (1) -a— b 


Since the two treatment combinations with the plus signs, ab and (1), are in block 1 and the 
two with the minus signs, a and 5, are in block 2, the block effect and the AB interaction are 
identical. That is, the AB interaction 1s confounded with blocks. 

The reason for this is apparent from the table of plus and minus signs for the 2? design 
shown in Table 14-12. From the table we see that all treatment combinations that have a plus 
on AB are assigned to block 1, whereas all treatment combinations that have a minus sign on 
AB are assigned to block 2. 

This scheme can be used to confound any 2^ design in two blocks. As a second example, 
consider a 2? design, run in two blocks. From the table of plus and minus signs, shown in Table 
14-15, we assign the treatment combinations that are minus in the ABC column to block 1 and 
those that are plus in the ABC column to block 2. The resulting design is shown in Fig. 14-25. 


bc abc 

e = Run in block 1 Block 1 Block 2 

o = Run in block 2 
(1) a 
ab b 

C 
ac C 

B bc abc 





Assignment of the eight 
runs to two blocks 


(b) 


Geometric view 


(a) 
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There is a more general method of constructing the blocks. The method employs a 
defining contrast, say 


L= QX] + QL5X» Tn QAX k (14-24) 


where x; is the level of the ith factor appearing in a treatment combination and a; is the expo- 
nent appearing on the ith factor in the effect that is to be confounded with blocks. For the 2^ 
system, we have either o; = 0 or 1, and either x, = 0 (low level) or x; = 1 (high level). 
Treatment combinations that produce the same value of L (modulus 2) will be placed in the 
same block. Since the only possible values of L (mod 2) are 0 and 1, this will assign the 2^ 
treatment combinations to exactly two blocks. 

As an example, consider the 2? design with ABC confounded with blocks. Here x, corre- 
sponds to A, x, to B, x, to C, and a, = a, = a; = 1. Thus, the defining contrast that would be 
used to confound ABC with blocks is 


L=x, tx + x 


To assign the treatment combinations to the two blocks, we substitute the treatment combina- 
tions into the defining contrast as follows: 
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Thus (1), ab, ac, and bc are run in block 1, and a, b, c, and abc are run in block 2. This same 
design is shown in Fig. 14-25. 

A shortcut method is useful in constructing these designs. The block containing the treat- 
ment combination (1) is called the principal block. Any element [except (1)] in the principal 
block may be generated by multiplying two other elements in the principal block modulus 2 
on the exponents. For example, consider the principal block of the 2? design with ABC 
confounded, shown in Fig. 14-25. Note that 


ab + ac = abc = bc 
ab * bc = ab*c = ac 


abc^ = ab 


ac * bc 


Treatment combinations in the other block (or blocks) may be generated by multiplying 
one element in the new block by each element in the principal block modulus 2 on the 
exponents. For the 2? with ABC confounded, since the principal block is (1), ab, ac, and be, 
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EXAMPLE 14-6 





we know that the treatment combination b is in the other block. Thus, elements of this 
second block are 


b» (1) =b 
b- ab = ab’ =a 
b * ac — abc 
b- bc = b’c =c 


An experiment is performed to investigate the effect of four factors on the terminal miss dis- 
tance of a shoulder-fired ground-to-air-missile. The four factors are target type (A), seeker 
type (B), target altitude (C), and target range (D). Each factor may be conveniently run at two 
levels, and the optical tracking system will allow terminal miss distance to be measured to the 
nearest foot. Two different operators or gunners are used in the flight test and, since there may 
be differences between operators, the test engineers decided to conduct the 2^ design in two 
blocks with ABCD confounded. Thus, the defining contrast is 


L=x,t+x%+%x%34+ x4 


The experimental design and the resulting data are shown in Fig. 14-26. The effect esti- 
mates obtained from Minitab are shown in Table 14-22. A normal probability plot of the 
effects in Fig. 14-27 reveals that A (target type), D (target range), AD, and AC have large 
effects. A confirming analysis of variance, pooling the three-factor interactions as error, is 
shown in Table 14-23. Since the AC and AD interactions are significant, it is logical to 
conclude that A (target type), C (target altitude), and D (target range) all have important 
effects on the miss distance and that there are interactions between target type and alti- 
tude and target type and range. Notice that the ABCD effect is treated as blocks in this 
analysis. 





Assignment of the sixteen 
e — Run in block 1 B runs to two blocks 


o = Run in block 2 


(b) 


Geometric view 


(a) 


Figure 14-26 The 2* design in two blocks for Example 14-6. (a) Geometric view. (b) Assignment of the 
16 runs to two blocks. 
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Table 14-22 Minitab Effect Estimates for 
Example 14-6 


Estimated Effects and Coefficients for Distance 





Term Effect Coef 

Constant 6.938 

Block 0.063 

A 2.625 1:512 

B 0.625 0.313 

C 0.875 0.438 

D 1.875 0.938 A* 
AB —0:125 — 0.063 1 "D 
AC -2.375 -1187 2 "n 
AD 1.625 0.813 Li 0 

BC 70.375 -0188 £$ 

BD —0.375 -0487 ^ , 

CD —0.125 — 0.062 P 

ABC —0.125 — 0.063 : 

ABD 0.875 0.438 -2 2 
ACD =() 395 =(), 127 Effect 

BCD —0.375 —0.187 Figure 14-27 Normal probability plot of the 


effects from Minitab, Example 14-6. 


It is possible to confound the 2^ design in four blocks of 2* ? observations each. To con- 
struct the design, two effects are chosen to confound with blocks, and their defining contrasts 
are obtained. A third effect, the generalized interaction of the two effects initially chosen, is 
also confounded with blocks. The generalized interaction of two effects is found by multiply- 
ing their respective letters and reducing the exponents modulus 2. 

For example, consider the 2* design in four blocks. If AC and BD are confounded with 
blocks, their generalized interaction is (4C )(BD) = ABCD. The design is constructed by using 


Table 14-23 Analysis of Variance for Example 14-6 


Source of Sum of Degrees of Mean 
Variation Squares Freedom Square fo P-Value 
Blocks (ABCD) 0.0625 l 0.0625 0.06 — 
A 27.5625 1 27.5625 25.94 0.0070 
B 1.5625 l 1.5625 1.47 0.2920 
C 3.0625 l 3.0625 2.88 0.1648 
D 14.0625 1 14.0625 13.24 0.0220 
AB 0.0625 1 0.0625 0.06 — 
AC 22.5625 l 22.5625 21.24 0.0100 
AD 10.5625 l 10.5625 9.94 0.0344 
BC 0.5625 1 0.5625 0.53 — 
BD 0.5625 l 0.5625 0.53 — 
CD 0.0625 l 0.0625 0.06 — 
Error (ABC + ABD + ACD + BCD) 4.2500 4 1.0625 
Total 84.9375 15 
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the defining contrasts for AC and BD: 
Li =x, + x3 
Lə = x) t x4 
It is easy to verify that the four blocks are 


Block 1 Block 2 Block 3 Block 4 


= 0, L, = =], 4L, =0 =0,L,=1 =1,L,=1 


[i| [i H [E 


This general procedure can be extended to confounding the 2^ design in 2” blocks, where 
p < k. Start by selecting p effects to be confounded, such that no effect chosen is a general- 
ized interaction of the others. Then the blocks can be constructed from the p defining contrasts 
Li, Lo,..., Lp that are associated with these effects. In addition to the p effects chosen to be 
confounded, exactly 27 — p — 1 additional effects are confounded with blocks; these are the 
generalized interactions of the original p effects chosen. Care should be taken so as not to 
confound effects of potential interest. 

For more information on confounding in the 2^ factorial design, refer to Montgomery 
(2001, Chapter 7). This book contains guidelines for selecting factors to confound with blocks 
so that main effects and low-order interactions are not confounded. In particular, the book 
contains a table of suggested confounding schemes for designs with up to seven factors and a 


range of block sizes, some of which are as small as two runs. 


EXERCISES FOR SECTION 14-8 


14-23. Consider the data from the first replicate of Exercise 
14-13. Suppose that these observations could not all be run 
under the same conditions. Set up a design to run these obser- 
vations in two blocks of four observations each, with ABC 
confounded. Analyze the data. 


14-24. Consider the data from the first replicate of Exercise 
14-14. Construct a design with two blocks of eight observa- 
tions each, with ABCD confounded. Analyze the data. 


14-25. Repeat Exercise 14-24 assuming that four blocks 
are required. Confound ABD and ABC (and consequently CD) 
with blocks. 

14-26. Construct a 2? design in two blocks. Select the 
ABCDE interaction to be confounded with blocks. 

14-27. Construct a 2? design in four blocks. Select the ap- 
propriate effects to confound so that the highest possible in- 
teractions are confounded with blocks. 

14-28. Consider the data from Exercise 14-18. Construct the 
design that would have been used to run this experiment in two 
blocks of eight runs each. Analyze the data and draw conclusions. 
14-29.  Anarticle in /ndustrial and Engineering Chemistry 
(*Factorial Experiments in Pilot Plant Studies," 1951, 


pp. 1300-1306) reports on an experiment to investigate the 
effect of temperature (A), gas throughput (B), and concentra- 
tion (C) on the strength of product solution in a recirculation 
unit. Two blocks were used with ABC confounded, and the ex- 
periment was replicated twice. The data are as follows: 


Replicate 1 
Block 1 


Block 2 





Replicate 2 


Block 3 Block 4 





(a) Analyze the data from this experiment. 

(b) Analyze the residuals and comment on model ade- 
quacy. 

(c) Comment on the efficiency of this design. Note that we 
have replicated the experiment twice, yet we have no 
information on the ABC interaction. 
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(d) Suggest a better design, specifically, one that would 
provide some information on a// interactions. 


14-30. Consider the 2° factorial design. Set up a design to 
be run in four blocks of 16 runs each. Show that a design that 
confounds three of the four-factor interactions with blocks 1s 
the best possible blocking arrangement. 


14-9 FRACTIONAL REPLICATION OF THE 2* DESIGN 


As the number of factors in a 2* factorial design increases, the number of runs required 
increases rapidly. For example, a 2° requires 32 runs. In this design, only 5 degrees of free- 
dom correspond to main effects, and 10 degrees of freedom correspond to two-factor 
interactions. Sixteen of the 31 degrees of freedom are used to estimate high-order interac- 
tions—that is, three-factor and higher order interactions. Often there is little interest in 
these high-order interactions, particularly when we first begin to study a process or system. 
If we can assume that certain high-order interactions are negligible, a fractional factorial 
design involving fewer than the complete set of 2^ runs can be used to obtain information 
on the main effects and low-order interactions. In this section, we will introduce fractional 
replications of the 2 design. 

A major use of fractional factorials is in screening experiments. These are experiments 
in which many factors are considered with the purpose of identifying those factors (if any) that 
have large effects. Screening experiments are usually performed in the early stages of a 
project when it is likely that many of the factors initially considered have little or no effect 
on the response. The factors that are identified as important are then investigated more thor- 
oughly in subsequent experiments. 


14-9.1 One-Half Fraction of the 2* Design 


A one-half fraction of the 2^ design contains 2^ ! runs and is often called a 2^ ! fractional fac- 
torial design. As an example, consider the 2?! design—that is, a one-half fraction of the 2°. 
This design has only four runs, in contrast to the full factorial that would require eight runs. 
The table of plus and minus signs for the 2? design is shown in Table 14-24. Suppose we se- 
lect the four treatment combinations a, 5, c, and abc, as our one-half fraction. These treatment 
combinations are shown in the top half of Table 14-24 and in Fig. 14-28(a). 

Notice that the 2?! design is formed by selecting only those treatment combinations that 
yield a plus on the ABC effect. Thus, ABC is called the generator of this particular fraction. 


Table 14-24 Plus and Minus Signs for the 2? Factorial Design 


Treatment Factorial Effect 
Combination 1 A B C AB AC BC ABC 
id T + - = - - " ji 
b + — + m =: a EE 4 
E T — = + + — = d 
abc 4 -— + 3 a 4 i + 
ab + -- + = a = = = 
ac + T — + = 4+ — u 
bc + — + gie — _ ate = 
(1) t a 7 - + + + = 
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Figure 14-28 The 
one-half fractions of 
the 2? design. (a) The 
principal fraction, 

I = «t ABC. (b) The 
alternate fraction, 
I= —ABC. 





The principal fraction, J = +ABC The alternate fraction, J = -ABC 
(a) (b) 


Furthermore, the identity element / is also plus for the four runs, so we call 
I = ABC 


the defining relation for the design. 

The treatment combinations in the 2°~' design yields three degrees of freedom associated 
with the main effects. From the upper half of Table 14-24, we obtain the estimates of the main 
effects as linear combinations of the observations, say, 


A= hla — b — c + abc] 

B= V [—a * b — c * abc] 

C= I5 =a 5-83 356) 
It is also easy to verify that the estimates of the two-factor interactions should be the follow- 
ing linear combinations of the observations: 

BC = ‘la — b — c + abc] 

AC = M [—a +b — c + abc] 

AB 


Uj a=b tet abe] 


Thus, the linear combination of observations in column A, €,, estimates both the main effect 
of A and the BC interaction. That is, the linear combination £, estimates the sum of these two 
effects A + BC. Similarly, €g estimates B + AC, and €c estimates C + AB. Two or more ef- 
fects that have this property are called aliases. In our 2^! design, A and BC are aliases, B and 
AC are aliases, and C and AB are aliases. Aliasing is the direct result of fractional replication. 
In many practical situations, it will be possible to select the fraction so that the main effects 
and low-order interactions that are of interest will be aliased only with high-order interactions 
(which are probably negligible). 

The alias structure for this design is found by using the defining relation J = ABC. 
Multiplying any effect by the defining relation yields the aliases for that effect. In our 
example, the alias of A 1s 


A= As ABC =A BC = BC 
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since A + I = A and A* = I. The aliases of B and C are 
B-—B-ABC = AB’C = AC 
and 
C-—C-ABC = ABC’ = AB 


Now suppose that we had chosen the other one-half fraction, that is, the treatment com- 
binations in Table 14-24 associated with minus on ABC. These four runs are shown in the 
lower half of Table 14-24 and in Fig. 14-28(b). The defining relation for this design is 
I = —ABC. The aliases are A = —BC, B = —AC, and C = —AB. Thus, estimates of A, B, and 
C that result from this fraction really estimate 4 — BC, B — AC, and C — AB. In practice, it 
usually does not matter which one-half fraction we select. The fraction with the plus sign in 
the defining relation is usually called the principal fraction, and the other fraction is usually 
called the alternate fraction. 

Note that if we had chosen AB as the generator for the fractional factorial, 


A-—A*AB-—B 


and the two main effects of 4 and B would be aliased. This typically loses important information. 

Sometimes we use sequences of fractional factorial designs to estimate effects. For 
example, suppose we had run the principal fraction of the 2^ ! design with generator ABC. 
From this design we have the following effect estimates: 


Suppose that we are willing to assume at this point that the two-factor interactions are negli- 
gible. If they are, the 2°~' design has produced estimates of the three main effects A, B, and C. 
However, if after running the principal fraction we are uncertain about the interactions, it 1s 
possible to estimate them by running the alternate fraction. The alternate fraction produces 
the following effect estimates: 


€,=A-—BC 
(5 = B — AC 
(c = C — AB 


We may now obtain de-aliased estimates of the main effects and two-factor interactions 
by adding and subtracting the linear combinations of effects estimated in the two individual 
fractions. For example, suppose we want to de-alias A from the two-factor interaction BC. 
Since €, = A + BC and €, = A — BC, we can combine these effect estimates as follows: 


1 1 
and 


j l 


N| = 
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For all three pairs of effect estimates, we would obtain the following results: 


DICES from / (l; + 1) from 7 (I; — Ii) 

=A (A + BC+ A—- BC)- A ^ [4 + BC — (A — BC)] = BC 
|= B (B+ AC + B—AC)=B [B + AC — (B — AC)] = AC 
pe (C+ AB-- C — AB) - C [C+ AB — (C — AB)] = AB 


Thus, by combining a sequence of two fractional factorial designs, we can isolate both the 
main effects and the two-factor interactions. This property makes the fractional factorial de- 
sign highly useful in experimental problems since we can run sequences of small, efficient ex- 
periments, combine information across several experiments, and take advantage of learning 
about the process we are experimenting with as we go along. This is an illustration of the con- 
cept of sequential experimentation. 

A 2* ! design may be constructed by writing down the treatment combinations for a full 
factorial with k — 1 factors, called the basic design, and then adding the Ath factor by identi- 
fying its plus and minus levels with the plus and minus signs of the highest order interaction. 
Therefore, a 2?! fractional factorial is constructed by writing down the basic design as a full 
2? factorial and then equating factor C with the AB interaction. Thus, to construct the prin- 
cipal fraction, we would use C = -- AB as follows: 


Basic Design Fractional Design 
Full 22 2971 1 = +ABC 
A B A B C — AB 
— — -— — -— 
-— — -- — - 
-— -— -- -- -— 
To obtain the alternate fraction we would equate the last column to C = —AB. 


To illustrate the use of a one-half fraction, consider the plasma etch experiment described in 
Example 14-5. Suppose that we decide to use a 247! design with J = ABCD to investigate the 
four factors gap (A), pressure (B), C,F, flow rate (C), and power setting (D). This design 
would be constructed by writing down as the basic design a 2? in the factors A, B, and C and 
then setting the levels of the fourth factor D = ABC. The design and the resulting etch rates 
are shown in Table 14-25. The design is shown graphically in Fig. 14-29. 


Table 14-25 The 2* ! Design with Defining Relation J = ABCD 


Treatment Etch 
A B C D — ABC Combination Rate 
= — = = (1) 550 
+ — — + ad 749 
— + — + bd 1052 
+ + — B ab 650 
— — + + cd 1075 
+ — + = ac 642 
— + + — bc 601 
+ + + + abcd 729 
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In this design, the main effects are aliased with the three-factor interactions; note that the 
alias of A is 


A-I=A-+ABCD or A = A*BCD = BCD 


and similarly B = ACD, C = ABD, and D = ABC. 
The two-factor interactions are aliased with each other. For example, the alias of AB is CD: 


AB -I = AB - ABCD or AB = A^B^CD = CD 


The other aliases are AC = BD and AD = BC. 
The estimates of the main effects and their aliases are found using the four columns of 
signs in Table 14-25. For example, from column A we obtain the estimated effect 


€, = A+ BCD = 1(—550 + 749 — 1052 + 650 — 1075 + 642 — 601 + 729) 
-127.00 


The other columns produce 


£g = B+ ACD = 400 £o — C ABD = 11.50. and €) — D + ABC = 290.50 


Clearly, € , and fp are large, and if we believe that the three-factor interactions are negligible, 
the main effects A (gap) and D (power setting) significantly affect etch rate. 

The interactions are estimated by forming the AB, AC, and AD columns and adding them 
to the table. For example, the signs in the 4B column are +, —, —, +, +, —, —, +, and this 
column produces the estimate 


lig = AB + CD = 1 (550 — 749 — 1052 + 650 + 1075 — 642 — 601 + 729) = —10 
From the AC and AD columns we find 
lac = AC + BD = —25.50 and lip = AD + BC = —197.50 
The £ p estimate is large; the most straightforward interpretation of the results is that since A 


and D are large, this is the AD interaction. Thus, the results obtained from the 2^! design 
agree with the full factorial results in Example 14-5. 





abcd = 729 


(1) = 550 ad = 749 


Figure 14-29 The 2^! design for the experiment of Example 14-7. 


554 


CHAPTER 14 DESIGN OF EXPERIMENTS WITH SEVERAL FACTORS 


Table 14-26 Effect Estimates from Minitab, 
Example 14-7 


Fractional Factorial Fit 


1:5 
Estimated Effects and Coefficients for Etch Rt 10 ii 
Term Effect Coef £ 05 
Constant 756.00 2 0.0 i 
Gap — 127.00 —63.50 = P į 
Pressure 4.00 200 <= ` T: 
F. 11.50 5.75 ze 
Power 290.50 14525 =g E 
Gap*Pressure — 10.00 — 5.00 a T EIE OR 
Gap*F. —25.50 4975 KM 
Gap*Power — 197.50 —98.75 Figure 14-30 Normal probability plot of 


the effects from Minitab, Example 14-7. 


Computer Solution 

Fractional factorial designs are usually analyzed with a software package. Table 14-26 shows 
the effect estimates obtained from Minitab for Example 14-7. They are in agreement with the 
hand calculation reported earlier. 


Normal Probability Plot of Effects 

The normal probability plot is very useful in assessing the significance of effects from a frac- 
tional factorial design, particularly when many effects are to be estimated. We strongly rec- 
ommend examining this plot. Figure 14-30 presents the normal probability plot of the effects 
from Example 14-7. This plot was obtained from Minitab. Notice that the 4, D, and AD inter- 
action effects stand out clearly in this graph. 


Residual Analysis 

The residuals can be obtained from a fractional factorial by the regression model method 
shown previously. Note that the Minitab output for Example 14-7 in Table 14-26 shows the 
regression coefficients. The residuals should be graphically analyzed as we have discussed 
before, both to assess the validity of the underlying model assumptions and to gain additional 
insight into the experimental situation. 


Projection of the 2*^! Design 

If one or more factors from a one-half fraction of a 2^ can be dropped, the design will project 
into a full factorial design. For example, Fig. 14-31 presents a 2? ! design. Notice that this de- 
sign will project into a full factorial in any two of the three original factors. Thus, if we think 
that at most two of the three factors are important, the 2? ^! design is an excellent design for 
identifying the significant factors. This projection property is highly useful in factor screen- 
ing, because it allows negligible factors to be eliminated, resulting in a stronger experiment in 
the active factors that remain. 

In the 2*~' design used in the plasma etch experiment in Example 14-7, we found that two 
of the four factors (B and C) could be dropped. If we eliminate these two factors, the remain- 
ing columns in Table 14-25 form a 2? design in the factors A and D, with two replicates. This 
design is shown in Fig. 14-32. The main effects of A and D and the strong two-factor AD in- 
teraction are clearly evident from this graph. 
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+1 
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S D (Power) 
(550, 601) 
-1 
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C A (Gap) 
Figure 14-31 Projection of a 2°” ' design into Figure 14-32 The 2? design obtained by 
three 2? designs. dropping factors B and C from the plasma 


etch experiment in Example 14-7. 


Design Resolution 

The concept of design resolution is a useful way to catalog fractional factorial designs ac- 
cording to the alias patterns they produce. Designs of resolution III, IV, and V are particularly 
important. The definitions of these terms and an example of each follow. 


1. Resolution III Designs. These are designs in which no main effects are aliased with 
any other main effect, but main effects are aliased with two-factor interactions and 
some two-factor interactions may be aliased with each other. The 2^! design with 
I = ABC is a resolution III design. We usually employ a Roman numeral subscript to 
indicate design resolution; thus, this one-half fraction is a 2},;' design. 


2. Resolution IV Designs. These are designs in which no main effect is aliased with 
any other main effect or two-factor interactions, but two-factor interactions are 
aliased with each other. The 2^! design with J = ABCD used in Example 14-7 is a 
resolution IV design (2%! ). 


3. Resolution V Designs. These are designs in which no main effect or two-factor 
interaction 1s aliased with any other main effect or two-factor interaction, but two- 
factor interactions are aliased with three-factor interactions. The 2?! design with 
I = ABCDE is a resolution V design (23; ! ). 


Resolution III and IV designs are particularly useful in factor screening experiments. A reso- 
lution IV design provides good information about main effects and will provide some infor- 
mation about all two-factor interactions. 


14-9.2 Smaller Fractions: The 2* Fractional Factorial 


Although the 2^ ! design is valuable in reducing the number of runs required for an experi- 
ment, we frequently find that smaller fractions will provide almost as much useful informa- 
tion at even greater economy. In general, a 2^ design may be run in a 1/2? fraction called a 
2* ? fractional factorial design. Thus, a 1/4 fraction is called a 2* ? design, a 1/8 fraction is 
called a 2^? design, a 1/16 fraction a 2^ ^ design, and so on. 
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To illustrate the 1/4 fraction, consider an experiment with six factors and suppose that the 
engineer is primarily interested in main effects but would also like to get some information 
about the two-factor interactions. A 2° ' design would require 32 runs and would have 
3] degrees of freedom for estimating effects. Since there are only six main effects and 15 two- 
factor interactions, the one-half fraction is inefficient —it requires too many runs. Suppose we 
consider a 1/4 fraction, or a 2° ? design. This design contains 16 runs and, with 15 degrees of 
freedom, will allow all six main effects to be estimated, with some capability for examining 
the two-factor interactions. 

To generate this design, we would write down a 2^ design in the factors A, B, C, and D as 
the basic design and then add two columns for E and F. To find the new columns we could se- 
lect the two design generators / = ABCE and J = BCDF. Thus, column E would be found 
from E — ABC, and column F would be F — BCD. That is, columns ABCE and BCDF are 
equal to the identity column. However, we know that the product of any two columns in the 
table of plus and minus signs for a 2* design is just another column in the table; therefore, the 
product of ABCE and BCDF or ABCE(BCDF) = AB^C*DEF = ADEF is also an identity col- 
umn. Consequently, the complete defining relation for the 2° ? design is 


I = ABCE = BCDF = ADEF 


We refer to each term in a defining relation (such as ABCE above) as a word. To find the alias 
of any effect, simply multiply the effect by each word in the foregoing defining relation. For 
example, the alias of A 1s 


A = BCE = ABCDF = DEF 


The complete alias relationships for this design are shown in Table 14-27. In general, the res- 
olution of a 2* ? design is equal to the number of letters in the shortest word in the complete 
defining relation. Therefore, this is a resolution IV design; main effects are aliased with three- 
factor and higher interactions, and two-factor interactions are aliased with each other. This 
design would provide good information on the main effects and would give some idea about 
the strength of the two-factor interactions. The construction and analysis of the design are 
illustrated in Example 14-8. 


Parts manufactured in an injection-molding process are showing excessive shrinkage, which 
is causing problems in assembly operations upstream from the injection-molding area. In an 
effort to reduce the shrinkage, a quality-improvement team has decided to use a designed 
experiment to study the injection-molding process. The team investigates six factors—mold 
temperature (4), screw speed (5), holding time (C), cycle time (D), gate size (£), and holding 


Table 14-27 Alias Structure for the 267% Design with I = ABCE = 
BCDF = ADEF 


A = BCE = DEF = ABCDF 
B — ACE = CDF = ABDEF 
C = ABE = BDF = ACDEF 
D = BCF = AEF = ABCDE 
E = ABC = ADF = BCDEF 
F = BCD = ADE = ABCEF 
ABD = CDE = ACF = BEF 
ACD = BDE = ABF = CEF 


AB = CE = ACDF = BDEF 
AC = BE = ABDF = CDEF 
AD = EF = BCDE = ABCF 
AE = BC = DF = ABCDEF 
AF = DE = BCEF = ABCD 
BD = CF = ACDE = ABEF 
BF = CD = ACEF = ABDE 
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Table 14-28 A 2%? Design for the Injection-Molding Experiment in Example 14-8 


Observed 
Shrinkage 
Run A B C D E = ABC F = BCD (X10) 
1 = = = = = = 6 
2 + — — — + — 10 
3 = + — — + + 32 
4 + + — — — T 60 
5 = = + — + + 4 
6 + B + — — + 15 
7 — + + — — = 26 
8 + + + — + — 60 
9 = — = + — + 8 
10 + x — + + + 12 
11 = + — + + = 34 
12 + + — + — — 60 
13 = — + + + — 16 
14 + — + + — — 5 
15 = + + — + 37 
16 + + + + + + 22 


pressure (F )—each at two levels, with the objective of learning how each factor affects shrink- 
age and obtaining preliminary information about how the factors interact. 

The team decides to use a 16-run two-level fractional factorial design for these six factors. 
The design is constructed by writing down a 2^ as the basic design in the factors A, B, C, and D 
and then setting E — ABC and F — BCD as discussed above. Table 14-28 shows the design, along 
with the observed shrinkage (7X 10) for the test part produced at each of the 16 runs in the design. 

A normal probability plot of the effect estimates from this experiment is shown in Fig. 14-33. 
The only large effects are A (mold temperature), B (screw speed), and the AB interaction. 
In light of the alias relationship in Table 14-27, it seems reasonable to tentatively adopt 
these conclusions. The plot of the AB interaction in Fig. 14-34 shows that the process is 
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Figure 14-33 Normal probability plot of effects temperature—screw speed) interaction 
for Example 14-8. for Example 14-8. 
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insensitive to temperature if the screw speed is at the low level but sensitive to temperature 
if the screw speed is at the high level. With the screw speed at a low level, the process 
should produce an average shrinkage of around 10% regardless of the temperature level 
chosen. 

Based on this initial analysis, the team decides to set both the mold temperature and the 
screw speed at the low level. This set of conditions should reduce the mean shrinkage of parts 
to around 10%. However, the variability in shrinkage from part to part is still a potential prob- 
lem. In effect, the mean shrinkage can be adequately reduced by the above modifications; 
however, the part-to-part variability in shrinkage over a production run could still cause prob- 
lems in assembly. One way to address this issue is to see if any of the process factors affect the 
variability in parts shrinkage. 

Figure 14-35 presents the normal probability plot of the residuals. This plot appears 
satisfactory. The plots of residuals versus each factor were then constructed. One of these 
plots, that for residuals versus factor C (holding time), is shown in Fig. 14-36. The plot 
reveals much less scatter in the residuals at the low holding time than at the high holding 
time. These residuals were obtained in the usual way from a model for predicted 
shrinkage 


$ = Bo + Bix, + Bx) + Bioxix, 
5973195 + 6 93755, + 17:810: 25:037 5 5 


where x, x2, and x,x, are coded variables that correspond to the factors A and B and the AB 
interaction. The residuals are then 


e=y-ĵ 
The regression model used to produce the residuals essentially removes the location ef- 


fects of A, B, and AB from the data; the residuals therefore contain information about 
unexplained variability. Figure 14-36 indicates that there is a pattern in the variability and 
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Figure 14-35 Normal probability plot of resid- Figure 14-36 Residuals versus holding time 
uals for Example 14-8. (C) for Example 14-8. 
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that the variability in the shrinkage of parts may be smaller when the holding time is at the 
low level. 

Figure 14-37 shows the data from this experiment projected onto a cube in the factors A, 
B, and C. The average observed shrinkage and the range of observed shrinkage are shown at 
each corner of the cube. From inspection of this figure, we see that running the process with 
the screw speed (B) at the low level is the key to reducing average parts shrinkage. If B is low, 
virtually any combination of temperature (4) and holding time (C) will result in low values of 
average parts shrinkage. However, from examining the ranges of the shrinkage values at each 
corner of the cube, it is immediately clear that setting the holding time (C) at the low level is 
the most appropriate choice if we wish to keep the part-to-part variability in shrinkage low 
during a production run. 


The concepts used in constructing the 29? fractional factorial design in Example 14-8 
can be extended to the construction of any 2 ? fractional factorial design. In general, a 2^ frac- 
tional factorial design containing 2%” runs is called a 1/2? fraction of the 2^ design or, more 
simply, a 2^ ? fractional factorial design. These designs require the selection of p independent 
generators. The defining relation for the design consists of the p generators initially chosen 
and their 27 — p — 1 generalized interactions. 

The alias structure may be found by multiplying each effect column by the defining rela- 
tion. Care should be exercised in choosing the generators so that effects of potential interest 
are not aliased with each other. Each effect has 2? — 1 aliases. For moderately large values of 
k, we usually assume higher order interactions (say, third- or fourth-order or higher) to be neg- 
ligible, and this greatly simplifies the alias structure. 

It is important to select the p generators for the 2^ ? fractional factorial design in such 
a way that we obtain the best possible alias relationships. A reasonable criterion is to 
select the generators so that the resulting 2^ ? design has the highest possible design res- 
olution. Montgomery (2001) presents a table of recommended generators for 2^? frac- 
tional factorial designs for k = 15 factors and up to as many as n S 128 runs. A portion 
of his table is reproduced here as Table 14-29. In this table, the generators are shown with 
either + or — choices; selection of all generators as + will give a principal fraction, while 
if any generators are — choices, the design will be one of the alternate fractions for the 
same family. The suggested generators in this table will result in a design of the highest 
possible resolution. Montgomery (2001) also gives a table of alias relationships for these 
designs. 
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Table 14-29 Selected 2" ? Fractional Factorial Designs 


Number of 
Factors Number of 
k Fraction Runs 
3 25 4 
2 8 
5 21 16 
2i? 8 
6 20 32 
2c 16 
2h 8 
7 2m 64 
25 32 
20 16 
2m 8 
8 25 64 
an 32 
a 16 
9 2a 128 
2 64 
2 32 
20? 16 


Source: Montgomery (2001) 


Design 
Generators 
C= +AB 
= +ABC 
= +ABCD 
= AB 
E = ŁAC 


= +ABCDE 


E = *ABC 
F = *BCD 
= +AB 
E = *AC 
F = *BC 


= +tABCDEF 


E = =ABC 
= tABDE 
= ABC 

F = =BCD 
= +ACD 
= +AB 

E = +AC 

F = +BC 
= +ĄBC 
= +ĄBCD 
= tABEF 
= ABC 

G = =ABD 
= tBCDE 
= +BCD 

F = ACD 

G = =ABC 

H = =ABD 


= tACDFG 


J = =BCEFG 
= tABCD 
= tACEF 

J = +CDEF 

F = =BCDE 

G = =ACDE 
= +ABDE 
= tABCE 
= ABC 
= *BCD 
= =ACD 
= +ABD 

J = ABCD 


Number of 
Factors 
k 


10 


11 


Fraction 


10—3 
2y 


10—4 
Ziv 


10-5 
2iv 


10-6 
2n 


1155 
Ziv 


11—6 
Ziv 


1157 
2ni 


Number of 
Runs 


128 


64 


32 


16 


64 


22 


16 


Design 
Generators 


H = =ABCG 
J = tACDE 
K = =ACDF 
= +BCDF 
= tACDF 
= tABDE 
K = =ABCE 
= tABCD 
= tABCE 
H = =ABDE 
= tACDE 
= *BCDE 
E = =ABC 
= +BCD 
= +ACD 
= tABD 
= tABCD 
= +AB 
= tCDE 
= +ABCD 
= tABF 
K = =BDEF 
= *ADEF 
= ABC 
= +BCD 
= *CDE 
= +ACD 
K = =ADE 
= +BDE 
E = =ABC 
F = =BCD 
= +ACD 
= tABD 
J = =ABCD 
= +AB 
= +AC 


EXAMPLE 14-9 


14-9 FRACTIONAL REPLICATION OF THE 2* DESIGN 561 


To illustrate the use of Table 14-29, suppose that we have seven factors and that we are 
interested in estimating the seven main effects and obtaining some insight regarding 
the two-factor interactions. We are willing to assume that three-factor and higher interactions 
are negligible. This information suggests that a resolution IV design would be appropriate. 

Table 14-29 shows that two resolution IV fractions are available: the 2/y ^ with 32 runs 
and the 2|? with 16 runs. The aliases involving main effects and two- and three-factor inter- 
actions for the 16-run design are presented in Table 14-30. Notice that all seven main effects 
are aliased with three-factor interactions. All the two-factor interactions are aliased in groups 
of three. Therefore, this design will satisfy our objectives; that is, it will allow the estimation 
of the main effects, and it will give some insight regarding two-factor interactions. It is not 
necessary to run the 2), ^ design, which would require 32 runs. The construction of the 247° 
design is shown in Table 14-31. Notice that it was constructed by starting with the 16-run 
2* design in A, B, C, and D as the basic design and then adding the three columns E = ABC, 
F = BCD, and G = ACD as suggested in Table 14-29. Thus, the generators for this design 
are / = ABCE, I = BCDF, and I = ACDG. The complete defining relation is / = ABCE = 
BCDF = ADEF = ACDG = BDEG = CEFG = ABFG. This defining relation was used to 
produce the aliases in Table 14-30. For example, the alias relationship of A is 


A = BCE = ABCDF = DEF = CDG = ABDEG = ACEFG = BFG 


which, if we ignore interactions higher than three factors, agrees with Table 14-30. 


For seven factors, we can reduce the number of runs even further. The 2^ * design is an 
eight-run experiment accommodating seven variables. This is a 1/16th fraction and is ob- 
tained by first writing down a 2? design as the basic design in the factors A, B, and C, and then 
forming the four new columns from J = ABD, I = ACE, I = BCF, and I = ABCG, as sug- 
gested in Table 14-29. The design 1s shown in Table 14-32. 

The complete defining relation 1s found by multiplying the generators together two, three, 
and finally four at a time, producing 


I = ABD = ACE = BCF = ABCG = BCDE = ACDF = CDG = ABEF 
= BEG = AFG = DEF = ADEG = CEFG = BDFG = ABCDEFG 


The alias of any main effect is found by multiplying that effect through each term in the 


Table 14-30 Generators, Defining Relation, and Aliases for the 25? 
Fractional Factorial Design 


Generators and Defining Relation 
E = ABC, F = BCD, G = ACD 
I = ABCE = BCDF = ADEF = ACDG = BDEG = ABFG = CEFG 


Aliases 
A = BCE = DEF = CDG = BFG AB = CE = FG 
B — ACE = CDF = DEG = AFG AC = BE = DG 
C = ABE = BDF = ADG = EFG AD = EF = CG 
D = BCF = AEF = ACG = BEG AE = BC = DF 
E = ABC = ADF = BDG = CFG AF = DE = BG 
F = BCD = ADE = ABG = CEG AG = CD = BF 
G = ACD = BDE = ABF = CEF BD = CF = EG 


ABD = CDE = ACF = BEF = BCG = AEG = DFG 
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Table 14-31 A 2i? Fractional Factorial Design 


Basic Design 


Run A B € D E = ABC F = BCD G = ACD 
1 m =- — = — == 
2 + — — — 4 - -- 
3 — + — — -+ - — 
4 + E — — — -+ + 
5 — — + — + -+ -— 
6 + - + — — -- — 
fi — + + — — — -— 
8 + -+ -+ — -— - — 
9 — - - + + - 

10 + - — -+ -+ -— — 
11 — + — -+ + ~ -- 
12 + + - + - - = 
13 — — + + + 
14 + — E -+ — - -- 
15 — + -+ + -— a -— 
16 + 4 -+ -+ 4 ~ -- 


defining relation. For example, the alias of A is 


A = BD = CE = ABCF = BCG = ABCDE = CDF = ACDG 
= BEF = ABEG = FG = ADEF = DEG = ACEFG = ABDFG = BCDEFG 


This design is of resolution III, since the main effect is aliased with two-factor interactions. If 
we assume that all three-factor and higher interactions are negligible, the aliases of the seven 
main effects are 


£47 A * BD * CE + FG 
£g - B * AD * CF + EG 
(c =C + AE + BF * DG 
£p = D+ AB * CG + EF 
(p = E+ AC + BG + DF 
£p =F + BC + AG * DE 
fo - G* CD + BE + AF 


Table 14-32 A 21; * Fractional Factorial Design 


Á B e D — AB E — AC F= BC G — ABC 
— — — -+ - + 
-+ — — — — + E 
— 4 — — -+ — -+ 
+ + — -+ — — — 
— — + -+ — — -+ 
-+ — -+ — -+ — -— 
— E -+ — — + - 
+ + -+ -+ -+ -+ + 
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This 2/,* design is called a saturated fractional factorial, because all the available 
degrees of freedom are used to estimate main effects. It is possible to combine sequences of 
these resolution III fractional factorials to separate the main effects from the two-factor 
interactions. The procedure is illustrated in Montgomery (2001) and in Box, Hunter, and 


Hunter (1978). 


EXERCISES FOR SECTION 14-9 


14-31. R. D. Snee (“Experimenting with a Large Number of 
Variables," in Experiments in Industry: Design, Analysis and 
Interpretation of Results, by R. D. Snee, L. D. Hare, and J. B. 
Trout, eds., ASQC, 1985) describes an experiment in which 
a 2"! design with J = ABCDE was used to investigate the 
effects of five factors on the color of a chemical product. 
The factors are A = solvent/reactant, B = catalyst/reactant, 
C = temperature, D = reactant purity, and E = reactant pH. 
The results obtained are as follows: 


e = — 0.63 d — 6.79 
a= 251 ade — 6.47 
b = — 2.68 bde — 3.45 
abe = . 1.66 abd — 5.68 
c= 2.06 cde = 5.22 
ace = 1.22 acd = 4.38 
bce = — 2.09 bcd = 4.30 
abc = 1.93 abcde = 4.05 


(a) Prepare a normal probability plot of the effects. Which 
factors are active? 

(b) Calculate the residuals. Construct a normal probability 
plot of the residuals and plot the residuals versus the fitted 
values. Comment on the plots. 

(c) If any factors are negligible, collapse the 2?! design into 
a full factorial in the active factors. Comment on the re- 
sulting design, and interpret the results. 


14-32. Montgomery (2001) describes a 2^ ! fractional fac- 
torial design used to study four factors in a chemical process. 
The factors are 4 — temperature, B — pressure, C — concen- 
tration, and D = stirring rate, and the response is filtration 
rate. The design and the data are as follows: 


Run A B C 
1 = = — 
2 + — — 
3 = + — 
4 -- -- — 
5 = — + 
6 $ = + 
7 = + + 
8 + + -— 


(a) Write down the alias relationships. 

(b) Estimate the factor effects. Which factor effects appear 
large? 

(c) Project this design into a full factorial in the three appar- 
ently important factors and provide a practical interpreta- 
tion of the results. 


14-33. An article in Industrial and Engineering Chemistry 
(“More on Planning Experiments to Increase Research 
Efficiency,” 1970, pp. 60—65) uses a 2° ? design to investigate 
the effect on process yield of 4 — condensation temperature, 
B = amount of material 1, C = solvent volume, D = con- 
densation time, and E — amount of material 2. The results ob- 
tained are as follows: 


e — 232 cd — 23.8 
ab — 15.5 ace = 23.4 
ad = 16.9 bde = 16.8 
be = 16.2 abcde = 18.1 


(a) Verify that the design generators used were / = ACE and 
I= BDE. 

(b) Write down the complete defining relation and the aliases 
from the design. 

(c) Estimate the main effects. 

(d) Prepare an analysis of variance table. Verify that the AB 
and AD interactions are available to use as error. 

(e) Plot the residuals versus the fitted values. Also construct a 
normal probability plot of the residuals. Comment on the 


results. 
Treatment Filtration 
D = ABC Combination Rate 
= (1) 45 
+ ad 100 
+ bd 45 
= ab 65 
+ cd 75 
= ac 60 
= bc 80 
+ abcd 96 


CY 


14-39. Construct a 29^ fractional factorial design. Write SICAESEYDE A A, 
down the aliases, assuming that only main effects and two- 20-tooth 0.0265 0.0560 
factor interactions are of interest. 0.0340 0.0650 
14-40. Consider the problem in Exercise 14-19. Suppose 24-tooth 0.0450 0.0720 
that only half of the 32 runs could be made. 0.0510 0.0880 
— (a) Choose the half that you think should be run. 28-tooth 0.0405 0.0620 
[sn (b) Write out the alias relationships for your design. 0.0575 0.0825 


= 
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14-34. Consider the 29? design in Table 14-28. Suppose 
that after analyzing the original data, we find that factors C 
and E can be dropped. What type of 2^ design is left in the re- 
maining variables? 

14-35. Consider the 29? design in Table 14-28. Suppose 
that after the original data analysis, we find that factors D and 
F can be dropped. What type of 2^ design is left in the remain- 
ing variables? Compare the results with Exercise 14-34. Can 
you explain why the answers are different? 


14-36. Suppose that in Exercise 14-22 it was possible to 
run only a + fraction of the 2^ design. Construct the design and 
use only the data from the eight runs you have generated to 
perform the analysis. 


14-37. Suppose that in Exercise 14-16 only a 4 fraction 
ofthe 2? design could be run. Construct the design and analyze 
the data that are obtained by selecting only the response for 
the eight runs in your design. 


14-38. Construct the 25^ design recommended in Table 
13-29. What are the aliases of the main effects and two-factor 
interactions? 


(c) Estimate the factor effects. 

(d) Plot the effect estimates on normal probability paper and 
interpret the results. 

(e) Set up an analysis of variance for the factors identified as 
potentially interesting from the normal probability plot in 
part (d). 

(f) Analyze the residuals from the model. 

(g) Provide a practical interpretation of the results. 


14-10 RESPONSE SURFACE METHODS 
AND DESIGNS (CD ONLY) 


Supplemental Exercises 


14-41. An article in Process Engineering (No. 71, 1992, pp. 
46—47) presents a two-factor factorial experiment used to 


CHAPTER 14 DESIGN OF EXPERIMENTS WITH SEVERAL FACTORS 


investigate the effect of pH and catalyst concentration on prod- 
uct viscosity (cSt). The data are as follows: 


Catalyst Concentration 
2.5 2.7 





pH 5.6 192, 199, 189, 198 
2.9 185, 193, 185, 192 


178, 186, 179, 188 
197, 196, 204, 204 


(a) Test for main effects and interactions using a = 0.05. 
What are your conclusions? 

(b) Graph the interaction and discuss the information provided 
by this plot. 

(c) Analyze the residuals from this experiment. 

14-42. Heat treating of metal parts is a widely used manu- 

facturing process. An article in the Journal of Metals (Vol. 41, 

1989) describes an experiment to investigate flatness distor- 

tion from heat treating for three types of gears and two heat- 

treating times. The data are as follows: 


Time (minutes) 


(a) Is there any evidence that flatness distortion is different 
for the different gear types? Is there any indication that 
heat treating time affects the flatness distortion? Do these 
factors interact? Use a = 0.05. 

(b) Construct graphs of the factor effects that aid in drawing 
conclusions from this experiment. 

(c) Analyze the residuals from this experiment. Comment on 
the validity of the underlying assumptions. 


14-43. An article in the Textile Research Institute Journal 
(Vol. 54, 1984, pp. 171—179) reported the results of an exper- 
iment that studied the effects of treating fabric with selected 
inorganic salts on the flammability of the material. Two appli- 
cation levels of each salt were used, and a vertical burn test 
was used on each sample. (This finds the temperature at which 
each sample ignites.) The burn test data follow. 


Salt 
Level Untreated MgCl, NaCl CaCO, CaCl, Na CO, 
l 812 122 729 133 125 751 
827 728 73] 728 Tal 761 
876 764 726 720 719 155 
2 945 794 741 786 756 910 
881 760 744 TT] 781 854 
919 Tal 721 719 814 848 
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(a) Test for differences between salts, application levels, and 
interactions. Use a = 0.01. 

(b) Draw a graph of the interaction between salt and 
application level. What conclusions can you draw from 
this graph? 

(c) Analyze the residuals from this experiment. 

14-44. An article in the ZEEE Transactions on Components, 

Hybrids, and Manufacturing Technology (Vol. 15, 1992) de- 

scribes an experiment for investigating a method for aligning 

optical chips onto circuit boards. The method involves placing 
solder bumps onto the bottom of the chip. The experiment 
used three solder bump sizes and three alignment methods. 

The response variable is alignment accuracy (in micrometers). 

The data are as follows: 


Solder Bump Size skalie nmen. MENNO d a 
BU lo E E a ME 

4.60 1:0» 1.05 

75 4.53 1.45 1.00 

2,99 1.72 0.82 

130 2.44 1.76 0.95 

4.95 2.19 2.36 

260 4.55 2.60 2.46 


(a) Is there any indication that either solder bump size or 
alignment method affects the alignment accuracy? Is 
there any evidence of interaction between these factors? 
Use a = 0.05. 

(b) What recommendations would you make about this 
process? 

(c) Analyze the residuals from this experiment. Comment on 
model adequacy. 


14-45. An article in Solid State Technology (Vol. 29, 1984, 
pp. 281—284) describes the use of factorial experiments in 
photolithography, an important step in the process of manu- 
facturing integrated circuits. The variables in this experiment 
(all at two levels) are prebake temperature (4), prebake 
time (B), and exposure energy (C), and the response variable 
is delta line width, the difference between the line on the 
mask and the printed line on the device. The data are as fol- 
lows: (1) = —2.30, a = —9.87, b = —18.20, ab = —30.20, 
c = —23.80, ac = —4.30, bc = —3.80, and abc = —14.70. 
(a) Estimate the factor effects. 

(b) Use a normal probability plot of the effect estimates to 
identity factors that may be important. 

(c) What model would you recommend for predicting the 
delta line width response, based on the results of this exper- 
iment? 

(d) Analyze the residuals from this experiment, and comment 
on model adequacy. 


14-46. An article in the Journal of Coatings Technology 
(Vol. 60, 1988, pp. 27—32) describes a 2^ factorial design used 
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for studying a silver automobile basecoat. The response vari- 
able is distinctness of image (DOI). The variables used in the 
experiment are 
A — Percentage of polyester by weight of polyester/melamine 
(low value = 50%, high value = 70%) 
B — Percentage of cellulose acetate butyrate carboxylate (low 
value = 15%, high value = 30%) 
C = Percentage of aluminum stearate (low value = 1%, high 
value = 3%) 
D = Percentage of acid catalyst (low value = 0.25%, high 
value = 0.50%) 
The responses are (1) = 63.8, a = 77.6, b = 68.8, ab = 
76.5, c = 72.5, ac = 77.2, be = 77.7, abc = 84.5, d = 60.6, 
ad = 64.9, bd = 72.7, abd = 73.3, cd = 68.0, acd = 76.3, 
bcd = 76.0, and abcd = 75.9. 
(a) Estimate the factor effects. 
(b) From a normal probability plot of the effects, identify a 
tentative model for the data from this experiment. 
(c) Using the apparently negligible factors as an estimate of 
error, test for significance of the factors identified in part 
(b). Use a = 0.05. 
(d) What model would you use to describe the process, based 
on this experiment? Interpret the model. 
(e) Analyze the residuals from the model in part (d) and com- 
ment on your findings. 


14-47. An article in the Journal of Manufacturing Systems 
(Vol. 10, 1991, pp. 32-40) describes an experiment to investi- 
gate the effect of four factors P = waterjet pressure, F = 
abrasive flow rate, G = abrasive grain size, and V = jet tra- 
verse speed on the surface roughness of a waterjet cutter. A 2* 
design follows. 


Factors Surface 
V a Ve G Roughness 

Run (in/min) (Ib/min) (kpsi) (Mesh No.) (wm) 
l 6 2.0 38 80 104 
2 2 2.0 38 80 98 
3 6 2.0 30 80 103 
4 2 2.0 30 80 96 
5 6 1.0 38 80 137 
6 2 1.0 38 80 112 
7 6 1.0 30 80 143 
8 2 1.0 30 80 129 
9 6 2.0 38 170 88 
10 2 2:0 38 170 70 
11 6 2.0 30 170 110 
12 2 2.0 30 170 110 
13 6 1.0 38 170 102 
14 2 1.0 38 170 76 
15 6 1.0 30 170 98 
16 2 1.0 30 170 68 


= 
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(a) Estimate the factor effects. 

(b) Form a tentative model by examining a normal probabil- 
ity plot of the effects. 

(c) Is the model in part (b) a reasonable description 
of the process? Is lack of fit significant? Use a = 
0.05. 

(d) Interpret the results of this experiment. 

(e) Analyze the residuals from this experiment. 


14-48. Constructa 2]! design for the problem in Exercise 
14-46. Select the data for the eight runs that would have been 
required for this design. Analyze these runs and compare your 
conclusions to those obtained in Exercise 14-46 for the full 
factorial. 


14-49. Construct a 2]! design for the problem in Exercise 
14-47. Select the data for the eight runs that would have been 
required for this design, plus the center points. Analyze these 
data and compare your conclusions to those obtained in 
Exercise 14-47 for the full factorial. 


14-50. Construct a 2854 design in 16 runs. What are the 
alias relationships in this design? 


14-51. Construct a 2?;? design in eight runs. What are the 
alias relationships in this design? 


14-52. Ina process development study on yield, four fac- 
tors were studied, each at two levels: time (4), concentration 
(B), pressure (C), and temperature (D). A single replicate 
of a 2^ design was run, and the data are shown in the table 
below. 


Run Actual Run 

Number Order A B C 
1 5 = = = 

2 9 + — — 

3 8 - + — 

4 13 + + — 

5 — = + 

6 + — + 

7 14 — + + 

8 1 + + + 

9 6 = = = 
10 11 + — — 
11 2 — + — 
12 15 + + — 
13 4 = — + 
14 16 + — + 
15 10 = + + 
16 12 + + + 
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(a) Plot the effect estimates on a normal probability scale. 
Which factors appear to have large effects? 

(b) Conduct an analysis of variance using the normal proba- 
bility plot in part (a) for guidance in forming an error 
term. What are your conclusions? 

(c) Analyze the residuals from this experiment. Does your 
analysis indicate any potential problems? 

(d) Can this design be collapsed into a 2? design with two 
replicates? If so, sketch the design with the average and 
range of yield shown at each point in the cube. Interpret 
the results. 


14-53. An article in the Journal of Quality Technology 

(Vol. 17, 1985, pp. 198—206) describes the use of a replicated 

fractional factorial to investigate the effect of five factors on 

the free height of leaf springs used in an automotive applica- 

tion. The factors are A = furnace temperature, B = heating 

time, C = transfer time, D = hold down time, and E = 

quench oil temperature. The data are shown in the following 

table. 

(a) What is the generator for this fraction? Write out the alias 
structure. 

(b) Analyze the data. What factors influence mean free height? 

(c) Calculate the range of free height for each run. Is there any 
indication that any of these factors affect variability in free 
height? 

(d) Analyze the residuals from this experiment and comment 
on your findings. 


Factor Levels 
Low ( — ) High ( + ) 
A (b) o 3 
B (90) 14 18 
C (psi) 60 80 
DEC) 225 250 


Yield 
D (lbs) 


— 12 
= 18 
= 13 
= 16 
= 17 
= 15 
20 
15 
10 
25 
I3 
24 
19 
21l 
17 
23 


+++ 4+ 4+ 4+ 4+ 4 


EN 


" 


= 
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A B E D E Free Height 

— = - — — 7.78 1:18 7.81 
+ — — is = 8.15 8.18 7.88 
— — — is = 7.50 7.56 7.50 
J + — — = 4:99 7.56 IIS 
— — + F E 7.54 8.00 7.88 
-- — + = m 7.69 8.09 8.06 
— + + = = 7.56 T32 7.44 
+ + + + = 7.56 7.81 7.69 
= — — = + 7.50 7.56 7.50 
+ — — i T 7.88 7.88 7.44 
— -- — $ + 7.50 7.56 7.50 
4 + — — + 7.63 7.15 7.56 
— — $ a + 1.22 7.44 7.44 
~~ — + = $ 7.56 7.69 7:62 
— + + = F 7.18 7.18 1.25 
+ + T us T 7.81 7.50 4:29 


14-54. An article in Rubber Chemistry and Technology 
(Vol. 47, 1974, pp. 825-836) describes an experiment that 
studies the Mooney viscosity of rubber to several variables, 
including silica filler (parts per hundred) and oil filler (parts 
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per hundred). Data typical of that reported in this experiment 
follow, where 


| Silica — 60 _ oil — 21 
ae LL E 
Coded levels 
X1 X? X 
=] =| 15.4 
1 =] 14.15 
=] l 12.87 
l l 13.53 
e] =] 13.90 
1 =] 14.88 
=] l 12:23 
e 1 13.35 


(a) What type of experimental design has been used? 
(b) Analyze the data and draw appropriate conclusions. 
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MIND-EXPANDING EXERCISES 


14-55. Consider an unreplicated 2^ factorial, and sup- 
pose that one of the treatment combinations is missing. 
One logical approach to this problem is to estimate the 
missing value with a number that makes the highest or- 
der interaction estimate zero. Apply this technique to the 
data in Example 14-5, assuming that ab is missing. 
Compare the results of the analysis of these data with the 
results in Example 14-5. 


14-56. What blocking scheme would you recommend 
if it were necessary to run a 2^ design in four blocks of 
four runs each? 


14-57. Consider a 2? design in two blocks with 
AB confounded with blocks. Prove algebraically that 
SS 1p a SS Blocks 


14-58. Consider a 2? design. Suppose that the largest 
number of runs that can be made in one block is four, but 
we can afford to perform a total of 32 observations. 


(a) Suggest a blocking scheme that will provide some 
information on all interactions. 

(b) Show an outline (source of variability, degrees of 
freedom only) for the analysis of variance for this 
design. 


14-59. Construct a 2? ! design. Suppose that it is 
necessary to run this design in two blocks of eight runs 
each. Show how this can be done by confounding a two- 
factor interaction (and its aliased three-factor interac- 
tion) with blocks. 


14-60. Construct a 2]; ? design. Show how this 
design may be confounded in four blocks of eight runs 


each. Are any two-factor interactions confounded with 
blocks? 


14-61. Construct a 2|; ? design. Show how this de- 
sign can be confounded in two blocks of eight runs each 
without losing information on any of the two-factor 
interactions. 

14-62. Setupa2[; ! design using D = AB, E = AC, 
F = BC, and G = ABC as the design generators. Ignore 
all interaction above the two factors. 


(a) Verify that each main effect is aliased with three 
two-factor interactions. 

(b) Suppose that a second 2]; ^ design with generators 
D = —AB, E = —AC, F = —BC, and G = ABC is 
run. What are the aliases of the main effects in this 
design? 

(c) What factors may be estimated 1f the two sets of 
factor effect estimates above are combined? 


To work Exercises 14-63 through 14-67 you will need to 
read Section 14.6 on the CD. 


14-63. Consider the experiment described in 
Example 14-4. Suppose that both factors were random. 
(a) Analyze the data and draw appropriate conclusions. 
(b) Estimate the variance components. 


14-64. For the breaking strength data in Table S14-1, 
suppose that the operators were chosen at random, but 
machines were a fixed factor. Does this influence the 
analysis or your conclusions? 

14-65. A company employs two time-study engi- 
neers. Their supervisor wishes to determine whether the 
standards set by them are influenced by an interaction 
between engineers and operators. She selects three oper- 
ators at random and conducts an experiment in which 
the engineers set standard times for the same job. She 
obtains the data shown here: 


Operator 
Engineer l 2 3 
l 259 2.38 2.40 
DS 2.49 Dr 
2 215 2.85 2.66 
2.86 2) 2:9) 


(a) State the appropriate hypotheses. 

(b) Use the analysis of variance to test these hypotheses 
with a = 0.05. 

(c) Graphically analyze the residuals from this exper- 
iment. 

(d) Estimate the appropriate variance components. 


14-66. Consider the experiment on baked anode den- 

sity described in Exercise 14-4. Suppose that positions 

on the furnace were chosen at random and temperature 

is a fixed factor. 

(a) State the appropriate hypotheses. 

(b) Use the analysis of variance to test these hypotheses 
with a = 0.05. 

(c) Estimate the variance components. 

14-67. Consider the experiment described in Exercise 

14-63. How does the analysis (and conclusions) change 

if both factors are random? Use a — 0.05. 


To work Exercises 14-68 and 14-69 you will need to read 
Section 14-7.4 on the CD. 


14-10 RESPONSE SURFACE METHODS AND DESIGNS (CD ONLY) 


MIND-EXPANDING EXERCISES 


14-68. Consider the experiment in Exercise 14-19. 

Suppose that a center point had been run (replicated five 

times) and the responses were 45, 40, 41, 47, and 43. 

(a) Estimate the experimental error using the center 
points. Compare this to the estimate obtained origi- 
nally in Exercise 14-19 by pooling apparently non- 
significant effects. 

(b) Test for look of fit, using a = 0.05. 


14-69. Consider the data from Exercise 14-13, repli- 
cate 1 only. Suppose that a center point with four 
replicates is added to these eight factorial runs and the 
responses are 425, 400, 437, and 418. 

(a) Estimate the facter effects. 

(b) Test for lack of fit using a = 0.05 

(c) Test for main effects and interactions using a = 0.05. 
(d) Analyze residuals and draw conclusions. 


To work problem 14-70 through 14-74 you will need to 
read Section 14-10 on the CD. 


14-70. An article in Rubber Age (Vol. 89, 1961, pp. 
453—458) describes an experiment on the manufacture of a 
product in which two factors were varied. The factors are 
reaction time (hr) and temperature (^C). These factors are 
coded as x, = (time — 12)/8 and x, = (temperature — 
250)/30. The following data were observed where y is the 
yield (in percent): 


Run 
Number Xi X> y 
1 =] 0 83.8 
2 1 0 81.7 
3 0 0 82.4 
4 0 0 82.9 
5 0 =] 84.7 
6 0 l 75.9 
7 0 0 81.2 
8 — 1.414 —0.414 81.3 
9 — 1.414 1.414 83.1 
10 1.414 —1.414 85.3 
11 1.414 1.414 Do 
1 0 0 82.0 


(a) Plot the points at which the experimental runs were 
made. 


(b) Fit a second-order model to the data. Is the second- 
order model adequate? 

(c) Plot the yield response surface. What recommenda- 
tions would you make about the operating condi- 
tions for this process? 


14-71. Consider the first-order model 


y = 50 + 1.5x] = 0.8x2 


where —1 < x; x 1. Find the direction of steepest ascent. 
14-72. A manufacturer of cutting tools has devel- 
oped two empirical equations for tool life (y1) and tool cost 
(y;). Both models are functions of tool hardness (x,) and 
manufacturing time (x,). The equations are 


Vi = 10+ SXi =F AS 


y» 23s: 3x s 4x» 

and both equations are valid over the range —1.5 < 
x; € 1.5. Suppose that tool life must exceed 12 hours 
and cost must be below $27.50. 

(a) Is there a feasible set of operating conditions? 

(b) Where would you run this process? 


14-73. An article in Tappi (Vol. 43, 1960, pp. 38-44) 
describes an experiment that investigated the ash value of 
paper pulp (a measure of inorganic impurities). Two vari- 
ables, temperature Tin degrees Celsius and time f in hours, 
were studied, and some of the results are shown in the fol- 
lowing table. The coded predictor variables shown are 


(mS 
Jis ces 








Mio 


X X2 M Xi X2 L 
E =l 211 (es 168 
1 =] 92 0 IL 12/0 
=| 1 216 0 0 127 
1 1 99 0 0 175 
LO DEP» 0 0 157 
es 0 48 0 0 146 


(a) What type of design has been used in this study? Is 
the design rotatable? 
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MIND-EXPANDING EXERCISES 


(b) Fit a quadratic model to the data. Is this model satis- 
factory? 

(c) If it 1s important to minimize the ash value, where 
would you run the process? 


14-74. In their book Empirical Model Building and 
Response Surfaces (John Wiley, 1987), G. E. P. Box and 
N. R. Draper describe an experiment with three factors. 
The data shown in the following table are a variation of 
the original experiment on page 247 of their book. 
Suppose that these data were collected in a semiconduc- 
tor manufacturing process. 


(a) The response y, is the average of three readings on 
resistivity for a single wafer. Fit a quadratic model 
to this response. 

(b) The response y, is the standard deviation of the three 
resistivity measurements. Fit a linear model to this 
response. 

(c) Where would you recommend that we set x,, x;, and 
x, if the objective is to hold mean resistivity at 500 
and minimize the standard deviation? 


XI X? X3 Vi J? XI X? X3 Yi b 

zl =l all 24.00 12.49 i 0 0 501.67 92.50 
0 El zi 120.33 8.39 = Í 0 264.00 63.50 
1 = =] 21307 42.83 0 1 0 4277.00 88.61 

= 0 zil 86.00 3.46 I i 0 730.67 21.08 
0 0 T 136.63 80.41 =l Si Íl 22007 133.82 
1 0 zt 340.67 16.17 0 =I I 239.67 23.46 
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14-2 SOME APPLICATIONS OF EXPERIMENTAL 
DESIGN TECHNIQUES (CD ONLY) 


Experimental design is an extremely important tool for engineers and scientists who are inter- 
ested in improving the performance of a manufacturing process. It also has extensive application 
in the development of new processes and in new product design. We now give some examples. 


A Process Characterization Experiment 

A team of development engineers is working on a new process for soldering electronic com- 
ponents to printed circuit boards. Specifically, the team is working with a new type of flow 
solder machine that should reduce the number of defective solder joints. (A flow solder 
machine preheats printed circuit boards and then moves them into contact with a wave of 
liquid solder. This machine makes all the electrical and most of the mechanical connections 
of the components to the printed circuit board. Solder defects require touchup or rework, 
which adds cost and often damages the boards.) The process will have several (perhaps 
many) variables, and all of them are not equally important. The initial list of candidate vari- 
ables to be included in the experiment is constructed by combining the knowledge and 
information about the process from all team members. In this example, the engineers con- 
ducted a brainstorming session and invited manufacturing personnel with experience using 
various types of flow soldering equipment to participate. The team determined that the flow 
solder machine has several variables that can be controlled. They are 


1. Solder temperature 
Preheat temperature 
Conveyor speed 
Flux type 
Flux-specific gravity 
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Solder wave depth 
7. Conveyor angle 
In addition to these controllable factors, there are several other factors that cannot be easily 
controlled, once the machine enters routine manufacturing, including 
1. Thickness of the printed circuit board 
Types of components used on the board 
Layout of the components on the board 
Operator 


Environmental factors 
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Production rate 


Sometimes we call the uncontrollable factors noise factors. A schematic representation of 
the process is shown in Fig. S14-1. 

In this situation the engineer is interested in characterizing the flow solder machine; that 
is, he is interested in determining which factors (both controllable and uncontrollable) affect 
the occurrence of defects on the printed circuit boards. To determine these factors, he can 
design an experiment that will enable him to estimate the magnitude and direction of the fac- 
tor effects. Sometimes we call such an experiment a screening experiment. The information 
from this characterization study or screening experiment can help determine the critical 
process variables, as well as the direction of adjustment for these factors in order to reduce the 


14-1 


14-2 


Figure 514-1 The 
flow solder 
experiment. 


Figure S14-2 


Contour plot of yield 
as a function of reac- 
tion time and reaction 
temperature, illustrat- 


ing an optimization 
experiment. 
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number of defects, and assist in determining which process variables should be carefully con- 
trolled during manufacturing to prevent high defect levels and erratic process performance. 


An Optimization Experiment 

In a characterization experiment, we are interested in determining which factors affect the 
response. A logical next step is to determine the region in the important factors that leads 
to an optimum response. For example, if the response is cost, we will look for a region of 
minimum cost. 

As an illustration, suppose that the yield of a chemical process is influenced by the op- 
erating temperature and the reaction time. We are currently operating the process at 155°F 
and 1.7 hours of reaction time, and the current process yield is around 75%. Figure 814-2 
shows a view of the time-temperature space from above. In this graph we have connected 
points of constant yield with lines. These lines are yield contours, and we have shown the 
contours at 60, 70, 80, 90, and 95% yield. To locate the optimum, we might begin with a 
factorial experiment such as we described below, with the two factors, time and temperature, 
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run at two levels each at 10°F and 0.5 hours above and below the current operating condi- 
tions. This two-factor factorial design is shown in Fig. S14-2. The average responses 
observed at the four points in the experiment (145°F, 1.2 hours; 145°F, 2.2 hours; 165^F, 1.2 
hours; and 165°F, 2.2 hours) indicate that we should move in the general direction of 
increased temperature and lower reaction time to increase yield. A few additional runs could 
be performed in this direction to locate the region of maximum yield. 


A Product Design Example 

We can also use experimental design in the development of new products. For example, suppose 
that a group of engineers are designing a door hinge for an automobile. The product characteris- 
tic is the check effort, or the holding ability, of the latch that prevents the door from swinging 
closed when the vehicle is parked on a hill. The check mechanism consists of a leaf spring and a 
roller. When the door is opened, the roller travels through an arc causing the leaf spring to be com- 
pressed. To close the door, the spring must be forced aside, and this creates the check effort. The 
engineering team thinks that check effort is a function of the following factors: 


1. Roller travel distance 
Spring height from pivot to base 
Horizontal distance from pivot to spring 


Free height of the reinforcement spring 
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Free height of the main spring 


The engineers can build a prototype hinge mechanism in which all these factors can be varied 
over certain ranges. Once appropriate levels for these five factors have been identified, an ex- 
periment can be designed consisting of various combinations of the factor levels, and the pro- 
totype can be tested at these combinations. This will produce information concerning which 
factors are most influential on the latch check effort, and through analysis of this information, 
the latch design can be improved. 

These examples illustrate only three applications of experimental design methods. In the 
engineering environment, experimental design applications are numerous. Some potential areas 
of use are 


1. Process troubleshooting 

Process development and optimization 
Evaluation of material and alternatives 
Reliability and life testing 


Performance testing 
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Product design configuration 
7. Component tolerance determination 
Experimental design methods allow these problems to be solved efficiently during the early 


stages of the product cycle. This has the potential to dramatically lower overall product cost 
and reduce development lead time. 


14-6 FACTORIAL EXPERIMENTS WITH RANDOM FACTORS 
(CD ONLY) 


In this chapter, we focus primarily on the case where all the factors are fixed; that is, the experi- 
menter specifically chose the levels, and the conclusions from the experiment are confined to 
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those specific levels. Now we briefly consider the situation where one or more of the factors in a 
factorial experiment are random, using the two-factor factorial design as an illustration. 


The Random-Effects Model 

Suppose that we have two factors A and B arranged in a factorial experiment in which the lev- 
els of both factors are selected at random from larger populations of factor levels, and we wish 
to extend our conclusion to the entire population of factor levels. The observations are repre- 
sented by the model 


i—1,2,...,a 
Yr = e t t; t B; t(TB)y t eyri = 12522040 (S14-1) 
k= 1,2,...,n 


where the parameters 7;, B;, (TB); and e;;, are normally and independently distributed random 
variables with means zero and variances o7, op, 07. and c^, respectively. As a result of these 
assumptions, the variance of any observation Y; is 
V(Y;) = o? + Op + 078 + o? 

and 07, 05, 025, and o? are called variance components. The hypotheses that we are interested in 
testing are Hy: 02 = 0, Hy og = 0, and Ho: Or, = (). Notice the similarity to the single-factor 
experiment random-effects model discussed in Chapter 13. 

The basic analysis of variance remains unchanged; that is, SS, SS}, SS ,5, SS 7, and SS; are 
all calculated as in the fixed-effects case. To construct the test statistics, we must examine the 
expected mean squares. They are 


E(MS,) = o° + 1025 + bno? 
E(MSg) = o* + nozg + anos 
E(MS,5) = o° + nog 

and 


E(MS;) = o° (S14-2) 


Note from the expected mean squares that the appropriate statistic for testing the 
no-interaction hypothesis Ho: Org = (is 





since if H, is true, both numerator and denominator of F, have expectation o^, and only if Hj 
is false is E(MS,5) greater than E(MS;). The ratio Fo is distributed as P, 11,5501 
Similarly, for testing that there is no main effect of factor A, or Hy: o2 = 0, we would use 
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which is distributed as PF’, (g—1)@-1), and for testing Ho: op = 0 the test statistic is 





which is distributed as Fy) (q—1y-1)- These are all upper-tail, one-tail tests. Thus, the null 
hypotheses above would be rejected at the a level of significance if the calculated value of f, 
exceeds the upper a percentage point of the F-distribution. Notice that these test statistics are 
not the same as those used if both factors A and B are fixed. The expected mean squares are 
always used as a guide to test statistic construction. 

The variance components may be estimated by equating the observed mean squares to 
their expected values and solving for the variance components. This yields 





Two factors that may influence the breaking strength of cloth are being studied. Four test 
machines and three operators are chosen at random, and an experiment is run using cloth from 
the same production segment. The data are shown in Table S14-1, and the analysis of variance 
is in Table S14-2. Notice that the first four columns in Table S14-2. are computed as in a stan- 
dard (fixed-effects model) analysis. The test statistics are computed using Equations (814-3) 
through S14-5. We will use a — 0.05. The test statistic for the no-interaction hypothesis 
Ho: 078 = Q is 


— MSi 594 | TT 
0^ MS; 375 ^ 





Table S14-1 Breaking Strength Data for Example S14-1 


Machine 
Operator I 2 3 i 
A 113 113 111 113 
112 118 111 119 
B 111 110 111 114 
112 111 109 112 
C 109 112 114 111 
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Table S14-2 Analysis of Variance for Example 14-2A 


Source of sum of | Degreesof | Mean 

Variation Squares Freedom Square fo P-Value 
Operators 26.33 2 15.17 2.217 0.1904 

Machines 24.33 3 8.11 1.365 0.3403 

Interaction 35.67 6 5.94 1.584 0.2338 

Error 45.00 12 3.75 


Total 131.35 23 


which would be compared to fo.05s,6,12 = 3.00, and so Ho: 078 — 0 cannot be rejected. 
The P-value for this ratio is P = 0.2338. To test for no machine effect (A): oh = 0), we 
compute 


_ MSs _ 811 _ | 4. 
MS, 594 ^ 





Jo 


which would be compared to fp 09535 = 4.76. The P-value for this ratio is P = 0.3403. 
Therefore, we conclude that machines do not significantly affect the breaking strength test 
results. To test for no operator effect (Hj: o? = 0), we compute 


_ MS, 01317. 5,4 
0 MSi 594 





which would be compared to foo526 = 5.14. The P-value for this ratio is P = 0.1904. We 
conclude that the operators do not affect the breaking strength test results. The variance com- 
ponents may be estimated using Equations S14-6 as follows: 


6? = 3.75 
i 394—375 
6 => 1.10 
.,. 13.17 — 5.94 
ô? = ——————— = 0.90 
8 
~> 811 — 5.94 
ed m 0.36 


All of the variance components 67, op, and O78 are small; in fact, they are not significantly 
different from zero. 


The Mixed Model 
Now suppose that one of the factors, A, is fixed and the other, B, is random. This is called the 
mixed model analysis of variance. The linear model is 


ji —1,2,...,a 
lI uev 912.55 (S14-7) 
k—1,2,...,n 
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In this model, 7; is a fixed effect defined such that €‘; 7; = 0, B;, is a random effect, the 
interaction term (TB); is a random effect, and €,, is a normally and independently distributed 
random variable with mean zero and variance o°. The interaction elements (TB); are normal 
random variables with mean zero and variance [(a — 1)/a]o%g, and 37. (7);; = 0. Because 
the sum of the interaction effects over the levels of the fixed factor equals zero, this version of 
the mixed model is called the restricted model. The interaction elements are not all 
independent. For more details, see Montgomery (2001). 
The expected mean squares in this case are 


a 


PESE 
E(MS,) = o? + no% + ——— 


E(MSg) = o° + ano 
E(MS 4p) = o^ F nog 
IUS = g* 





Therefore, the appropriate test statistic for testing Hp: 7; = O is 


(S14-10) 


(S14-11) 





for which the reference distribution is F(, 9.1, op 1 
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Table S14-3 Analysis of Variance for the Two-Factor Mixed Model (A fixed, B random) 


Source of 
Variation 
Rows (A) 
Columns (B) 
Interaction 
Error 

Total 


Sum of Degrees of Expected Mean Mean 
Squares Freedom Square Square Fo 
SS, a—1 o? + nozg + bn Xis?/(a — 1) MS, MS,/MS,s 
SS; b- 1 o^ + anos MS; MS;/MS,, 
Sm (a — 1b — 1) o? + nože MS 4p MS,5/ MS; 
SS; ab(n — 1) o? MS; 
SS, abn — | 


The variance components 0$. Tg» and o° may be estimated by eliminating the first equa- 
tion from Equations S14-8, leaving three equations in three unknowns, whose solution is 


AD — MS, zu MS; 
V. 7 an 
a2 MS 45 m MS; 
OB = n 
and 
ó^ = MS; 


This general approach can be used to estimate the variance components in any mixed model. 
After eliminating the mean squares containing fixed factors, there will always be a set of equa- 
tions remaining that can be solved for the variance components. Table S14-3 summarizes the 
analysis of variance for the two-factor mixed model. 


Computer Output 

Some software packages have the capability to handle random factors in the analysis of vari- 
ance. Table S14-4 is the computer solution to Example S14-1 from Minitab. Notice that the 
F-tests on the main effects have been constructed using MS, in the denominator. The results 
closely match those in Table 814-2. 

Table S14-5 is the Minitab output for the breaking strength data assuming a mixed model, 
where operators are fixed and machines are random. Minitab allowed us to assume that 
restricted form of the mixed model [not all software packages make this easy, and some auto- 
matically use an unrestricted model, that is, the term >_, (TB); # 0 in general]. Notice that 


Table S14-4 Minitab Output for Example S14-1 


Analysis of Variance (Balanced Designs) 


Factor Type Levels Values 
Operator random 3 A B C 
Machine random 4 l 2 3 4 


Analysis of Variance for Strength 


Source DF SS MS F P 
Operator 2 26.333 13.167 2:21 0.190 
Machine 3 24.333 8.111 1.36 0.340 
Operator* Machine 6 35.667 5.944 1.59 0.234 
Error 12 45.000 3.750 


Total 23 131.333 
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Table S14-5 Minitab Analysis of Variance for the Breaking Strength Data in Table S14-1 Where 
Operators Are Fixed and Machines Are Random (Mixed Model) 


Analysis of Variance (Balanced Designs) 


Factor Type Levels Values 
Operator fixed 3 A B C 
Machine random 4 1 2 3 4 


Analysis of Variance for Strength 


Source DF SS MS F P 
Operator 2 26.333 13.167 2.21 0.190 
Machine 3 24.333 8.111 2.16 0.145 
Operator* Machine 6 35.667 5.944 1.59 0.234 
Error 12 45.000 3.750 

Total 23 131.333 


the test statistics for main effects and interactions have been constructed properly using 
Equations 814-9 through S14-11. 


14-7.4 Addition of Center Points to a 2* Design (CD Only) 


A potential concern in the use of two-level factorial designs is the assumption of linearity in the 
factor effects. Of course, perfect linearity is unnecessary, and the 2^ system will work quite well 
even when the linearity assumption holds only approximately. However, there is a method of 
replicating certain points in the 2^ factorial that will provide protection against curvature as well 
as allow an independent estimate of error to be obtained. The method consists of adding center 
points to the 2^ design. These consist of nc replicates run at the point x; = 0 (i = 1,2,...,k). One 
important reason for adding the replicate runs at the design center 1s that center points do not 
affect the usual effects estimates in a 2“ design. We assume that the k factors are quantitative. 

To illustrate the approach, consider a 2? design with one observation at each of the 
factorial points (—, —), (+, —), (7, +), and (+, +) and nc observations at the center points 
(0, 0). Figure S14-3 illustrates the situation. Let yp be the average of the four runs at the four 


+] 


Figure S14-3 A2? 
design with center 
points. 
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EXAMPLE 814-2 


Figure S14-4 The 2? 
design with five center 
points for Example 
S14-2. 


factorial points, and let yc be the average of the nc run at the center point. If the difference 
yr — yc is small, the center points lie on or near the plane passing through the factorial points, 
and there is no curvature. On the other hand, if yp. — yc is large, curvature is present. A single- 
degree-of-freedom sum of squares for curvature is given by 


nen Vr — Voy 


DD Gurvatire = nr ES nc 
"NS 2 
= IO (S14-12) 
l l 
Np ` ne 


where, in general, np is the number of factorial design points. This quantity may be compared 
to the error mean square to test for curvature. Notice that when Equation S14-12 is divided by 
6^ = MSp, the result is similar to the square of the f statistic used to compare two means. 


More specifically, when points are added to the center of the 2^ design, the model we may 
entertain is 


k k 
Y= Do T 2, Dix; + > MBgx;x, + >, By +e 
f Iz i 


where the B; are pure quadratic effects. The test for curvature actually tests the hypotheses 


Furthermore, if the factorial points in the design are unreplicated, we may use the nç center 
points to construct an estimate of error with nc — 1 degrees of freedom. 


A chemical engineer is studying the percentage of conversion or yield of a process. There are 
two variables of interest, reaction time and reaction temperature. Because she is uncertain about 
the assumption of linearity over the region of exploration, the engineer decides to conduct a 2° 
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Table S14-6 Analysis of Variance for Example 814-2 


Source of Sum of Degrees of Mean 

Variation Squares Freedom Square fo P-Value 
A (Time) 2.4025 l 2.4025 55.87 0.0017 
B (Temperature) 0.4225 l 0.4225 9.83 0.0350 
AB 0.0025 l 0.0025 0.06 0.8237 
Curvature 0.0027 1 0.0027 0.06 0.8163 
Error 0.1720 4 0.0430 
Total 3.0022 8 


design (with a single replicate of each factorial run) augmented with five center points. The 
design and the yield data are shown in Fig. 814-4. 

Table S14-6 summarizes the analysis of variance for this experiment. The mean square 
error is calculated from the center points as follows: 








(yi — Yo) 
SS E Center points 
MS; = — 
nc 7 1 nc — 1 
5 
; — 40.46) 
2,6 Y 0.1720 
— nu -— — 1 — 0.0430 


The average of the points in the factorial portion of the design is yp = 40.425, and the average of 
the points at the center is yc = 40.46. The difference yp — yc = 40.425 — 40.46 = —0.035 
appears to be small. The curvature sum of squares in the analysis of variance table is computed 
from Equation S14-12 as follows: 


nghc(Yg — yc) 


DO Curvature = np 4 Nc 





4)(5)(—0.035)° 
Oe ae 
4+ 5 
The analysis of variance indicates that both factors exhibit significant main effects, that there 
is no interaction, and that there is no evidence of curvature in the response over the region of 
exploration. That is, the null hypothesis Ap: Xu B; = 0 cannot be rejected. 


14-10 RESPONSE SURFACE METHODS AND DESIGNS (CD ONLY) 


Response surface methodology, or RSM, is a collection of mathematical and statistical tech- 
niques that are useful for modeling and analysis in applications where a response of interest is 
influenced by several variables and the objective is to optimize this response. For example, 
suppose that a chemical engineer wishes to find the levels of temperature (x,) and feed con- 
centration (x5) that maximize the yield (y) of a process. The process yield is a function of the 
levels of temperature and feed concentration, say, 


Y = f(x}, x2) te 
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where e represents the noise or error observed in the response Y. If we denote the expected re- 
sponse by E(Y) = f(x;, x;) = n, then the surface represented by 


Lom f Ds x2) 


is called a response surface. 

We may represent the response surface graphically as shown in Fig. S14-5, where n is 
plotted versus the levels of x, and x;. Notice that the response is represented as a surface plot 
in a three-dimensional space. To help visualize the shape of a response surface, we often plot 
the contours of the response surface as shown in Fig. $14-6. In the contour plot, lines of con- 
stant response are drawn in the x,, x; plane. Each contour corresponds to a particular height of 
the response surface. The contour plot is helpful in studying the levels of x, and x, that result 
in changes in the shape or height of the response surface. 

In most RSM problems, the form of the relationship between the response and the inde- 
pendent variables is unknown. Thus, the first step in RSM is to find a suitable approximation 
for the true relationship between Y and the independent variables. Usually, a low-order poly- 
nomial in some region of the independent variables is employed. If the response is well 
modeled by a linear function of the independent variables, the approximating function 1s the 
first-order model 


Y= Do + Dix, T B2xX2 Pote ae Dix, +e (S14-13) 


If there is curvature in the system, then a polynomial of higher degree must be used, such as 
the second-order model 


Y = Bo + > Bix; + > Bux; M >, Byxix; te (S14-14) 


i<j 


Many RSM problems use one or both of these approximating polynomials. Of course, it is 
unlikely that a polynomial model will be a reasonable approximation of the true functional re- 
lationship over the entire space of the independent variables, but for a relatively small region 
they usually work quite well. 

The method of least squares, discussed in Chapters 11 and 12, is used to estimate the 
parameters in the approximating polynomials. The response surface analysis is then done in 
terms of the fitted surface. If the fitted surface is an adequate approximation of the true 
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response function, analysis of the fitted surface will be approximately equivalent to analysis 
of the actual system. 

RSM is a sequential procedure. Often, when we are at a point on the response surface that 
is remote from the optimum, such as the current operating conditions in Fig. 814-6, there is 
little curvature in the system and the first-order model will be appropriate. Our objective here 
is to lead the experimenter rapidly and efficiently to the general vicinity of the optimum. Once 
the region of the optimum has been found, a more elaborate model such as the second-order 
model may be employed, and an analysis may be performed to locate the optimum. From Fig. 
S14-6, we see that the analysis of a response surface can be thought of as “climbing a hill,” 
where the top of the hill represents the point of maximum response. If the true optimum is a 
point of minimum response, we may think of “descending into a valley.” 

The eventual objective of RSM is to determine the optimum operating conditions for the 
system or to determine a region of the factor space in which operating specifications are sat- 
isfied. Also, note that the word “optimum” in RSM is used in a special sense. The “hill climb- 
ing" procedures of RSM guarantee convergence to a local optimum only. 


Method of Steepest Ascent 

Frequently, the initial estimate of the optimum operating conditions for the system will be far 
from the actual optimum. In such circumstances, the objective of the experimenter is to move rap- 
idly to the general vicinity of the optimum. We wish to use a simple and economically efficient 
experimental procedure. When we are remote from the optimum, we usually assume that a first- 
order model is an adequate approximation to the true surface in a small region of the x's. 

The method of steepest ascent is a procedure for moving sequentially along the path of 
steepest ascent, that 1s, in the direction of the maximum increase in the response. Of course, if 
minimization is desired, we are talking about the method of steepest descent. The fitted 
first-order model is 


k 
$ Bo + Y Bix; (S14-15) 
i=l 


and the first-order response surface, that is, the contours of y, is a series of parallel lines such 
as that shown in Fig. S14-7. The direction of steepest ascent is the direction in which 
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y increases most rapidly. This direction is normal to the fitted response surface contours. We 
usually take as the path of steepest ascent the line through the center of the region of interest 
and normal to the fitted surface contours. Thus, the steps along the path are proportional to the 
regression coefficients {Ê}. The experimenter determines the actual step size based on 


process knowledge or other practical considerations. 


Experiments are conducted along the path of steepest ascent until no further increase in 
response is observed. Then a new first-order model may be fit, a new direction of steepest as- 
cent determined, and further experiments conducted in that direction until the experimenter 


feels that the process is near the optimum. 
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In Example S14-2 we described an experiment on a chemical process in which two factors, 
reaction time (x,) and reaction temperature (x,), affect the percent conversion or yield (Y). 
Figure S14-3 shows the 2? design plus five center points used in this study. The engineer 
found that both factors were important, there was no interaction, and there was no curvature 
in the response surface. Therefore, the first-order model 


Y= Bo + Pixi + Box + € 


should be appropriate. Now the effect estimate of time is 1.55 hours and the effect estimate of 
temperature is 0.65°F, and since the regression coefficients B; and B»; are one-half of the 
corresponding effect estimates, the fitted first-order model is 


$ = 4044 + 0.775x, + 0.325x5 


Figure $14-8(a) and (b) show the contour plot and three-dimensional surface plot of this 
model. Figure $14-8 also shows the relationship between the coded variables x, and x, (that 
defined the high and low levels of the factors) and the original variables, time (in minutes) and 
temperature (in ^F). 

From examining these plots (or the fitted model), we see that to move away from the de- 
sign center—the point (x, = 0, x, = 0)—along the path of steepest ascent, we would move 
0.775 unit in the x, direction for every 0.325 unit in the x, direction. Thus, the path of steepest 
ascent passes through the point (x, = 0, x, = 0) and has a slope 0.325/0.775. The engineer 
decides to use 5 minutes of reaction time as the basic step size. Now, 5 minutes of reaction 
time is equivalent to a step in the coded variable x, of Ax, = 1. Therefore, the steps along the 
path of steepest ascent are Ax, — 1.0000 and Ax, — (0.325/0.775)A x, — 0.42. A change of 
Ax, = 0.42 in the coded variable x, is equivalent to about 2°F in the original variable temper- 
ature. Therefore, the engineer will move along the path of steepest ascent by increasing 
reaction time by 5 minutes and temperature by 2°F. An actual observation on yield will be 
determined at each point. 

Figure S14-9 shows several points along this path of steepest ascent and the yields actually 
observed from the process at those points. At points A—D the observed yield increases steadily, but 
beyond point D, the yield decreases. Therefore, steepest ascent would terminate in the vicinity of 
55 minutes of reaction time and 163°F with an observed percent conversion of 67%. 






D 
Point A: 40 minutes, 157°F, y = 40.5 
Path of Point B: 45 minutes, 159°F, y = 51.3 
steepest ascent Point C: 50 minutes, 161°F, y = 59.6 
Point D: 55 minutes, 163°F, y = 67.1 
Point E: 60 minutes, 165°F, y = 63.6 
Point F: 65 minutes, 167?F, y = 60.7 
Original fitted 
contours 


40 50 60 70 
Time 


Figure S14-9 Steepest ascent experiment for Example S14-3. 
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EXAMPLE $14-4 


Analysis of a Second-Order Response Surface 

When the experimenter is relatively close to the optimum, a second-order model is usually 
required to approximate the response because of curvature in the true response surface. The 
fitted second-order model is 


where B denotes the least squares estimate of B. In this section we show how to use this fitted 
model to find the optimum set of operating conditions for the x's and to characterize the nature 
of the response surface. 


Continuation of Example 814-3 
Consider the chemical process from Example S14-3, where the method of steepest ascent 
terminated at a reaction time of 55 minutes and a temperature of 163°F. The experimenter decides 
to fit a second-order model in this region. Table 814-7 and Fig. S14-10 show the experimental 
design, which consists of a 2? design centered at 55 minutes and 165°F, five center points, and four 
runs along the coordinate axes called axial runs. This type of design is called a central compos- 
ite design, and it is a very popular design for fitting second-order response surfaces. 

Two response variables were measured during this phase of the experiment: percentage 
conversion (yield) and viscosity. The least-squares quadratic model for the yield response is 


9$, = 69.1 + 1.633x, + 1.083x, — 0.9693? — 1.21932 + 0.225x4x; 


The analysis of variance for this model is shown in Table 814-8. 

Figure S14-11 shows the response surface contour plot and the three-dimensional surface 
plot for this model. From examination of these plots, the maximum yield is about 70%, 
obtained at approximately 60 minutes of reaction time and 167°F. 

The viscosity response is adequately described by the first-order model 


Do = 37.08 + 3.85x, + 3.10x; 


Table S14-7 Central Composite Design for Example S14-4 


Conversion Viscosity 

Observation Time Temperature Coded Variables (percent) (mPa-sec) 

Number (minutes) CF) Xi X; Response 1 Response 2 
l 50 160 =] =] 65.3 33 
2 60 160 l =l 68.2 39 
3 50 170 =] l 66 36 
4 60 170 l l 69.8 43 
5 48 165 —1.414 0 64.5 30 
6 62 165 1.414 0 69 44 
7 55 158 0 — 1.414 64 31 
8 25 172 0 1.414 68.5 45 
9 35 165 0 0 68.9 37 
10 29 165 0 0 69.7 34 
11 39 165 0 0 68.5 35 
12 aD 165 0 0 69.4 36 
13 53 165 0 0 69 37 


Figure 814-10 
Central composite 
design for Example 
S14-4. 
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+2 
e (0, 1.414) 
(-1, 1) (1, 1) 
(-1.414, 0) (1.414, 0) 
© © © 
-2 (0, O) 42 x4 
(-1, -1) (1, -1) 
e (0, -1.414) 
-2 


Table S14-9 summarizes the analysis of variance for this model. The response surface is 
shown graphically in Fig. S14-12. Notice that viscosity increases as both time and tempera- 
ture increase. 

As in most response surface problems, the experimenter in this example had conflicting 
objectives regarding the two responses. The objective was to maximize yield, but the accept- 
able range for viscosity was 38 S y, S 42. When there are only a few independent variables, 
an easy way to solve this problem is to overlay the response surfaces to find the optimum. 
Figure 814-13 shows the overlay plot of both responses, with the contours y, = 69% conver- 
sion, y, = 38, and y, = 42 highlighted. The shaded areas on this plot identify unfeasible 
combinations of time and temperature. This graph shows that several combinations of time 
and temperature will be satisfactory. 


Example S14-4 illustrates the use of a central composite design (CCD) for fitting a 


second-order response surface model. These designs are widely used in practice because they 
are relatively efficient with respect to the number of runs required. In general, a CCD in k 


Table S14-8 Analysis of Variance for the Quadratic Model, Yield Response 


Source of Sum of Degrees of Mean 
Variation Squares Freedom Square fo P-Value 
Model 45.89 5 9.178 14.93 0.0013 
Residual 4.30 7 0.615 
Total 50.19 12 
Independent Coefficient Standard t for Ho 

Variable Estimate Error Coefficient = 0 P-Value 
Intercept 69.100 0.351 197.1 
Xx 1.633 0.277 5.891 0.0006 
Xj 1.083 0.277 3.907 0.0058 
Xl — 0.969 0.297 =3:299 0.0139 
x; =[,219 0.297 —4.100 0.0046 
X 4X5 0.225 0.392 0.5740 0.5839 
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Table S14-9 Analysis of Variance for the First-Order Model, Viscosity Response 


Source of Sum of Degrees of Mean 
Variation Squares Freedom Square fo P-Value 
Model 195.4 2 97.72 15.89 0.0008 
Residual 61.5 10 6.15 
Total 256.9 12 
Independent Coefficient Degrees of Standard t for Ho 
Variable Estimate Freedom Error Coefficient = 0 P-Value 
Intercept 37.08 i 0.69 53.91 
Xi 3.85 1 0.88 4.391 0.0014 
X» 3.10 1 0.88 3.536 0.0054 
1 172.0 7 
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Figure $14-12 
Response surface 
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viscosity response, 
Example S14-4. 
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factors requires 2^ factorial runs, 2k axial runs, and at least one center point (three to five cen- 
ter points are typically used). Designs for k = 2 and k = 3 factors are shown in Fig. S14-14. 

The central composite design may be made rotatable by proper choice of the axial 
spacing a in Fig. S14-14. If the design is rotatable, the standard deviation of predicted 
response y is constant at all points that are the same distance from the center of the design. For 
rotatability, choose o. — (F)'/^, where F is the number of points in the factorial part of the design 
(usually F = 2"). For the case of k = 2 factors, a = (22)/^ = 1.414, as was used in the 
design in Example S14-4. Figure S14-15 presents a contour plot and a surface plot of the stan- 
dard deviation of prediction for the quadratic model used for the yield response. Notice that 
the contours are concentric circles, implying that yield is predicted with equal precision for all 
points that are the same distance from the center of the design. Also, as one would expect, the 
precision decreases with increasing distance from the design center. 


14-20 


1 172.0 





169.7 Viscosity 


42.00 






38.00 
167.3 


165.0 
Conversion 
69.00 


x» (temperature) 
o 


162.7 


160.3 


-1 158.0 | 
48.00 50.33 52.67 55.00 57.33 59.67 6200 


-1 0 +1 
X1 (time) 


Figure S14-13 Overlay of yield and viscosity response surfaces, 
Example S14-4. 
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Figure S14-14 Central composite designs for k = 2 and k = 3. 
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Figure S14-15 Plots of constant V V(y) for a rotatable 
central composite design. 
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LEARNING OBJECTIVES 


After careful study of this chapter, you should be able to do the following: 

1. Determine situations where nonparametric procedures are better alternatives to the t-test and 
ANOVA 

2. Use one- and two-sample nonparametric tests 

3. Use nonparametric alternatives to the single-factor ANOVA 


4. Understand how nonparametric tests compare to the t-test in terms of relative efficiency 


Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e-Text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e- Text immediately following the section they accompany. 


571 


572 


CHAPTER 15 NONPARAMETRIC STATISTICS 


15-1 INTRODUCTION 


Most of the hypothesis-testing and confidence interval procedures discussed in previous chap- 
ters are based on the assumption that we are working with random samples from normal popu- 
lations. Traditionally, we have called these procedures parametric methods because they are 
based on a particular parametric family of distributions—1n this case, the normal. Alternately, 
sometimes we say that these procedures are not distribution-free because they depend on the as- 
sumption of normality. Fortunately, most of these procedures are relatively insensitive to slight 
departures from normality. In general, the t- and F-tests and the t-confidence intervals will have 
actual levels of significance or confidence levels that differ from the nominal or advertised lev- 
els chosen by the experimenter, although the difference between the actual and advertised levels 
is usually fairly small when the underlying population is not too different from the normal. 

In this chapter we describe procedures called nonparametric and distribution-free 
methods, and we usually make no assumptions about the distribution of the underlying pop- 
ulation other than that it is continuous. These procedures have actual level of significance a or 
confidence level 100(1 — a)% for many different types of distributions. These procedures 
have considerable appeal. One of their advantages 1s that the data need not be quantitative but 
can be categorical (such as yes or no, defective or nondefective) or rank data. Another advan- 
tage is that nonparametric procedures are usually very quick and easy to perform. 

The procedures described in this chapter are competitors of the parametric t- and 
F-procedures described earlier. Consequently, it is important to compare the performance of 
both parametric and nonparametric methods under the assumptions of both normal and non- 
normal populations. In general, nonparametric procedures do not utilize all the information 
provided by the sample. As a result, a nonparametric procedure will be less efficient than the 
corresponding parametric procedure when the underlying population is normal. This loss of 
efficiency is reflected by a requirement of a larger sample size for the nonparametric proce- 
dure than would be required by the parametric procedure in order to achieve the same power. 
On the other hand, this loss of efficiency is usually not large, and often the difference in sam- 
ple size is very small. When the underlying distributions are not close to normal, nonparamet- 
ric methods have much to offer. They often provide considerable improvement over the 
normal-theory parametric methods. 

Generally, if both parametric and nonparametric methods are applicable to a particular 
problem, we should use the more efficient parametric procedure. However, the assumptions for 
the parametric method may be difficult or impossible to justify. For example, the data may be in 
the form of ranks. These situations frequently occur in practice. For instance, a panel of judges 
may be used to evaluate 10 different formulations of a soft-drink beverage for overall quality, 
with the “best” formulation assigned rank 1, the “next-best” formulation assigned rank 2, and so 
forth. It is unlikely that rank data satisfy the normality assumption. Many nonparametric meth- 
ods involve the analysis of ranks and consequently are ideally suited to this type of problem. 


15-2 SIGN TEST 


15-2.1 Description of the Test 


The sign test is used to test hypotheses about the median fi of a continuous distribution. 
The median of a distribution is a value of the random variable X such that the probability 
is 0.5 that an observed value of X is less than or equal to the median, and the probability is 
0.5 that an observed value of X is greater than or equal to the median. That 1s, 
P(X = ð) = P(X = ft) = 0.5. 
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Since the normal distribution is symmetric, the mean of a normal distribution equals the 
median. Therefore, the sign test can be used to test hypotheses about the mean of a normal dis- 
tribution. This is the same problem for which we used the t-test in Chapter 9. We will discuss 
the relative merits of the two procedures in Section 15-2.4. Note that, although the t-test was 
designed for samples from a normal distribution, the sign test is appropriate for samples from 
any continuous distribution. Thus, the sign test is a nonparametric procedure. 

Suppose that the hypotheses are 


Ho: M = [io 
Hy: A < po (15-1) 
The test procedure is easy to describe. Suppose that X;, 35, . . . , X, 1S a random sample from 


the population of interest. Form the differences 
As = og, i= 1,2,...,n (15-2) 


Now if the null hypothesis Hp: 4 = [wo is true, any difference X; — Ho is equally likely 
to be positive or negative. An appropriate test statistic is the number of these differences that 
are positive, say R*. Therefore, to test the null hypothesis we are really testing that the 
number of plus signs is a value of a binomial random variable that has the parameter p — 1/2. 
A P-value for the observed number of plus signs r^ can be calculated directly from the bino- 
mial distribution. For instance, in testing the hypotheses in Equation 15-1, we will reject H in 
favor of H, only if the proportion of plus signs is sufficiently less than 1/2 (or equivalently, 
whenever the observed number of plus signs r^ is too small). Thus, if the computed P-value 


1 
p= 20 <r’ when p = >) 


is less than or equal to some preselected significance level o, we will reject Hy) and conclude 
H, is true. 
To test the other one-sided hypothesis 


Ho: Ñ = Lo 
Hi: K > Ho (15-3) 
we will reject Ho in favor of H, only if the observed number of plus signs, say r”, is large or, 


equivalently, whenever the observed fraction of plus signs is significantly greater than 1/2. 
Thus, if the computed P-value 


1 
P- (Rt =r" when p = >) 


is less than a, we will reject A, and conclude that 77, is true. 
The two-sided alternative may also be tested. If the hypotheses are 


Hyg o (1544) 
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we should reject Ho: ju = [Mo if the proportion of plus signs is significantly different (either 
less than or greater than) from 1/2. This is equivalent to the observed number of plus signs r” 
being either sufficiently large or sufficiently small. Thus, if r* < n/2 the P-value is 


l 
P = 2P (r < y* when p = >) 
and if r^ > n/2 the P-value is 


1 
p= 27(a" = r* when p =>) 


If the P-value is less than some preselected level a, we will reject Hy and conclude that H; is true. 


Montgomery, Peck, and Vining (2001) report on a study in which a rocket motor is formed by 
binding an igniter propellant and a sustainer propellant together inside a metal housing. The 
shear strength of the bond between the two propellant types is an important characteristic. The 
results of testing 20 randomly selected motors are shown in Table 15-1. We would like to test 
the hypothesis that the median shear strength is 2000 psi, using a = 0.05. 

This problem can be solved using the eight-step hypothesis-testing procedure introduced 
in Chapter 9: 


1. The parameter of interest is the median of the distribution of propellant shear strength. 
2. Ho: p = 2000 psi 

3. Hy: ù # 2000 psi 

4. a = 0.05 


Table 15-1 Propellant Shear Strength Data 


Observation Shear Strength Differences 
i oc x; — 2000 Sign 
| 2158.70 +158.70 + 
2 1678.15 =32 183 = 
3 2316.00 +316.00 + 
4 2061.30 + 61.30 + 
5 2207.50 +207.50 + 
6 1708.30 291.70 = 
7 1784.70 =2 15.30 m 
8 2575.10 +575.10 + 
9 2357.90 +357.90 + 
10 2256.70 +256.70 + 
11 2165.20 +165.20 + 
12 2399.55 3-399.55 + 
13 1779.80 —220.20 = 
14 2330.75 336.75 + 
15 1765.30 —234.70 a 
16 2053.50 +53.50 + 
17 2414.40 +414.40 + 
18 2200.50 +200.50 + 
19 2654.20 + 654.20 + 
20 1753.70 —246.30 = 


15-2 SIGN TEST 575 


5. The test statistic is the observed number of plus differences in Table 15-1, or 


r' = 14. 
6. We will reject Hy if the P-value corresponding to r* = 14 is less than or equal to 
a = 0.05. 


7. Computations: Since r' = 14 is greater than n/2 = 20/2 = 10, we calculate the 
P-value from 


1 
P = 2P (R = 14 when p -3 


=o Y (osos 


r=14 
= 0.1153 


8. Conclusions: Since P = 0.1153 is not less than a = 0.05, we cannot reject the null 
hypothesis that the median shear strength is 2000 psi. Another way to say this is that 
the observed number of plus signs r* = 14 was not large or small enough to indi- 
cate that median shear strength is different from 2000 psi at the a = 0.05 level of 
significance. 


It is also possible to construct a table of critical values for the sign test. This table is 
shown as Appendix Table VII. The use of this table for the two-sided alternative hypothesis in 
Equation 15-4 is simple. As before, let R* denote the number of the differences (X; — (Xo) that 
are positive and let R` denote the number of these differences that are negative. Let R = min 
(R^, R`). Appendix Table VII presents critical values r* for the sign test that ensure that P 
(type I error) = P (reject Hy when Hj is true) = a fora = 0.01, a = 0.05 and a = 0.10. If 
the observed value of the test statistic r < r£, the null hypothesis Ho: ù = po should be 
rejected. 

To illustrate how this table is used, refer to the data in Table 15-1 that was used in 
Example 15-1. Now r^ = 14 andr” = 6; therefore, r = min (14, 6) = 6. From Appendix 
Table VII with n = 20 and a = 0.05, we find that 7595 = 5. Since r = 6 is not less than or 
equal to the critical value r§); = 5, we cannot reject the null hypothesis that the median shear 
strength is 2000 psi. 

We can also use Appendix Table VII for the sign test when a one-sided alternative 
hypothesis is appropriate. If the alternative is Hı: Ñ > fo, reject Hg: p = Ao ifr = r$ 
if the alternative is H,: Ñ > Ño, reject Hg: À = Ao ifr” < r*. The level of significance of 
a one-sided test 1s one-half the value for a two-sided test. Appendix Table VII shows the 
one-sided significance levels in the column headings immediately below the two-sided 
levels. 

Finally, note that when a test statistic has a discrete distribution such as R does in the sign 
test, it may be impossible to choose a critical value r* that has a level of significance exactly 
equal to a. The approach used in Appendix Table VII is to choose r* to yield an a that is as 
close to the advertised significance level a as possible. 


Ties in the Sign Test 

Since the underlying population is assumed to be continuous, there is a zero probability that 
we will find a *tie"—that is, a value of X; exactly equal to jug. However, this may sometimes 
happen in practice because of the way the data are collected. When ties occur, they should be 
set aside and the sign test applied to the remaining data. 
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The Normal Approximation 

When p = 0.5, the binomial distribution is well approximated by a normal distribution when 
n is at least 10. Thus, since the mean of the binomial is np and the variance is np(1 — p), the 
distribution of R* is approximately normal with mean 0.5n and variance 0.25n whenever n is 
moderately large. Therefore, in these cases the null hypothesis Hp: ju = [io can be tested using 
the statistic 





The two-sided alternative would be rejected if the observed value of the test statistic 
|Zo] > Zq/2, and the critical regions of the one-sided alternative would be chosen to reflect the 
sense of the alternative. (If the alternative is H,: ù > wo, reject Ho if Zo > Za, for example.) 


We will illustrate the normal approximation procedure by applying it to the problem in 
Example 15-1. Recall that the data for this example are in Table 15-1. The eight-step proce- 
dure follows: 


1. The parameter of interest is the median of the distribution of propellant shear strength. 


2. Hp: ù = 2000 psi 
3. Hy: 2000 psi 
4. a= 0.05 
5. The test statistic is 
E r' — 0.5n 
7 Q5 Vn 


Since a = 0.05, we will reject Hy in favor of H, if |Zo| > zo 955 = 1.96. 


Computations: Since r* = 14, the test statistic is 


_ 14 — 0.5(20) 


NEN ee) 
i 0.5 V20 


= 1.789 


8. Conclusions: Since z) = 1.789 is not greater than Zoo25 = 1.96, we cannot reject the 
null hypothesis. Thus, our conclusions are identical to those in Example 15-1. 


15-2.2 Sign Test for Paired Samples 


The sign test can also be applied to paired observations drawn from continuous populations. 
Let (Xi 35), J = 1,2,. . . , n be a collection of paired observations from two continuous pop- 
ulations, and let 


D= hig = Ay J=1,2,...,n 


EXAMPLE 15-3 
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be the paired differences. We wish to test the hypothesis that the two populations have a 
common median, that is, that [jj = ji». This is equivalent to testing that the median of the 
differences tp = 0. This can be done by applying the sign test to the n observed differences 
d, as illustrated in the following example. 


An automotive engineer is investigating two different types of metering devices for an 
electronic fuel injection system to determine whether they differ in their fuel mileage 
performance. The system is installed on 12 different cars, and a test is run with each meter- 
ing device on each car. The observed fuel mileage performance data, corresponding differ- 
ences, and their signs are shown in Table 15-2. We will use the sign test to determine whether 
the median fuel mileage performance is the same for both devices using a = 0.05. The eight- 
step-procedure follows: 


1. The parameters of interest are the median fuel mileage performance for the two 
metering devices. 


2. Ho: ùi = p», or, equivalently, Hp: jp = 0 

3. Ay: wy Æ por, equivalently, Hi: Up # 0 

4. a= 0.05 

5. We will use Appendix Table VII to conduct the test, so the test statistic is r — 
min(r^, r ). 


6. Sincea = 0.05 and n = 12, Appendix Table VII gives the critical values as r$o5 = 2. 
We will reject Hy in favor of H, ifr S 2. 


7. Computations: Table 15-2 shows the differences and their signs, and we note that 
r = 8,r =4,andsor = min(8, 4) = 4. 


8. Conclusions: Since r = 4 is not less than or equal to the critical value r§); = 2, we 
cannot reject the null hypothesis that the two devices provide the same median fuel 
mileage performance. 


Table 15-2 Performance of Flow Metering Devices 


Metering Device 


Car l 2 Difference, d; Sign 
l 17.6 16.8 0.8 T 
2 19.4 20.0 —0.6 = 
3 19.5 18.2 1.3 is 
17.1 16.4 0.7 T 
5 15.3 16.0 —0.7 — 
6 15:9 15.4 0.5 F 
7 16.3 16.5 —0.2 — 
8 18.4 18.0 0.4 $ 
9 13:3 16.4 0.9 T 

10 19.1 20.1 —1.0 - 
11 [5S 16.7 LI + 
12 18.2 17.9 0.3 $ 
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15-2.3 Type II Error for the Sign Test 


Figure 15-1  Calcula- 
tion of B for the sign 
test. (a) Normal 
distributions. (b) 
Exponential 
distributions. 


The sign test will control the probability of type I error at an advertised level a for testing the null 
hypothesis Hp: j& = pw for any continuous distribution. As with any hypothesis-testing procedure, 
it is important to investigate the probability of a type II error, 8. The test should be able to effec- 
tively detect departures from the null hypothesis, and a good measure of this effectiveness is the 
value of B for departures that are important. A small value of B implies an effective test procedure. 

In determining Q, it is important to realize not only that a particular value of ù, say [ug + A, 
must be used but also that the form of the underlying distribution will affect the calculations. To 
illustrate, suppose that the underlying distribution is normal with o = 1 and we are testing the 
hypothesis Hp: ù = 2 versus H,: Ñ > 2. (Since ù = p in the normal distribution, this is equiv- 
alent to testing that the mean equals 2.) Suppose that it 1s important to detect a departure from 
|. = 2 to ù = 3. The situation is illustrated graphically in Fig. 15-1(a). When the alternative 
hypothesis is true (/7,: ù = 3), the probability that the random variable X is less than or equal to 
the value 2 is 


p^ P(X x 2) = P(Z € —1) = d$(-1) = 0.1587 
Suppose we have taken a random sample of size 12. At the a = 0.05 level, Appendix Table VII 


indicates that we would reject Hp: ju. = 2ifr zx r$9; = 2. Therefore, B is the probability that 
we do not reject Hp: ju = 2 when in fact ù = 3, or 


2 
12 
B=1- >( )(o.1587)(0.8413)" = 0.2944 
X 





Under Hp: u - 2 Under H4 : w=3 
(a) 
0.3699 
i-2 u-2.89 x 2 u= 4.33 x 
Under Hp: u 2 2 Under H1 : u -3 
(b) 
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If the distribution of X had been exponential rather than normal, the situation would be as 
shown in Fig. 15-1(b), and the probability that the random variable X is less than or equal 
to the value x = 2 when ù = 3 (note that when the median of an exponential distribution 
is 3, the mean is 4.33) is 


2 
l 1 
= P(X <2) = | ——e 43% dx = 0.3699 
pones E — 
In this case, 
2 TIS 
B=1- X (0.3699)*(0.6301)!2-* = 0.8794 
x=0 NX 


Thus, B for the sign test depends not only on the alternative value of ù but also on the 
area to the right of the value specified in the null hypothesis under the population probabil- 
ity distribution. This area is highly dependent on the shape of that particular probability dis- 
tribution. 


15-2.4 Comparison to the t-Test 


If the underlying population is normal, either the sign test or the t-test could be used to test 
Ho: Ù = Wo. The t-test is known to have the smallest value of B possible among all tests that 
have significance level a for the one-sided alternative and for tests with symmetric critical re- 
gions for the two-sided alternative, so it is superior to the sign test in the normal distribution 
case. When the population distribution is symmetric and nonnormal (but with finite mean 
|. = H), the t-test will have a smaller B (or a higher power) than the sign test, unless the dis- 
tribution has very heavy tails compared with the normal. Thus, the sign test is usually consid- 
ered a test procedure for the median rather than as a serious competitor for the t-test. The 
Wilcoxon signed-rank test discussed in the next section is preferable to the sign test and 
compares well with the t-test for symmetric distributions. 


EXERCISES FOR SECTION 15-2 


15-1. Tensamples were taken from a plating bath used in an 
electronics manufacturing process, and the bath pH was deter- 
mined. The sample pH values are 7.91, 7.85, 6.82, 8.01, 7.46, 
6.95, 7.05, 7.35, 7.25, 7.42. Manufacturing engineering be- 
lieves that pH has a median value of 7.0. Do the sample data 
indicate that this statement is correct? Use the sign test with 
a = 0.05 to investigate this hypothesis. Find the P-value for 
this test. 

15-2. The titanium content in an aircraft-grade alloy is an 
important determinant of strength. A sample of 20 test coupons 
reveals the following titanium content (in percent): 


8.32, 8.05, 8.93, 8.65, 8.25, 8.46, 8.52, 8.35, 8.36, 8.41, 8.42, 
8.30, 8.71, 8.75, 8.60, 8.83, 8.50, 8.38, 8.29, 8.46 


The median titanium content should be 8.5%. Use the sign test 
with a = 0.05 to investigate this hypothesis. Find the P-value 
for this test. 


15-3. The impurity level (in ppm) is routinely measured in 
an intermediate chemical product. The following data were 
observed in a recent test: 


242, Vly IOs L9,2:6, 1.5. 1:9, 20,25; 2.6, 2.32.9, 18, 
1.3, 1.7, 2.0, 1.9, 2.3, 1.9, 2.4, 1:6 


Can you claim that the median impurity level is less than 
2.5 ppm? State and test the appropriate hypothesis using the 
sign test with a = 0.05. What is the P-value for this 
test? 
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15-4. Consider the data in Exercise 15-1. Use the normal 
approximation for the sign test to test Hp: À = 7.0 versus 
Hi: ù # 7.0. What is the P-value for this test? 


15-5. Consider the compressive strength data in Exercise 

8-26. 

(a) Use the sign test to investigate the claim that the median 
strength is at least 2250 psi. Use a = 0.05. 

(b) Use the normal approximation to test the same hypothesis 
that you formulated in part (a). What is the P-value for 
this test? 


15-6. Consider the margarine fat content data in Exercise 
8-25. Use the sign test to test Hy: — 17.0 versus 
Hi: Ñ # 17.0, with a = 0.05. Find the P-value for the test 
statistic and use this quantity to make your decision. 


15-7. Consider the data in Exercise 15-2. Use the normal 
approximation for the sign test to test Hp: [v = 8.5 versus 
Ay: ù # 8.5, with a = 0.05. What is the P-value for this 
test? 


15-8. Consider the data in Exercise 15-3. Use the normal 
approximation for the sign test to test Hp: Ñ = 2.5 versus 
Hi: Ñ < 2.5. What is the P-value for this test? 


15-9. Two different types of tips can be used in a Rockwell 
hardness tester. Eight coupons from test ingots of a nickel- 
based alloy are selected, and each coupon is tested twice, once 
with each tip. The Rockwell C-scale hardness readings are 
shown in the following table. Use the sign test with a = 0.05 
to determine whether or not the two tips produce equivalent 
hardness readings. 


Coupon Tip l Mp2 
l 63 60 
2 52 51 
3 58 56 
4 60 59 
5 55 58 
6 57 54 
7 53 52 
8 59 61 


15-10. Two different formulations of primer paint can be 
used on aluminum panels. The drying time of these two for- 
mulations is an important consideration in the manufacturing 
process. Twenty panels are selected; half of each panel is 
painted with primer 1, and the other half is painted with 
primer 2. The drying times are observed and reported in the 
following table. Is there evidence that the median drying 
times of the two formulations are different? Use the sign test 
witha = 0.01. 


Drying Times (in hr) 
Panel Formulation 1 Formulation 2 
l 1.6 1.8 
2 LS 1.5 
3 L3 1.5 
4 1.6 1.7 
5 LT 1.6 
6 1.9 2.0 
J 1.8 2l 
8 1.6 1.7 
9 1.4 1.6 
10 1.8 1.9 
11 1.9 2.0 
12 1.8 1.9 
13 LT 1.5 
14 1.5 L7 
15 1.6 1.6 
16 1.4 1.2 
17 1.3 1.6 
18 1.6 1.8 
19 1.5 1.6 
20 1.8 2.0 
15-11. Use the normal approximation to the sign test for the 


data in Exercise 15-10. What conclusions can you draw? 
15-12. The diameter of a ball bearing was measured by 12 
inspectors, each using two different kinds of calipers. The 
results were as follows: 


Inspector Caliper 1 Caliper 2 
1 0.265 0.264 
2 0.265 0.265 
3 0.266 0.264 
4 0.267 0.266 
5 0.267 0.267 
6 0.265 0.268 
7 0.267 0.264 
8 0.267 0.265 
9 0.265 0.265 

10 0.268 0.267 
11 0.268 0.268 
12 0.265 0.269 


Is there a significant difference between the medians of the 
population of measurements represented by the two samples? 
Use a = 0.05. 


15-13. Consider the blood cholesterol data in Exercise 
10-39. Use the sign test to determine whether there is any dif- 
ference between the medians of the two groups of measure- 
ments, with a = 0.05. What practical conclusion would you 
draw from this study? 


15-14. Use the normal approximation for the sign test for 
the data in Exercise 15-12. With a = 0.05, what conclusions 
can you draw? 


15-15. Use the normal approximation to the sign test for 
the data in Exercise 15-13. With a = 0.05, what conclusions 
can you draw? 


15-16. The distribution time between arrivals in a telecom- 

munication system is exponential, and the system manager 

wishes to test the hypothesis that Hp: [& = 3.5 minutes versus 

Hi: & > 3.5 minutes. 

(a) What is the value of the mean of the exponential distribu- 
tion under Hp: ù = 3.5? 

(b) Suppose that we have taken a sample of n = 10 observa- 
tions and we observer = 3. Would the sign test reject Ho 
ata = 0.05? 
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(c) What is the type II error probability of this test if ù = 
4.5? 

15-17. Suppose that we take a sample of n = 10 measure- 

ments from a normal distribution with o = 1. We wish to test 

Ho: w = 0 against H,: u > 0. The normal test statistic is 

Zo = X/(a/ Vn), and we decide to use a critical region of 1.96 

(that is, reject Ho if zọ = 1.96). 

(a) What is a for this test? 

(b) What is B for this test, if ju = 1? 

(c) If a sign test is used, specify the critical region that gives 
an a value consistent with a for the normal test. 

(d) What is the B value for the sign test, if ju. = 1? Compare 
this with the result obtained in part (b). 

15-18. Consider the test statistic for the sign test in 

Exercise 15-9. Find the P-value for this statistic. 

15-19. Consider the test statistic for the sign test in Exercise 

15-10. Find the P-value for this statistic. Compare it to the 

P-value for the normal approximation test statistic computed 

in Exercise 15-11. 


The sign test makes use only of the plus and minus signs of the differences between the ob- 
servations and the median pg (or the plus and minus signs of the differences between the 
observations in the paired case). It does not take into account the size or magnitude of these 
differences. Frank Wilcoxon devised a test procedure that uses both direction (sign) and mag- 
nitude. This procedure, now called the Wilcoxon signed-rank test, is discussed and illus- 


trated in this section. 


The Wilcoxon signed-rank test applies to the case of symmetric continuous distribu- 
tions. Under these assumptions, the mean equals the median, and we can use this procedure to 
test the null hypothesis that m 5 m0. We now show how to do this. 


15-3.1 Description of the Test 


We are interested in testing Hp): w = p against the usual alternatives. Assume that Xj, 
X>,...,X,18 a random sample from a continuous and symmetric distribution with mean (and 
median) u. Compute the differences X; — wo, i = 1,2,...,n. Rank the absolute differences 
IX; — Mol, i = 1,2,...,n in ascending order, and then give the ranks the signs of their 
corresponding differences. Let W^ be the sum of the positive ranks and W` be the absolute 
value of the sum of the negative ranks, and let W = min(W*, W`). Appendix Table VIII con- 
tains critical values of W, say w*. If the alternative hypothesis is Hj: w # wo, then if the ob- 
served value of the statistic w < w*, the null hypothesis Hp: w = pọ is rejected. Appendix 
Table VIII provides significance levels of a = 0.10, a = 0.05, a = 0.02, a = 0.01 for the 


two-sided test. 


For one-sided tests, if the alternative is H,: w > wo, reject Hg: jp = pg ifw S w*; and if 
the alternative is H,: |. < po, reject Hy: y = po if w^ < w*. The significance levels for one- 
sided tests provided in Appendix Table VIII are a = 0.05, 0.025, 0.01, and 0.005. 
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EXAMPLE 15-4 We will illustrate the Wilcoxon signed-rank test by applying it to the propellant shear strength 
data from Table 15-1. Assume that the underlying distribution is a continuous symmetric dis- 
tribution. The eight-step procedure is applied as follows: 


1. The parameter of interest is the mean (or median) of the distribution of propellant 
shear strength. 

Hy: w = 2000 psi 

Hi: w # 2000 psi 

a = 0.05 


The test statistic is 


SS pe 


w = min(w’,w ) 


We will reject Hy if w S w§o5 = 52 from Appendix Table VIII. 


Computations: The signed ranks from Table 15-1 are shown in the following table: 


Observation Difference x; — 2000 Signed Rank 


16 +53.50 Fl 
4 +61.30 F2 
l +158.70 T? 

11 T 165.20 +4 

18 +200.50 Ta 

+207.50 +6 
=215.30 m 

13 —220.20 —8 

15 —234.70 =9 

20 —246.30 —10 

10 +256.70 gaii 
6 —291.70 —12 
B T 316.00 zd iE. 
2 —321.85 —14 

14 +336.75 Fls 
9 +357.90 +16 

12 7399.55 3 

17 +414.40 +18 
8 +575.10 F19 

19 +654.20 +20 


The sum of the positive ranks is w° = (1 +2+34+4+4+5+6+4+11+134+15+ 
16 + 17 + 18 + 19 + 20) = 150, and the sum of the absolute values of the negative 
ranks isw =(7+8+9+4 10 + 12 + 14) = 60. Therefore, 


w = min(150, 60) = 60 
8. Conclusions: Since w = 60 is not less than or equal to the critical value wy 95 = 52, 


we cannot reject the null hypothesis that the mean (or median, since the population is 
assumed to be symmetric) shear strength is 2000 psi. 
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Ties in the Wilcoxon Signed-Rank Test 

Because the underlying population is continuous, ties are theoretically impossible, although 
they will sometimes occur in practice. If several observations have the same absolute magni- 
tude, they are assigned the average of the ranks that they would receive if they differed slightly 
from one another. 


15-3.2 Large-Sample Approximation 


If the sample size is moderately large, say n > 20, it can be shown that W^ (or W`) has 
approximately a normal distribution with mean 


n(n + 1) 


and variance 


Opt = 


Therefore, a test of Hy: ju. = po can be based on the statistic 





An appropriate critical region for either the two-sided or one-sided alternative hypotheses can 
be chosen from a table of the standard normal distribution. 


15-3.3 Paired Observations 


EXAMPLE 15-5 


The Wilcoxon signed-rank test can be applied to paired data. Let (Xj, 35), J = 1,2,...,nbe 
a collection of paired observations from two continuous distributions that differ only with re- 
spect to their means. (It is not necessary that the distributions of X, and_X, be symmetric.) This 
assures that the distribution of the differences D, = X;; — X5; is continuous and symmetric. 
Thus, the null hypothesis is Hp: uj, = po, which is equivalent to Hp: Wp = 0. We initially con- 
sider the two-sided alternative H,: w, ~ p, (or Hi: up # 0). 

To use the Wilcoxon signed-rank test, the differences are first ranked in ascending order of 
their absolute values, and then the ranks are given the signs of the differences. Ties are assigned 
average ranks. Let W^ be the sum of the positive ranks and W` be the absolute value of the sum 
of the negative ranks, and W = min(W^, W`). If the observed value w = w*, the null hypoth- 
esis Hy: y; = W, (or Ho: wp = 0) 1s rejected where w* is chosen from Appendix Table VIII. 

For one-sided tests, if the alternative is Hj: u; > p (or H4: wp > 0), reject Ho ifw 5S wx; 
and if Hi: u; < p, (or Hy: wp < 0), reject Hj if w^ < w*. Be sure to use the one-sided test 
significance levels shown in Appendix Table VIII. 


We will apply the Wilcoxon signed-rank test to the fuel-metering device test data used previously 
in Example 15-3. The eight-step hypothesis-testing procedure can be applied as follows: 


1. The parameters of interest are the mean fuel mileage performance for the two meter- 
ing devices. 
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Ho: u; = w, or, equivalently, Ho: wp = 0 
Hi: u; 7 w, or, equivalently, H,: wp ~ 0 
a = 0.05 


The test statistic is 


un de SS 


w = min(w",w ) 
where w* and w` are the sums of the positive and negative ranks of the differences 


in Table 15-2. 


6. Since a = 0.05 andn = 12, Appendix Table VIII gives the critical value as wọ; = 13. 
We will reject Ho: wp = O0 if w s 13. 


7. Computations: Using the data in Table 15-2, we compute the following signed ranks: 


Car Difference Signed Rank 
7 —0.2 =] 
12 0.3 2 
8 0.4 3 
6 0.5 4 
2 —0.6 =5 
4 0.7 6.5 
5 —0.7 —D.3 
1 0.8 8 
9 0.9 9 
10 —1.0 —10 
11 1.1 11 
3 1.3 12 


Note that w* = 55.5 and w` = 22.5. Therefore, 
w = min(55.5, 22.5) = 22.5 


8. Conclusions: Since w = 22.5 is not less than or equal to wos = 13, we cannot reject the 
null hypothesis that the two metering devices produce the same mileage performance. 


15-3.4 Comparison to the t-Test 


When the underlying population is normal, either the t-test or the Wilcoxon signed-rank test 
can be used to test hypotheses about u. As mentioned earlier, the t-test is the best test in such 
situations in the sense that it produces a minimum value of for all tests with significance 
level a. However, since it is not always clear that the normal distribution is appropriate, and 
since in many situations it is inappropriate, it is of interest to compare the two procedures for 
both normal and nonnormal populations. 

Unfortunately, such a comparison is not easy. The problem is that B for the Wilcoxon 
signed-rank test is very difficult to obtain, and B for the t-test is difficult to obtain for nonnormal 
distributions. Because type II error comparisons are difficult, other measures of comparison 
have been developed. One widely used measure is asymptotic relative efficiency (ARE). 
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The ARE of one test relative to another is the limiting ratio of the sample sizes necessary to 
obtain identical error probabilities for the two procedures. For example, if the ARE of one test 
relative to the competitor is 0.5, when sample sizes are large, the first test will require twice as 
large a sample as the second one to obtain similar error performance. While this does not tell 
us anything for small sample sizes, we can say the following: 


1. For normal populations, the ARE of the Wilcoxon signed-rank test relative to the 
t-test is approximately 0.95. 


2. Fornonnormal populations, the ARE is at least 0.86, and in many cases it will exceed 
unity. 


Although these are large-sample results, we generally conclude that the Wilcoxon signed-rank 
test will never be much worse than the t-test and that in many cases where the population is non- 
normal it may be superior. Thus, the Wilcoxon signed-rank test is a useful alternate to the t-test. 


EXERCISES FOR SECTION 15-3 


15-20. Consider the data in Exercise 15-1 and assume that 
the distribution of pH is symmetric and continuous. Use the 
Wilcoxon signed-rank test with a = 0.05 to test the hypothe- 
sis Hy: w = 7 against Hy: p # 7. 

15-21. Consider the data in Exercise 15-2. Suppose that the 
distribution of titanium content is symmetric and continuous. 
Use the Wilcoxon signed-rank test with a = 0.05 to test the 
hypothesis Hp: u = 8.5 versus H,: u # 8.5. 

15-22. Consider the data in Exercise 15-2. Use the large- 
sample approximation for the Wilcoxon signed-rank test to test 
the hypothesis Hp: w = 8.5 versus H,: p # 8.5. Use a = 0.05. 
Assume that the distribution of titanium content is continuous 
and symmetric. 

15-23. Consider the data in Exercise 15-3. Use the Wilcoxon 
signed-rank test to test the hypothesis Hp: ju = 2.5 ppm versus 
Hi: y < 2.5 ppm with a = 0.05. Assume that the distribution of 
impurity level is continuous and symmetric. 
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15-24. Consider the Rockwell hardness test data in 
Exercise 15-9. Assume that both distributions are continuous 
and use the Wilcoxon signed-rank test to test that the mean 
difference in hardness readings between the two tips is zero. 
Use a — 0.05. 


15-25. Consider the paint drying time data in Exercise 15-10. 
Assume that both populations are continuous, and use the 
Wilcoxon signed-rank test to test that the difference in mean 
drying times between the two formulations is zero. Use 
a = 0.01. 


15-26. Apply the Wilcoxon signed-rank test to the meas- 
urement data in Exercise 15-12. Use a = 0.05 and as- 
sume that the two distributions of measurements are contin- 
uous. 

15-27. Apply the Wilcoxon signed-rank test to the blood cho- 
lesterol data from Exercise 10-39. Use a = 0.05 and assume that 
the two distributions are continuous. 


Suppose that we have two independent continuous populations X, and X, with means p; and 
|». Assume that the distributions of X, and X, have the same shape and spread and differ only 
(possibly) in their locations. The Wilcoxon rank-sum test can be used to test the hypothesis 
Ho: u, = pz. This procedure is sometimes called the Mann-Whitney test, although the Mann- 
Whitney test statistic is usually expressed in a different form. 


15-4.1 Description of the Test 


Let Xi, Xi? - .., Xin, and X21, X22, . - - , Xn, be two independent random samples of sizes n, = 
n, from the continuous populations X; and X, described earlier. We wish to test the hypotheses 


Ho: pı = po 
Hi: y F go 
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EXAMPLE 15-6 


The test procedure is as follows. Arrange all n, + n, observations in ascending order of 
magnitude and assign ranks to them. If two or more observations are tied (identical), use the 
mean of the ranks that would have been assigned if the observations differed. 

Let W, be the sum of the ranks in the smaller sample (1), and define W, to be the sum of 
the ranks in the other sample. Then, 


Ny + nyj(n + Ny + | 
y, = m t mot D W, (15-7) 


Now if the sample means do not differ, we will expect the sum of the ranks to be nearly equal 
for both samples after adjusting for the difference in sample size. Consequently, if the sums of 
the ranks differ greatly, we will conclude that the means are not equal. 

Appendix Table IX contains the critical value of the rank sums for a = 0.05 and a = 0.01 
assuming the two-sided alternative above. Refer to Appendix Table IX with the appropriate 
sample sizes n, and r5, and the critical value w, can be obtained. The null Hp: w; = p is 
rejected in favor of H,: u; Æ w, if either of the observed values w, or w, is less than or equal 
to the tabulated critical value w,. 

The procedure can also be used for one-sided alternatives. If the alternative is H,: p; < p», 
reject Ho if w; S wa; for H,: w; > p, reject Ho if w> S wa. For these one-sided tests, the tabu- 
lated critical values w, correspond to levels of significance of a = 0.025 and a = 0.005. 


The mean axial stress in tensile members used in an aircraft structure is being studied. Two alloys 
are being investigated. Alloy 1 is a traditional material, and alloy 2 is a new aluminum-lithium al- 
loy that is much lighter than the standard material. Ten specimens of each alloy type are tested, 
and the axial stress is measured. The sample data are assembled in Table 15-3. Using a = 0.05, we 
wish to test the hypothesis that the means of the two stress distributions are identical. 

We will apply the eight-step hypothesis-testing procedure to this problem: 


1. The parameters of interest are the means of the two distributions of axial stress. 


2. Hy by = w 

3. Ay: py 7 p 

4. a — 0.05 

5. We will use the Wilcoxon rank-sum test statistic in Equation 15-7, 


(ni + m)(n + nj + 1) 


W = 2 


6. Since a = 0.05 and n, = n, = 10, Appendix Table IX gives the critical value as wo 95 = 
78. If either w, or w, is less than or equal to Wo 95 = 78, we will reject Ho: wy = p. 


Table 15-3 Axial Stress for Two Aluminum-Lithium Alloys 


Alloy 1 Alloy 2 
238 psi 3254 psi 3261 psi 3248 psi 
3195 3229 3187 3215 
3246 3225 3209 3226 
3190 3217 3212 3240 


3204 3241 3258 3234 
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7. Computations: The data from Table 15-3 are analyzed in ascending order and ranked 
as follows: 


Alloy Number Axial Stress Rank 
2 3187 psi l 
1 3190 2 
1 3195 3 
1 3204 4 
2 3209 5 
2 3212 6 
2 3215 T 
1 3217 8 
1 3225 9 
2 3226 10 
1 3229 11 
2 3234 12 
1 3238 13 
2 3240 14 
1 3241 15 
| 3246 16 
2 3248 17 
1 3254 18 
2 3258 19 
2 3261 20 


The sum of the ranks for alloy 1 is 


w =2+3+4+8+9+11+ 13 t I5 t 164+ 18 = 99 


and for alloy 2 


(n, + njn, +m +1) (10 + 10)(10 + 10 + 1) 
ee eeypes c dno 


8. Conclusions: Since neither w; nor w, is less than or equal to W995 = 78, we cannot 
reject the null hypothesis that both alloys exhibit the same mean axial stress. 


W = 


15-4.2 Large-Sample Approximation 


When both n, and n, are moderately large, say, greater than 8, the distribution of w, can be 
well approximated by the normal distribution with mean 


n(n + n F 1) 
My, — -= Şo 


and variance 


, _ nmn + nz + 1) 
SNO 12 
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Therefore, for n, and n; > 8, we could use 





as a statistic, and the appropriate critical region is |zo| > zyj5, Zo > Za, OF Zo < — Zo 
depending on whether the test 1s a two-tailed, upper-tail, or lower-tail test. 


15-4.3 Comparison to the t-Test 


In Section 15-3.4 we discussed the comparison of the t-test with the Wilcoxon signed-rank 
test. The results for the two-sample problem are identical to the one-sample case. That is, 
when the normality assumption is correct, the Wilcoxon rank-sum test is approximately 95% 
as efficient as the t-test in large samples. On the other hand, regardless of the form of the dis- 
tributions, the Wilcoxon rank-sum test will always be at least 86% as efficient. The efficiency 
of the Wilcoxon test relative to the t-test is usually high if the underlying distribution has heav- 
ier tails than the normal, because the behavior of the t-test is very dependent on the sample 
mean, which is quite unstable in heavy-tailed distributions. 


EXERCISES FOR SECTION 15-4 


15-28. An electrical engineer must design a circuit to deliver 
the maximum amount of current to a display tube to achieve suf- 
ficient image brightness. Within her allowable design constraints, 
she has developed two candidate circuits and tests prototypes of 
each. The resulting data (in microamperes) are as follows: 


Circuit 1: | 251, 255, 258, 257, 250, 251, 254, 250, 248 





Circuit 2: | 250, 253, 249, 256, 259, 252, 260, 251 


Use the Wilcoxon rank-sum test to test Ho: pj, = p against the 
alternative H,: u; > jjj. Use a = 0.025. 


15-29. One of the authors travels regularly to Seattle, 
Washington. He uses either Delta or Alaska. Flight delays are 
sometimes unavoidable, but he would be willing to give most 
of his business to the airline with the best on-time arrival 
record. The number of minutes that his flight arrived late for 
the last six trips on each airline follows. Is there evidence that 
either airline has superior on-time arrival performance? Use 
a = 0.01 and the Wilcoxon rank-sum test. 


Delta: | 13,10, 1, —4, 
Alaska: |15, 8,3, —1, —2, 4 (minutes late) 


15-30. The manufacturer of a hot tub is interested in testing 
two different heating elements for his product. The element 
that produces the maximum heat gain after 15 minutes would 
be preferable. He obtains 10 samples of each heating unit and 
tests each one. The heat gain after 15 minutes (in ^F) follows. 


0, 9 (minutes late) 





Is there any reason to suspect that one unit is superior to the 
other? Use a — 0.05 and the Wilcoxon rank-sum test. 


Unit 1: | 25, 27, 29, 31, 30, 26, 24, 32, 33, 38 





Unit 2:131, 33, 32, 35, 34, 29, 38, 35, 37, 30 


15-31. Use the normal approximation for the Wilcoxon 
rank-sum test for the problem in Exercise 15-28. Assume that 
a = 0.05. Find the approximate P-value for this test statistic. 


15-32. Use the normal approximation for the Wilcoxon 
rank-sum test for the heat gain experiment in Exercise 15-30. 
Assume that a = 0.05. What is the approximate P-value for 
this test statistic? 


15-33. Consider the chemical etch rate data in Exercise 10-21. 
Use the Wilcoxon rank-sum test to investigate the claim that 
the mean etch rate is the same for both solutions. If a = 0.05, 
what are your conclusions? 


15-34. Use the Wilcoxon rank-sum test for the pipe deflec- 
tion temperature experiment described in Exercise 10-20. If 
a = 0.05, what are your conclusions? 


15-35. Use the normal approximation for the Wilcoxon 
rank-sum test for the problem in Exercise 10-21. Assume that 
a = 0.05. Find the approximate P-value for this test. 
15-36. Use the normal approximation for the Wilcoxon 
rank-sum test for the problem in Exercise 10-20. Assume that 
a = 0.05. Find the approximate P-value for this test. 
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15-5 NONPARAMETRIC METHODS IN THE ANALYSIS 
OF VARIANCE 


15-5.1 Kruskal-Wallis Test 


The single-factor analysis of variance model developed in Chapter 13 for comparing a 
population means is 


j= 
fy — pg tte i E (15-9) 


In this model, the error terms e; are assumed to be normally and independently distributed with 
mean zero and variance o”. The assumption of normality led directly to the F-test described in 
Chapter 13. The Kruskal-Wallis test is a nonparametric alternative to the F-test; it requires only 
that the e; have the same continuous distribution for all factor levels i = 1, 2, .. . , a. 

Suppose that N = Y n; is the total number of observations. Rank all N observations 
from smallest to largest, and assign the smallest observation rank 1, the next smallest 
rank 2, ..., and the largest observation rank N. If the null hypothesis 


Hog: jq = M2 755 = pa 


is true, the N observations come from the same distribution, and all possible assignments of 
the N ranks to the a samples are equally likely, we would expect the ranks 1, 2,. .., N to be 
mixed throughout the a samples. If, however, the null hypothesis Hj, is false, some samples 
will consist of observations having predominantly small ranks, while other samples will con- 
sist of observations having predominantly large ranks. Let R; be the rank of observation Y;, 
and let R;. and R;. denote the total and average of the n; ranks in the ith treatment. When the 
null hypothesis is true, 


N+ 1 
ER) = 7 


and 


The Kruskal-Wallis test statistic measures the degree to which the actual observed average 
ranks R;. differ from their expected value (N + 1)/2. If this difference is large, the null 
hypothesis Ho is rejected. The test statistic is 


(15-10) 
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EXAMPLE 15-7 


An alternative computing formula that is occasionally more convenient is 





LTD) 





We would usually prefer Equation 15-11 to Equation 15-10 because it involves the rank totals 
rather than the averages. 

The null hypothesis H, should be rejected if the sample data generate a large value for 
H. The null distribution for H has been obtained by using the fact that under Ho each pos- 
sible assignment of ranks to the a treatments is equally likely. Thus, we could enumerate 
all possible assignments and count the number of times each value of H occurs. This has 
led to tables of the critical values of H, although most tables are restricted to small sample 
sizes n;. In practice, we usually employ the following large-sample approximation. 
Whenever Hj, is true and either 


a=3 and n,=6 fori=1,2,3 
a>3 and n,=5 fori=1,2,...,a 


H has approximately a chi-square distribution with a — 1 degrees of freedom. Since large val- 
ues of H imply that H is false, we will reject Hp if the observed value 


h = X5 .a-1 
The test has approximate significance level a. 


Ties in the Kruskal-Wallis Test 
When observations are tied, assign an average rank to each of the tied observations. When 
there are ties, we should replace the test statistic in Equation 15-11 by 


a 2 
l | Ri. MN + IY" | (15-12) 


26n 4 


where n; is the number of observations in the ith treatment, N is the total number of observa- 
tions, and 


(15-13) 
i=1 j=] 


Note that S? is just the variance of the ranks. When the number of ties is moderate, there will 
be little difference between Equations 15-11 and 15-12 and the simpler form (Equation 15-11) 
may be used. 


Montgomery (2001) presented data from an experiment in which five different levels of 
cotton content in a synthetic fiber were tested to determine whether cotton content has any 
effect on fiber tensile strength. The sample data and ranks from this experiment are shown in 
Table 15-4. We will apply the Kruskal-Wallis test to these data, using a = 0.01. 
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Table 15-4 Data and Ranks for the Tensile Testing Experiment 


Percentage 
of Cotton 5 
15 Vy 7 7 15 11 9 
ranks ry 2.0 2:0 12.5 7.0 4.0 24.3 
20 yj 12 17 12 18 18 
ranks rp; 9.9 14.0 9.5 16.5 16.5 66.0 
25 Ysj 14 18 18 19 19 
ranks l3; 11.0 16.5 16.5 20.5 20.5 85.0 
30 Ya I9 23 22 19 23 
ranks l'a; 20.5 25.0 23.0 20.5 24.0 113.0 
35 sj 1 10 11 15 11 
ranks rs; 2.0 5.0 7.0 12.5 7.0 33.5 


Since there is a fairly large number of ties, we use Equation 15-12 as the test statistic. 
From Equation 15-13 we find 





a n; 2 
des xl Tye Si DES 
N-I Aj : 4 
|] eun 25(26) 

= 24 2 
— 53.54 


and the test statistic 1s 





S Li 
= 5245.7 E 
53.54 4 
= 19.06 


Since h > X6.014 = 13.28, we would reject the null hypothesis and conclude that treatments 
differ. This same conclusion is given by the usual analysis of variance F-test. 


15-5.2 Rank Transformation 


The procedure used in the previous section whereby the observations are replaced by their 
ranks is called the rank transformation. It is a very powerful and widely useful technique. 
If we were to apply the ordinary F-test to the ranks rather than to the original data, we would 
obtain 


Hia 1) 


"0 = iy — 1 — AYN — a) 


as the test statistic. Note that as the Kruskal-Wallis statistic H increases or decreases, F also 
increases or decreases. Now, since the distribution of Fy is approximated by the F-distribution, 
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the Kruskal-Wallis test is approximately equivalent to applying the usual analysis of variance 
to the ranks. 

The rank transformation has wide applicability in experimental design problems for which 
no nonparametric alternative to the analysis of variance exists. If the data are ranked and the or- 
dinary F-test is applied, an approximate procedure results, but one that has good statistical 
properties. When we are concerned about the normality assumption or the effect of outliers or 
"wild" values, we recommend that the usual analysis of variance be performed on both the 
original data and the ranks. When both procedures give similar results, the analysis of variance 
assumptions are probably satisfied reasonably well, and the standard analysis is satisfactory. 
When the two procedures differ, the rank transformation should be preferred since it 1s less 
likely to be distorted by nonnormality and unusual observations. In such cases, the experi- 
menter may want to investigate the use of transformations for nonnormality and examine the 
data and the experimental procedure to determine whether outliers are present and why they 


have occurred. 


EXERCISES FOR SECTION 15-5 


15-37. Montgomery (2001) presented the results of an ex- 
periment to compare four different mixing techniques on the 
tensile strength of portland cement. The results are shown in 
the following table. Is there any indication that mixing tech- 
nique affects the strength? Use a — 0.05. 


Mixing 
Technique Tensile Strength (Ib/in.?) 
1 3129 3000 2865 2890 
2 3200 3000 2975 3150 
3 2800 2900 2985 3050 
4 2600 2700 2600 2765 


15-38. An article in the Quality Control Handbook, 3rd edi- 
tion (McGraw-Hill, 1962) presents the results of an experiment 
performed to investigate the effect of three different conditioning 
methods on the breaking strength of cement briquettes. The data 
are shown in the following table. Using a = 0.05, is there any 
indication that conditioning method affects breaking strength? 


Conditioning 
Method Breaking Strength (Ib/in.?) 
1 553 550 568 541 537 
2 553 599 579 545 540 
3 492 530 528 510 571 
15-39. In Statistics for Research (John Wiley & Sons, 


1983), S. Dowdy and S. Wearden presented the results of an 
experiment to measure the performance of hand-held chain 
saws. The experimenters measured the kickback angle 
through which the saw is deflected when it begins to cut a 
3-inch stock synthetic board. Shown in the following table are 


deflection angles for five saws chosen at random from each of 
four different manufacturers. Is there any evidence that the 
manufacturers' products differ with respect to kickback angle? 
Use a = 0.01. 


Manufacturer Kickback Angle 
A 42 17 24 39 43 
B 28 50 44 32 61 
e 57 45 48 41 54 
D 29 40 22 34 30 


15-40. Consider the data in Exercise 13-2. Use the 
Kruskal-Wallis procedure with a = 0.05 to test for differ- 
ences between mean uniformity at the three different gas flow 
rates. 


15-41. Find the approximate P-value for the test statistic 
computed in Exercise 15-37. 


15-42. Find the approximate P-value for the test statistic 
computed in Exercise 15-40. 


Supplemental Exercises 


15-43. The surface finish of 10 metal parts produced in a 
grinding process is as follows: (in microinches): 10.32, 9.68, 
9.92, 10.10, 10.20, 9.87, 10.14, 9.74, 9.80, 10.26. Do the data 
support the claim that the median value of surface finish is 10 
microinches? Use the sign test with a = 0.05. What is the 
P-value for this test? 

15-44. Use the normal appoximation for the sign test for 
the problem in Exercise 15-43. Find the P-value for this test. 
What are your conclusions if a. = 0.05? 

15-45. Fluoride emissions (in ppm) from a chemical plant 
are monitored routinely. The following are 15 observations 
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based on air samples taken randomly during one month of pro- 
duction: 7, 3, 4,2, 5, 6, 9, 8, 7, 3, 4, 4, 3, 2, 6. Can you claim 
that the median fluoride impurity level is less than 6 ppm? 
State and test the appropriate hypotheses using the sign test 
with a = 0.05. What is the P-value for this test? 


15-46. Use the normal approximation for the sign test for 
the problem in Exercise 15-45. What is the P-value for this 
test? 

15-47. Consider the data in Exercise 10-42. Use the sign 
test with a = 0.05 to determine whether there is a difference 
in median impurity readings between the two analytical 
tests. 


15-48. Consider the data in Exercise 15-43. Use the 
Wilcoxon signed-rank test for this problem with a = 0.05. 
What hypotheses are being tested in this problem? 


15-49. Consider the data in Exercise 15-45. Use the 
Wilcoxon signed-rank test for this problem with a = 0.05. 
What conclusions can you draw? Does the hypothesis you are 
testing now differ from the one tested originally in Exercise 
15-45? 

15-50. Use the Wilcoxon signed-rank test with a = 0.05 for 
the diet-modification experiment described in Exercise 10-41. 
State carefully the conclusions that you can draw from this 
experiment. 


15-51. Use the Wilcoxon rank-sum test with a = 0.01 for 
the fuel-economy study described in Exercise 10-83. What 
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conclusions can you draw about the difference in mean 
mileage performance for the two vehicles in this study? 


15-52. Use the large-sample approximation for the Wilcoxon 
rank-sum test for the fuel-economy data in Exercise 10-83. What 
conclusions can you draw about the difference in means if 
a = 0.01? Find the P-value for this test. 


15-53. Use the Wilcoxon rank-sum test with a = 0.025 for 
the fill-capability experiment described in Exercise 10-85. 
What conclusions can you draw about the capability of the two 
fillers? 


15-54. Use the large-sample approximation for the 
Wilcoxon rank-sum test with a = 0.025 for the fill-capability 
experiment described in Exercise 10-85. Find the P-value for 
this test. What conclusions can you draw? 


15-55. Consider the contact resistance experiment in 
Exercise 13-31. Use the Kruskal-Wallis test to test for differ- 
ences in mean contact resistance among the three alloys. Ifa = 
0.01, what are your conclusions? Find the P-value for this test. 


15-56. Consider the experiment described in Exercise 13-28. 
Use the Kruskal-Wallis test for this experiment with a = 0.05. 
What conclusions would you draw? Find the P-value for 
this test. 

15-57. Consider the bread quality experiment in Exercise 
13-35. Use the Kruskal-Wallis test with a = 0.01 to analyze the 
data from this experiment. Find the P-value for this test. What 
conclusions can you draw? 


MIND-EXPANDING EXERCISES 


15-58. For the large-sample approximation to the 
Wilcoxon signed-rank test, derive the mean and stan- 
dard deviation of the test statistic used in the procedure. 


15-59. Testing for Trends. A turbocharger wheel is 
manufactured using an investment-casting process. The 
shaft fits into the wheel opening, and this wheel opening 
is a critical dimension. As wheel wax patterns are formed, 
the hard tool producing the wax patterns wears. This may 
cause growth in the wheel-opening dimension. Ten 
wheel-opening measurements, in time order of produc- 
tion, are 4.00 (millimeters), 4.02, 4.03, 4.01, 4.00, 4.03, 
4.04, 4.02, 4.03, 4.03. 

(a) Suppose that p is the probability that observation 
X;,5 exceeds observation X;. If there is no upward or 
downward trend, X;,; is no more or less likely to ex- 
ceed X; or lie below X;. What is the value of p? 

(b) Let V be the number of values of i for which 
X;,5 > X;. If there is no upward or downward trend 


in the measurements, what is the probability distri- 
bution of V? 

(c) Use the data above and the results of parts (a) and 
(b) to test Hp: there is no trend, versus H,: there is 
upward trend. Use a = 0.05. 


Note that this test is a modification of the sign test. It 


was developed by Cox and Stuart. 


15-60. Consider the Wilcoxon signed-rank test, and 

suppose that n = 5. Assume that Hp: ju = p is true. 

(a) How many different sequences of signed ranks are 
possible? Enumerate these sequences. 

(b) How many different values of W" are there? Find 
the probability associated with each value of W'. 

(c) Suppose that we define the critical region of the test 
to be to reject Hy if w^ > w* and w* = 13. What is 
the approximate a level of this test? 

(d) Does this exercise show how the critical values for the 
Wilcoxon signed-rank test were developed? Explain. 
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IMPORTANT TERMS AND CONCEPTS 


In the E-book, click on any 
term or concept below to 
go to that subject. 

Asymptotic relative 
efficiency 


Kruskal-Wallis test 


Nonparametric or 
distribution free 
procedures 

Normal approximation 
to nonparametric 
tests 

Paired observations 


Ranks 

Rank transformation in 
ANOVA 

Sign test 

Tests on the mean of a 
symmetric continu- 
ous distribution 


Tests on the median of 
a continuous 
distribution 

Wilcoxon rank-sum test 

Wilcoxon signed-rank 
test 


Statistical Quality 
Control 


CHAPTER OUTLINE 


16-1 QUALITY IMPROVEMENT 


AND STATISTICS 


16-2 STATISTICAL QUALITY 


CONTROL 


16-3 STATISTICAL PROCESS 


CONTROL 


16-4 INTRODUCTION TO 


CONTROL CHARTS 
16-4.1 Basic Principles 


16-4.2 Design of a Control 
Chart 


16-4.3 Rational Subgroups 


16-4.4 Analysis of Patterns on 
Control Charts 


16-5 X AND R OR S CONTROL 


CHARTS 


LEARNING OBJECTIVES 


16-6 


16-7 
16-8 


16-9 


16-10 


16-11 


16-12 


CONTROL CHARTS FOR 
INDIVIDUAL MEASUREMENTS 


PROCESS CAPABILITY 


ATTRIBUTE CONTROL 
CHARTS 


16-8.1 P Chart (Control Chart for 


Proportions) 


16-8.2 U Chart (Control Chart for 
Defects per Unit) 


CONTROL CHART 
PERFORMANCE 


CUMULATIVE SUM CONTROL 
CHART 


OTHER SPC PROBLEM-SOLVING 
TOOLS 


IMPLEMENTING SPC 


After careful study of this chapter, you should be able to do the following: 


1. 
Lis 


Ui 


Understand the role of statistical tools in quality improvement 


Understand the different types of variability, rational subgroups, and how a control chart is used 


to detect assignable causes 


. Understand the general form of a Shewhart control chart and how to apply zone rules (such as 


the Western Electric rules) and pattern analysis to detect assignable causes 


. Calculate and interpret process capability ratios 


. Construct and interpret control charts for variables such as X, R, S, and individuals charts 


. Construct and interpret control charts for attributes such as P and U charts 
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7. Calculate the ARL performance for a Shewhart control chart 
8. Construct and interpret a cumulative sum control chart 


9. Use other statistical process control problem-solving tools 


Answers for most odd numbered exercises are at the end of the book. Answers to exercises whose 
numbers are surrounded by a box can be accessed in the e-text by clicking on the box. Complete 
worked solutions to certain exercises are also available in the e-text. These are indicated in the 
Answers to Selected Exercises section by a box around the exercise number. Exercises are also 
available for some of the text sections that appear on CD only. These exercises may be found within 
the e- Text immediately following the section they accompany. 


16-1 QUALITY IMPROVEMENT AND STATISTICS 


The quality of products and services has become a major decision factor in most businesses 
today. Regardless of whether the consumer is an individual, a corporation, a military defense 
program, or a retail store, when the consumer is making purchase decisions, he or she is 
likely to consider quality of equal importance to cost and schedule. Consequently, quality 
improvement has become a major concern to many U.S. corporations. This chapter is about 
statistical quality control, a collection of tools that are essential in quality-improvement 
activities. 

Quality means fitness for use. For example, you or I may purchase automobiles that we 
expect to be free of manufacturing defects and that should provide reliable and economical 
transportation, a retailer buys finished goods with the expectation that they are properly pack- 
aged and arranged for easy storage and display, or a manufacturer buys raw material and 
expects to process it with no rework or scrap. In other words, all consumers expect that the 
products and services they buy will meet their requirements. Those requirements define 
fitness for use. 

Quality or fitness for use is determined through the interaction of quality of design and 
quality of conformance. By quality of design we mean the different grades or levels of 
performance, reliability, serviceability, and function that are the result of deliberate engi- 
neering and management decisions. By quality of conformance, we mean the systematic re- 
duction of variability and elimination of defects until every unit produced is identical and 
defect-free. 

Some confusion exists in our society about quality improvement; some people still think that 
it means gold-plating a product or spending more money to develop a product or process. This 
thinking is wrong. Quality improvement means the systematic elimination of waste. Examples 
of waste include scrap and rework in manufacturing, inspection and testing, errors on documents 
(such as engineering drawings, checks, purchase orders, and plans), customer complaint hotlines, 
warranty costs, and the time required to do things over again that could have been done right the 
first time. A successful quality-improvement effort can eliminate much of this waste and lead to 
lower costs, higher productivity, increased customer satisfaction, increased business reputation, 
higher market share, and ultimately higher profits for the company. 

Statistical methods play a vital role in quality improvement. Some applications are out- 
lined below: 


1. In product design and development, statistical methods, including designed exper- 
iments, can be used to compare different materials, components, or ingredients, and 
to help determine both system and component tolerances. This application can sig- 
nificantly lower development costs and reduce development time. 
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2. Statistical methods can be used to determine the capability of a manufacturing 
process. Statistical process control can be used to systematically improve a process 
by reducing variability. 

3. Experimental design methods can be used to investigate improvements in the 
process. These improvements can lead to higher yields and lower manufacturing 
costs. 


4. Life testing provides reliability and other performance data about the product. This 
can lead to new and improved designs and products that have longer useful lives and 
lower operating and maintenance costs. 


Some of these applications have been illustrated in earlier chapters of this book. It is 
essential that engineers, scientists, and managers have an in-depth understanding of these 
Statistical tools in any industry or business that wants to be a high-quality, low-cost pro- 
ducer. In this chapter we provide an introduction to the basic methods of statistical quality 
control that, along with experimental design, form the basis of a successful quality- 
improvement effort. 


16-2 STATISTICAL QUALITY CONTROL 


The field of statistical quality control can be broadly defined as those statistical and engineer- 
ing methods that are used in measuring, monitoring, controlling, and improving quality. 
Statistical quality control is a field that dates back to the 1920s. Dr. Walter A. Shewhart of the 
Bell Telephone Laboratories was one of the early pioneers of the field. In 1924 he wrote a 
memorandum showing a modern control chart, one of the basic tools of statistical process 
control. Harold E Dodge and Harry G. Romig, two other Bell System employees, provided 
much of the leadership in the development of statistically based sampling and inspection 
methods. The work of these three men forms much of the basis of the modern field of statis- 
tical quality control. World War II saw the widespread introduction of these methods to U.S. 
industry. Dr. W. Edwards Deming and Dr. Joseph M. Juran have been instrumental in spread- 
ing statistical quality-control methods since World War II. 

The Japanese have been particularly successful in deploying statistical quality-control 
methods and have used statistical methods to gain significant advantage over their 
competitors. In the 1970s American industry suffered extensively from Japanese (and other 
foreign) competition; that has led, in turn, to renewed interest in statistical quality-control 
methods in the United States. Much of this interest focuses on statistical process control 
and experimental design. Many U.S. companies have begun extensive programs to 
implement these methods in their manufacturing, engineering, and other business 
organizations. 


16-3 STATISTICAL PROCESS CONTROL 


It is impractical to inspect quality into a product; the product must be built right the first 
time. The manufacturing process must therefore be stable or repeatable and capable of op- 
erating with little variability around the target or nominal dimension. Online statistical 
process control is a powerful tool for achieving process stability and improving capability 
through the reduction of variability. 
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It is customary to think of statistical process control (SPC) as a set of problem-solving 
tools that may be applied to any process. The major tools of SPC* are 
Histogram 
Pareto chart 
Cause-and-effect diagram 
Defect-concentration diagram 


Control chart 


dE E LE e m 


Scatter diagram 
7. Check sheet 


Although these tools are an important part of SPC, they comprise only the technical aspect 
of the subject. An equally important element of SPC is attitude—a desire of all individu- 
als in the organization for continuous improvement in quality and productivity through the 
systematic reduction of variability. The control chart is the most powerful of the SPC 
tools. 


16-4 INTRODUCTION TO CONTROL CHARTS 


16-4.1 Basic Principles 


In any production process, regardless of how well-designed or carefully maintained it is, a 
certain amount of inherent or natural variability will always exist. This natural variability or 
"background noise" is the cumulative effect of many small, essentially unavoidable causes. 
When the background noise in a process is relatively small, we usually consider it an accept- 
able level of process performance. In the framework of statistical quality control, this natural 
variability is often called a “stable system of chance causes." A process that is operating with 
only chance causes of variation present is said to be in statistical control. In other words, the 
chance causes are an inherent part of the process. 

Other kinds of variability may occasionally be present in the output of a process. This 
variability in key quality characteristics usually arises from three sources: improperly ad- 
justed machines, operator errors, or defective raw materials. Such variability is generally large 
when compared to the background noise, and it usually represents an unacceptable level of 
process performance. We refer to these sources of variability that are not part of the chance 
cause pattern as assignable causes. A process that is operating in the presence of assignable 
causes is said to be out of control.' 

Production processes will often operate in the in-control state, producing acceptable 
product for relatively long periods of time. Occasionally, however, assignable causes will 
occur, seemingly at random, resulting in a "shift" to an out-of-control state where a large pro- 
portion of the process output does not conform to requirements. A major objective of statisti- 
cal process control is to quickly detect the occurrence of assignable causes or process shifts 
so that investigation of the process and corrective action may be undertaken before many 





* Some prefer to include the experimental design methods discussed previously as part of the SPC toolkit. We did not 
do so, because we think of SPC as an online approach to quality improvement using techniques founded on passive 
observation of the process, while design of experiments is an active approach in which deliberate changes are made 
to the process variables. As such, designed experiments are often referred to as offline quality control. 


"The terminology chance and assignable causes was developed by Dr. Walter A. Shewhart. Today, some writers use 
common cause instead of chance cause and special cause instead of assignable cause. 


Figure 16-1 A typi- 
cal control chart. 
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Upper control limit 


A AM /W 


Lower CORTO limit 


Sample quality characteristic 


Sample number or time 


nonconforming units are manufactured. The control chart is an online process-monitoring 
technique widely used for this purpose. 

Recall the following from Chapter 1. Figure 1-10 illustrates that adjustments to common 
causes of variation increase the variation of a process whereas Fig. 1-11 illustrates that actions 
should be taken in response to assignable causes of variation. Control charts may also be used to 
estimate the parameters of a production process and, through this information, to determine the 
capability of a process to meet specifications. The control chart can also provide information 
that is useful in improving the process. Finally, remember that the eventual goal of statistical 
process control is the e/imination of variability in the process. Although it may not be possible 
to eliminate variability completely, the control chart helps reduce it as much as possible. 

A typical control chart is shown in Fig. 16-1, which is a graphical display of a quality char- 
acteristic that has been measured or computed from a sample versus the sample number or time. 
Often, the samples are selected at periodic intervals such as every hour. The chart contains a cen- 
ter line (CL) that represents the average value of the quality characteristic corresponding to the 
in-control state. (That is, only chance causes are present.) Two other horizontal lines, called the 
upper control limit (UCL) and the lower control limit (LCL), are also shown on the chart. These 
control limits are chosen so that if the process is in control, nearly all of the sample points will 
fall between them. In general, as long as the points plot within the control limits, the process is 
assumed to be in control, and no action is necessary. However, a point that plots outside of the 
control limits 1s interpreted as evidence that the process is out of control, and investigation and 
corrective action are required to find and eliminate the assignable cause or causes responsible for 
this behavior. The sample points on the control chart are usually connected with straight-line 
segments so that it 1s easier to visualize how the sequence of points has evolved over time. 

Even if all the points plot inside the control limits, if they behave in a systematic or non- 
random manner, this is an indication that the process is out of control. For example, if 18 of 
the last 20 points plotted above the center line but below the upper control limit and only two 
of these points plotted below the center line but above the lower control limit, we would be 
very suspicious that something was wrong. If the process is in control, all the plotted points 
should have an essentially random pattern. Methods designed to find sequences or nonrandom 
patterns can be applied to control charts as an aid in detecting out-of-control conditions. A par- 
ticular nonrandom pattern usually appears on a control chart for a reason, and if that reason 
can be found and eliminated, process performance can be improved. 

There is a close connection between control charts and hypothesis testing. Essentially, the 
control chart is a test of the hypothesis that the process is in a state of statistical control. A 
point plotting within the control limits is equivalent to failing to reject the hypothesis of sta- 
tistical control, and a point plotting outside the control limits 1s equivalent to rejecting the hy- 
pothesis of statistical control. 
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Figure 16-2 Process 
improvement using 
the control chart. 
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We give a general model for a control chart. Let J/ be a sample statistic that measures some 
quality characteristic of interest, and suppose that the mean of W is wy and the standard deviation 
of Wis oy.* Then the center line, the upper control limit, and the lower control limit become 


CL = py 





where k is the “distance” of the control limits from the center line, expressed in standard 
deviation units. A common choice is k = 3. This general theory of control charts was first pro- 
posed by Dr. Walter A. Shewhart, and control charts developed according to these principles 
are often called Shewhart control charts. 

The control chart is a device for describing exactly what is meant by statistical control; as 
such, it may be used in a variety of ways. In many applications, it is used for online process 
monitoring. That is, sample data are collected and used to construct the control chart, and if 
the sample values of x (say) fall within the control limits and do not exhibit any systematic 
pattern, we say the process is in control at the level indicated by the chart. Note that we may 
be interested here in determining both whether the past data came from a process that was in 
control and whether future samples from this process indicate statistical control. 

The most important use of a control chart is to improve the process. We have found that, 
generally 


1. Most processes do not operate in a state of statistical control. 


2. Consequently, the routine and attentive use of control charts will identify assignable 
causes. If these causes can be eliminated from the process, variability will be reduced 
and the process will be improved. 


This process-improvement activity using the control chart is illustrated in Fig. 16-2. Notice that: 





* Note that “sigma” refers to the standard deviation of the statistic plotted on the chart (i.e., oj), not the standard 
deviation of the quality characteristic. 
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3. The control chart will only detect assignable causes. Management, operator, and en- 
gineering action will usually be necessary to eliminate the assignable cause. An ac- 
tion plan for responding to control chart signals is vital. 


In identifying and eliminating assignable causes, it is important to find the underlying root 
cause of the problem and to attack it. A cosmetic solution will not result in any real, long-term 
process improvement. Developing an effective system for corrective action is an essential 
component of an effective SPC implementation. 

We may also use the control chart as an estimating device. That is, from a control chart 
that exhibits statistical control, we may estimate certain process parameters, such as the mean, 
standard deviation, and fraction nonconforming or fallout. These estimates may then be used 
to determine the capability of the process to produce acceptable products. Such process 
capability studies have considerable impact on many management decision problems that oc- 
cur over the product cycle, including make-or-buy decisions, plant and process improvements 
that reduce process variability, and contractual agreements with customers or suppliers re- 
garding product quality. 

Control charts may be classified into two general types. Many quality characteristics can 
be measured and expressed as numbers on some continuous scale of measurement. In such 
cases, it is convenient to describe the quality characteristic with a measure of central tendency 
and a measure of variability. Control charts for central tendency and variability are collec- 
tively called variables control charts. The X chart is the most widely used chart for moni- 
toring central tendency, whereas charts based on either the sample range or the sample stan- 
dard deviation are used to control process variability. Many quality characteristics are not 
measured on a continuous scale or even a quantitative scale. In these cases, we may judge 
each unit of product as either conforming or nonconforming on the basis of whether or not it 
possesses certain attributes, or we may count the number of nonconformities (defects) 
appearing on a unit of product. Control charts for such quality characteristics are called 
attributes control charts. 

Control charts have had a long history of use in industry. There are at least five reasons 
for their popularity: 


1. Control charts are a proven technique for improving productivity. A successful 
control chart program will reduce scrap and rework, which are the primary produc- 
tivity killers in any operation. If you reduce scrap and rework, productivity increases, 
cost decreases, and production capacity (measured in the number of good parts per 
hour) increases. 


2. Control charts are effective in defect prevention. The control chart helps keep the 
process in control, which is consistent with the “do it right the first time” philosophy. 
It is never cheaper to sort out the “good” units from the “bad” later on than it is to 
build them correctly initially. If you do not have effective process control, you are 
paying someone to make a nonconforming product. 


3. Control charts prevent unnecessary process adjustments. A control chart can dis- 
tinguish between background noise and abnormal variation; no other device, including 
a human operator, is as effective in making this distinction. If process operators adjust 
the process based on periodic tests unrelated to a control chart program, they will often 
overreact to the background noise and make unneeded adjustments. These unnecessary 
adjustments can result in a deterioration of process performance. In other words, the 
control chart is consistent with the “if it isn’t broken, don’t fix it" philosophy. 


4. Control charts provide diagnostic information. Frequently, the pattern of points 
on the control chart will contain information that is of diagnostic value to an 


602 CHAPTER 16 STATISTICAL QUALITY CONTROL 


experienced operator or engineer. This information allows the operator to implement 
a change in the process that will improve its performance. 


5. Control charts provide information about process capability. The control chart 
provides information about the value of important process parameters and their sta- 
bility over time. This allows an estimate of process capability to be made. This in- 
formation is of tremendous use to product and process designers. 


Control charts are among the most effective management control tools, and they are as 
important as cost controls and material controls. Modern computer technology has made it 
easy to implement control charts in any type of process, because data collection and analysis 
can be performed on a microcomputer or a local area network terminal in realtime, online at 
the work center. 


16-4.2 Design of a Control Chart 


Figure 16-3 X con- 
trol chart for piston 
ring diameter. 


To illustrate these ideas, we give a simplified example of a control chart. In manufacturing au- 
tomobile engine piston rings, the inside diameter of the rings is a critical quality characteris- 
tic. The process mean inside ring diameter is 74 millimeters, and it is known that the standard 
deviation of ring diameter is 0.01 millimeters. A control chart for average ring diameter 1s 
shown in Fig. 16-3. Every hour a random sample of five rings is taken, the average ring di- 
ameter of the sample (say x) is computed, and x is plotted on the chart. Because this control 
chart utilizes the sample mean X to monitor the process mean, it is usually called an X con- 
trol chart. Note that all the points fall within the control limits, so the chart indicates that the 
process is in statistical control. 

Consider how the control limits were determined. The process average is 74 millimeters, 
and the process standard deviation is o = 0.01 millimeters. Now if samples of size n = 5 are 
taken, the standard deviation of the sample average X is 


V = 30008 
Uy v = eS SS SS SS f 
Fa V5 


Therefore, if the process is in control with a mean diameter of 74 millimeters, by using 
the central limit theorem to assume that X is approximately normally distributed, we 
would expect approximately 100(1 — o)96 of the sample mean diameters X to fall between 
74 + z,/(0.0045) and 74 — Z,/(0.0045). As discussed above, we customarily choose the 
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constant Z,/7 to be 3, so the upper and lower control limits become 
UCL = 74 + 3(0.0045) = 74.0135 
and 
LCL = 74 — 3(0.0045) = 73.9865 


as shown on the control chart. These are the 3-sigma control limits referred to above. Note that 
the use of 3-sigma limits implies that a = 0.0027; that is, the probability that the point plots 
outside the control limits when the process is in control is 0.0027. The width of the control 
limits is inversely related to the sample size n for a given multiple of sigma. Choosing the con- 
trol limits is equivalent to setting up the critical region for testing the hypothesis 


Ao: u = 74 
Ay: u + 74 


where ø = 0.01 is known. Essentially, the control chart tests this hypothesis repeatedly at dif- 
ferent points in time. 

In designing a control chart, we must specify both the sample size to use and the fre- 
quency of sampling. In general, larger samples will make it easier to detect small shifts in the 
process. When choosing the sample size, we must keep in mind the size of the shift that we are 
trying to detect. If we are interested in detecting a relatively large process shift, we use smaller 
sample sizes than those that would be employed if the shift of interest were relatively small. 

We must also determine the frequency of sampling. The most desirable situation from the 
point of view of detecting shifts would be to take large samples very frequently; however, this 1s 
usually not economically feasible. The general problem is one of allocating sampling effort. That 
is, either we take small samples at short intervals or larger samples at longer intervals. Current in- 
dustry practice tends to favor smaller, more frequent samples, particularly in high-volume man- 
ufacturing processes or where a great many types of assignable causes can occur. Furthermore, 
as automatic sensing and measurement technology develops, it is becoming possible to greatly 
increase frequencies. Ultimately, every unit can be tested as it is manufactured. This capability 
will not eliminate the need for control charts because the test system will not prevent defects. The 
increased data will increase the effectiveness of process control and improve quality. 


16-4.3 Rational Subgroups 


A fundamental idea in the use of control charts is to collect sample data according to what 
Shewhart called the rational subgroup concept. Generally, this means that subgroups or sam- 
ples should be selected so that to the extent possible, the variability of the observations within 
a subgroup should include all the chance or natural variability and exclude the assignable 
variability. Then, the control limits will represent bounds for all the chance variability and not 
the assignable variability. Consequently, assignable causes will tend to generate points that are 
outside of the control limits, while chance variability will tend to generate points that are 
within the control limits. 

When control charts are applied to production processes, the time order of production is a 
logical basis for rational subgrouping. Even though time order is preserved, it is still possible 
to form subgroups erroneously. If some of the observations in the subgroup are taken at the end 
of one 8-hour shift and the remaining observations are taken at the start of the next 8-hour shift, 
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any differences between shifts might not be detected. Time order is frequently a good basis for 
forming subgroups because it allows us to detect assignable causes that occur over time. 

Two general approaches to constructing rational subgroups are used. In the first ap- 
proach, each subgroup consists of units that were produced at the same time (or as closely to- 
gether as possible). This approach is used when the primary purpose of the control chart is to 
detect process shifts. It minimizes variability due to assignable causes within a sample, and it 
maximizes variability between samples if assignable causes are present. It also provides bet- 
ter estimates of the standard deviation of the process in the case of variables control charts. 
This approach to rational subgrouping essentially gives a *snapshot" of the process at each 
point in time where a sample is collected. 

In the second approach, each sample consists of units of product that are representative of 
all units that have been produced since the last sample was taken. Essentially, each subgroup 
is a random sample of all process output over the sampling interval. This method of rational 
subgrouping is often used when the control chart is employed to make decisions about the ac- 
ceptance of all units of product that have been produced since the last sample. In fact, if the 
process shifts to an out-of-control state and then back in control again between samples, it is 
sometimes argued that the first method of rational subgrouping defined above will be ineffec- 
tive against these types of shifts, and so the second method must be used. 

When the rational subgroup is a random sample of all units produced over the sampling 
interval, considerable care must be taken in interpreting the control charts. If the process mean 
drifts between several levels during the interval between samples, the range of observations 
within the sample may consequently be relatively large. It is the within-sample variability that 
determines the width of the control limits on an X chart, so this practice will result in wider 
limits on the X chart. This makes it harder to detect shifts in the mean. In fact, we can often 
make any process appear to be in statistical control just by stretching out the interval between 
observations in the sample. It is also possible for shifts in the process average to cause points 
on a control chart for the range or standard deviation to plot out of control, even though no 
shift in process variability has taken place. 

There are other bases for forming rational subgroups. For example, suppose a process con- 
sists of several machines that pool their output into a common stream. If we sample from this 
common stream of output, it will be very difficult to detect whether or not some of the machines 
are out of control. A logical approach to rational subgrouping here is to apply control chart tech- 
niques to the output for each individual machine. Sometimes this concept needs to be applied to 
different heads on the same machine, different workstations, different operators, and so forth. 

The rational subgroup concept is very important. The proper selection of samples re- 
quires careful consideration of the process, with the objective of obtaining as much useful in- 
formation as possible from the control chart analysis. 


16-4.4 Analysis of Patterns on Control Charts 


A control chart may indicate an out-of-control condition either when one or more points fall be- 
yond the control limits, or when the plotted points exhibit some nonrandom pattern of behavior. 
For example, consider the X chart shown in Fig. 16-4. Although all 25 points fall within the con- 
trol limits, the points do not indicate statistical control because their pattern is very nonrandom 
in appearance. Specifically, we note that 19 of the 25 points plot below the center line, while 
only 6 of them plot above. If the points are truly random, we should expect a more even distri- 
bution of them above and below the center line. We also observe that following the fourth point, 
five points in a row increase in magnitude. This arrangement of points is called a run. Since the 
Observations are increasing, we could call it a run up; similarly, a sequence of decreasing points 


Figure 16-4 An X 
control chart. 


Figure 16-5 An X 
chart with a cyclic pat- 
tern. 
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is called a run down. This control chart has an unusually long run up (beginning with the fourth 
point) and an unusually long run down (beginning with the eighteenth point). 

In general, we define a run as a sequence of observations of the same type. In addition to 
runs up and runs down, we could define the types of observations as those above and below the 
center line, respectively, so two points in a row above the center line would be a run of length 2. 

A run of length 8 or more points has a very low probability of occurrence in a random 
sample of points. Consequently, any type of run of length 8 or more is often taken as a signal 
of an out-of-control condition. For example, eight consecutive points on one side of the cen- 
ter line will indicate that the process is out of control. 

Although runs are an important measure of nonrandom behavior on a control chart, other 
types of patterns may also indicate an out-of-control condition. For example, consider the X 
chart in Fig. 16-5. Note that the plotted sample averages exhibit a cyclic behavior, vet they all 
fall within the control limits. Such a pattern may indicate a problem with the process, such as 
operator fatigue, raw material deliveries, and heat or stress buildup. The yield may be im- 
proved by eliminating or reducing the sources of variability causing this cyclic behavior 
(see Fig. 16-6). In Fig. 16-6, LSL and USL denote the lower and upper specification limits of 
the process. These limits represent bounds within which acceptable product must fall and they 
are often based on customer requirements. 

The problem is one of pattern recognition, that is, recognizing systematic or nonrandom 
patterns on the control chart and identifying the reason for this behavior. The ability to interpret 
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Figure 16-6 (a) 
Variability with the 
cyclic pattern. 

(b) Variability with 

the cyclic pattern elim- 
inated. 


Figure 16-7 The 
Western Electric zone 
rules. 
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a particular pattern in terms of assignable causes requires experience and knowledge of the 
process. That is, we must not only know the statistical principles of control charts, but we 
must also have a good understanding of the process. 

The Western Electric Handbook (1956) suggests a set of decision rules for detecting non- 
random patterns on control charts. Specifically, the Western Electric rules would conclude 
that the process is out of control if either 


1. One point plots outside 3-sigma control limits. 
2. Two out of three consecutive points plot beyond a 2-sigma limit. 


3. Four out of five consecutive points plot at a distance of 1-sigma or beyond from the 
center line. 


4. Eight consecutive points plot on one side of the center line. 


We have found these rules very effective in practice for enhancing the sensitivity of control 
charts. Rules 2 and 3 apply to one side of the center line at a time. That is, a point above the 
upper 2-sigma limit followed immediately by a point below the lower 2-sigma limit would not 
signal an out-of-control alarm. 

Figure 16-7 shows an X control chart for the piston ring process with the 1-sigma, 
2-sigma, and 3-sigma limits used in the Western Electric procedure. Notice that these inner 
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limits (sometimes called warning limits) partition the control chart into three zones A, B, and 
C on each side of the center line. Consequently, the Western Electric rules are sometimes 
called the run rules for control charts. Notice that the last four points fall in zone B or beyond. 
Thus, since four of five consecutive points exceed the 1-sigma limit, the Western Electric 
procedure will conclude that the pattern is nonrandom and the process is out of control. 
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When dealing with a quality characteristic that can be expressed as a measurement, it is cus- 
tomary to monitor both the mean value of the quality characteristic and its variability. Control 
over the average quality is exercised by the control chart for averages, usually called the X 
chart. Process variability can be controlled by either a range chart (R chart) or a standard de- 
viation chart (S chart), depending on how the population standard deviation is estimated. 

Suppose that the process mean and standard deviation u and o are known and that we can 
assume that the quality characteristic has a normal distribution. Consider the X chart. As dis- 
cussed previously, we can use p as the center line for the control chart, and we can place the 
upper and lower 3-sigma limits at 


UCL = y + 3o/ Vn 


LCL = pu — 30/Vn 
CL = wu 





When the parameters and o are unknown, we usually estimate them on the basis of 
preliminary samples, taken when the process is thought to be in control. We recommend the 
use of at least 20 to 25 preliminary samples. Suppose m preliminary samples are available, 
each of size n. Typically, n will be 4, 5, or 6; these relatively small sample sizes are widely 
used and often arise from the construction of rational subgroups. Let the sample mean for the 
ith sample be X;. Then we estimate the mean of the population, u, by the grand mean 





Thus, we may take X as the center line on the X control chart. 

We may estimate o from either the standard deviation or the range of the observations 
within each sample. The sample size is relatively small, so there is little loss in efficiency in 
estimating o from the sample ranges. 

The relationship between the range R of a sample from a normal population with known 
parameters and the standard deviation of that population is needed. Since R is a random 
variable, the quantity W — R/o, called the relative range, is also a random variable. The 
parameters of the distribution of W have been determined for any sample size n. The mean of 
the distribution of W is called d, and a table of d, for various n is given in Appendix Table X. 
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X Control Chart 
(from R) 


The standard deviation of W is called d4. Because R = oW 


Ma = d) Or = dzo (16-4) 


Let R; be the range of the ith sample, and let 





be the average range. Then R is an estimator of wp, and from Equation 16-4 an unbiased 
estimator of a is 





Therefore, we may use as our upper and lower control limits for the X chart 











UCL = X + R LCL = X= R 16-7 
dVn dVn ue 
Define the constant 
An = 2 (16-8) 
^ dy Vn 


Now, once we have computed the sample values x and r, the X control chart may be defined 
as follows: 


The center line and upper and lower control limits for an X control chart are 


UCL =x + Ayr LCL = x — Aor (16-9) 


where the constant A, is tabulated for various sample sizes in Appendix Table X. 





The parameters of the R chart may also be easily determined. The center line will obvi- 
ously be R. To determine the control limits, we need an estimate of Op, the standard deviation 
of R. Once again, assuming the process is in control, the distribution of the relative range, W, 
will be useful. We may estimate op from Equation 16-4 as 


(16-10) 


R Chart 
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and we would use as the upper and lower control limits on the R chart 


UCL = R + SRi $ a) 
7 dy d; 


_ 8d. 3d3\ — 
LCL=R-—R=(1-— JR (16-11) 
dy dy 


Setting D, = 1 — 3d;/d, and D, = 1 + 3d;/d, leads to the following definition. 


The center line and upper and lower control limits for an R chart are 


UCL = Dar CL=r LCL = D3r (16 12) 


where r is the sample average range, and the constants D, and D, are tabulated for 
various sample sizes in Appendix Table X. 





The LCL for an R chart can be a negative number. In that case, it is customary to set LCL 
to zero. Because the points plotted on an R chart are nonnegative, no points can fall below an 
LCL of zero. 

When preliminary samples are used to construct limits for control charts, these limits are 
customarily treated as trial values. Therefore, the m sample means and ranges should be plotted 
on the appropriate charts, and any points that exceed the control limits should be investigated. If 
assignable causes for these points are discovered, they should be eliminated and new limits for 
the control charts determined. In this way, the process may be eventually brought into statistical 
control and its inherent capabilities assessed. Other changes in process centering and dispersion 
may then be contemplated. Also, we often study the R chart first because 1f the process variabil- 
ity is not constant over time the control limits calculated for the X chart can be misleading. 

Rather than base control charts on ranges, a more modern approach is to calculate the 
standard deviation of each subgroup and plot these standard deviations to monitor the process 
standard deviation o. This is called an S chart. When an S chart is used, it is common to use 
these standard deviations to develop control limits for the X chart. Typically, the sample size 
used for subgroups is small (fewer than 10) and in that case there is usually little difference in 
the X chart generated from ranges or standard deviations. However, because computer soft- 
ware is often used to implement control charts, S charts are quite common. Details to construct 
these charts follow. 

In Section 7-2.2 on the CD, it was shown that S is a biased estimator of o. That 1s, 
E(S) = cyo where c, is a constant that is near, but not equal to, 1. Furthermore, a calculation 
similar to the one used for E(S) can derive the standard deviation of the statistic S with the re- 
sult oy 1 — A Therefore, the center line and three-sigma control limits for S are 


LCL = ao — 30 V1 — cj CL = c40. 


UCL = co + 3o V1 — cj (16-13) 
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S Chart 


X Control Chart 
(from S) 


EXAMPLE 16-1 


Assume that there are m preliminary samples available, each of size n, and let S; denote the 
standard deviation of the ith sample. Define 


(16-14) 





Because E(S) = cyo, an unbiased estimator of ø is S/c4 That is, 


QST (16-15) 


A control chart for standard deviations follows. 


UM = 5 d= VISE UCL gc sees Tem (16-16) 





The LCL for an S chart can be a negative number, in that case, it is customary to set LCL to zero. 
When an S chart is used, the estimate for o in Equation 16-15 is commonly used to calculate 
the control limits for an X chart. This produces the following control limits for an X chart. 


S S 


UCL =x+3 (16-17) 


cA Vn cA Vn 





A component part for a jet aircraft engine is manufactured by an investment casting 
process. The vane opening on this casting 1s an important functional parameter of the part. 
We will illustrate the use of X and R control charts to assess the statistical stability of this 
process. Table 16-1 presents 20 samples of five parts each. The values given in the table 
have been coded by using the last three digits of the dimension; that is, 31.6 should be 
0.50316 inch. 

The quantities x = 33.3 and r = 5.8 are shown at the foot of Table 16-1. The value of A, 
for samples of size 5 is 4; — 0.577. Then the trial control limits for the X chart are 


x + Ayr = 33.32 + (0.577)(5.8) = 33.32 + 3.35 
or 


UCL = 36.67 LCL = 29.97 
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Table 16-1 Vane-Opening Measurements 


Sample 
Number x; 25 X3 X4 Xs X ip S 

1 33 29 31 Cy" 33 31:6 4 1.67332 
2 33 31 35 37 31 33.4 6 2.60768 
9 35 37 33 34 36 35.0 4 1.58114 
4 30 31 33 34 33 322 4 1.64317 
5 33 34 35 33 34 33.8 2 0.83666 
6 38 2T 39 40 38 38.4 3 1.14018 
Ü 30 31 32 34 31 31:6 4 1.51658 
8 29 39 38 39 39 36.8 10 4.38178 
9 28 33 35 36 43 35.0 15 5.43139 
10 38 33 32 35 32 34.0 6 2.54951 
11 28 30 28 32 31 29.8 4 1.78885 
12 3l 35 35 35 34 34.0 4 1.73205 
13 2] 32 34 35 37 33.0 10 3.80789 
14 33 53 35 sii 36 34.8 4 1.78885 
15 35 37 32 35 39 35.6 I 2.60768 
16 33 33 27 31 30 30.8 6 2.48998 
17 35 34 34 30 32 33.0 5 2.00000 
18 32 Do 30 30 33 31.6 3 1.51658 
19 25 277 34 27 28 25.2 9 3.42053 
20 23 35 36 33 30 33.8 6 2.38747 
x — 33.32 r=5.8 s = 2.345 


For the R chart, the trial control limits are 


UCL = Dr = (2.115)(5.8) = 12.27 
LCL = Dr = (0)(5.8) = 0 


The X and R control charts with these trial control limits are shown in Fig. 16-8. Notice that 
samples 6, 8, 11, and 19 are out of control on the X chart and that sample 9 is out of control on the 
R chart. (These points are labeled with a “1” because they violate the first Western Electric rule.) 

For the S chart, the value of c, = 0.94. 

Therefore, 


35 3(2.345 
V1 —- a= 302.345) JTT 098 = 2,553 


C4 0.94 


and the trial control limits are 


UCL = 2.345 + 2.553 = 4.898 
LCL = 2.345 — 2.553 = —0.208 


The LCL is set to zero. If 5 is used to determine the control limits for the X chart, 
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Figure 16-8 The X and R control charts for vane opening. 


and this result is nearly the same as from 7. The S chart is shown in Fig. 16-9. Because the con- 
trol limits for the X chart calculated from 5 are nearly the same as from r, the chart is not shown. 

Suppose that all of these assignable causes can be traced to a defective tool in the wax- 
molding area. We should discard these five samples and recompute the limits for the X and R 
charts. These new revised limits are, for the X chart, 


UCL = x + AF = 33.21 + (0.577)(5.0) = 36.10 
LCL = x — AF = 33.21 — (0.577)(5.0) = 30.33 


and for the R chart, 


UCL = D,r = (2.115)(5.0) = 10.57 
LCL = D4r = (0)(5.0) = 0 
9 el 

P 5 UCL = 4.899 » 

Ed j j 

5S3 S= 2.348 / /N /N 

£7 i S ed ew N+ v N : 
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Figure 16-9. The S control chart for vane opening. 
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Figure 16-10 The X and R control charts for vane opening, revised limits. 


The revised control charts are shown in Fig. 16-10. Notice that we have treated the first 
20 preliminary samples as estimation data with which to establish control limits. These lim- 
its can now be used to judge the statistical control of future production. As each new sample 
becomes available, the values of x and r should be computed and plotted on the control charts. 
It may be desirable to revise the limits periodically, even if the process remains stable. The 
limits should always be revised when process improvements are made. 


Computer Construction of X and R Control Charts 

Many computer programs construct X and R control charts. Figures 16-8 and 16-10 show 
charts similar to those produced by Minitab for the vane-opening data. This program will 
allow the user to select any multiple of sigma as the width of the control limits and use the 
Western Electric rules to detect out-of-control points. The program will also prepare a 
summary report as in Table 16-2 and exclude subgroups from the calculation of the control 
limits. 


Table 16-2 Summary Report from Minitab for the Vane-Opening Data 


Test Results for Xbar Chart 

TEST 1. One point more than 3.00 sigmas from center line. 
Test Failed at points: 6 8 11 19 

Test Results for R Chart 

TEST 1. One point more than 3.00 sigmas from center line. 
Test Failed at points: 9 


SZ 
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EXERCISES FOR SECTION 16-5 


16-1. An extrusion die is used to produce aluminum rods. 
The diameter of the rods is a critical quality characteristic. The 
following table shows x and r values for 20 samples of five 
rods each. Specifications on the rods are 0.5035 + 0.0010 inch. 
The values given are the last three digits of the measurement; 
that is, 34.2 is read as 0.50342. 


Sample Do r 
1 34.2 3 
2 31.6 4 
3 31.8 4 
4 33.4 5 
5 35.0 4 
6 32.1 2 
7 32.6 7 
8 33.8 9 
9 34.8 10 

10 38.6 4 
1 35.4 8 
12 34.0 6 
13 36.0 4 
14 212 7 
15 35.2 3 
16 33.4 10 
17 35.0 4 
18 34.4 7 
19 33.9 8 
20 34.0 4 


(a) Using all the data, find trial control limits for X and R 
charts, construct the chart, and plot the data. 

(b) Use the trial control limits from part (a) to identify 
out-of-control points. If necessary, revise your control 
limits, assuming that any samples that plot outside the 
control limits can be eliminated. 


16-2. Twenty-five samples of size 5 are drawn from a 
process at one-hour intervals, and the following data are 
obtained: 


25 25 25 
SX; = 362.75 = Sr, = 860 Ss; = 3.64 


i=1 i-l i=1 


(a) Find trial control limits for X and R charts. 
(b) Repeat part (a) for X and S charts. 


16-3. The pull strength of a wire-bonded lead for an inte- 

grated circuit monitored. The following table provides data for 

20 samples each of size three. 

(a) Use all the data to determine trial control limits for X 
and R charts, construct the control limits, and plot the 
data. 


(b) Use the control limits from part (a) to identify out-of-control 
points. If necessary, revise your control limits assuming that 
any samples that plot outside of the control limits can be 
eliminated. 

(c) Repeat parts (a) and (b) for X and S charts. 


Sample Number Xi X3 X3 
1 15.4 15.6 15.3 
2 15.4 17.1 15.2 
3 16.1 16.1 13.5 
4 13.5 12.5 10.2 
5 18.3 16.1 17.0 
6 19.2 17.2 19.4 
7 14.1 12.4 11.7 
8 15.6 13.3 13.6 
9 13.9 14.9 15.5 

10 18.7 2 20.1 
11 153 13.1 13.7 
12 16.6 18.0 18.0 
13 17.0 15.2 18.1 
14 16.3 16.5 17.7 
15 8.4 Ta 8.4 
16 11.1 13.8 11.9 
17 16.5 LLI 18.5 
18 18.0 14.1 15.9 
19 17.8 17.3 12.0 
20 11.5 10.8 11.2 


16-4. Samples of size n = 6 are collected from a process 

every hour. After 20 samples have been collected, we calcu- 

late x = 20.0 and r/d; = 1.4. 

(a) Find trial control limits for X and R charts. 

(b) If s/c, = 1.5, determine trial control limits for X and S 
charts. 


16-5. Control charts for X and R are to be set up for an im- 
portant quality characteristic. The sample size is n — 5, and x 
and r are computed for each of 35 preliminary samples. The 
summary data are 

35 35 

Sx, = 7805 Yr, = 1200 

j=l i=l 
(a) Find trial control limits for X and R charts. 
(b) Assuming that the process is in control, estimate the 

process mean and standard deviation. 

16-6. Control charts are to be constructed for samples of size 
n = 4, and x and s are computed for each of 20 preliminary sam- 
ples as follows: 


N 
© 


20 


x; = 4460 X s; = 271.6 
i=l 


~ 
Il 
— 


(a) Determine trial control limits for X and S charts. 
(b) Assuming the process is in control, estimate the process 
mean and standard deviation. 


16-7. 


following table, for 25 samples of five parts each. 


Sample 
Number 


= m 
FPF O ooo u ANa A U Nee 


NO NO NO F2 NN N RRR RF Fr Re eR 
MA BWNYNrR DO oO WAN HW t BW rn 


X1 


0.0629 
0.0630 
0.0628 
0.0634 
0.0619 
0.0613 
0.0630 
0.0628 
0.0623 
0.0631 
0.0635 
0.0623 
0.0635 
0.0645 
0.0619 
0.0631 
0.0616 
0.0630 
0.0636 
0.0640 
0.0628 
0.0615 
0.0630 
0.0635 
0.0623 


X2 


0.0636 
0.0631 
0.0631 
0.0630 
0.0628 
0.0629 
0.0639 
0.0627 
0.0626 
0.0631 
0.0630 
0.0630 
0.0631 
0.0640 
0.0644 
0.0627 
0.0623 
0.0630 
0.0631 
0.0635 
0.0625 
0.0625 
0.0632 
0.0629 
0.0629 


X3 


0.0640 
0.0622 
0.0633 
0.0631 
0.0630 
0.0634 
0.0625 
0.0622 
0.0633 
0.0633 
0.0638 
0.0630 
0.0630 
0.0631 
0.0632 
0.0630 
0.0631 
0.0626 
0.0629 
0.0629 
0.0616 
0.0619 
0.0630 
0.0635 
0.0630 


X4 


0.0635 
0.0625 
0.0633 
0.0632 
0.0619 
0.0625 
0.0629 
0.0625 
0.0630 
0.0631 
0.0635 
0.0627 
0.0630 
0.0640 
0.0622 
0.0628 
0.0620 
0.0629 
0.0635 
0.0635 
0.0620 
0.0619 
0.0631 
0.0631 
0.0626 


16-6 CONTROL CHARTS FOR INDIVIDUAL MEASUREMENTS 


The thickness of a metal part is an important qual- 
ity parameter. Data on thickness (in inches) are given in the 


Xs 


0.0640 
0.0627 
0.0630 
0.0633 
0.0625 
0.0628 
0.0627 
0.0627 
0.0624 
0.0630 
0.0633 
0.0629 
0.0630 
0.0642 
0.0635 
0.0629 
0.0625 
0.0628 
0.0634 
0.0634 
0.0623 
0.0622 
0.0630 
0.0633 
0.0628 
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(a) Using all the data, find trial control limits for X and R 
charts, construct the chart, and plot the data. Is the process 
in statistical control? 

(b) Repeat part (a) for X and S charts. 

(c) Use the trial control limits from part (a) to identify out-of- 
control points. List the sample numbers of the out-of-control 
points. 

16-8. The copper content of a plating bath is measured three 

times per day, and the results are reported in ppm. The x and r 

values for 25 days are shown in the following table: 


(a) Using all the data, find trial control limits for X and R charts, 
construct the chart, and plot the data. Is the process in 
statistical control? 

(b) If necessary, revise the control limits computed in part (a), 
assuming that any samples that plot outside the control 
limits can be eliminated. 


Day x r Day xX r 
1 5.45 1.21 14 7.01 1.45 
2 5.39 0.95 15 5.83 1.37 
3 6.85 1.43 16 6.35 1.04 
4 6.74 1.29 17 6.05 0.83 
5 5.83 1.35 18 7.11 1.35 
6 1.22 0.88 19 7:32 1.09 
7 6.39 0.92 20 5.90 1:22 
8 6.50 1.13 21 5.50 0.98 
9 7.15 1.25 22 6.32 1.21 

10 3.92 1.05 23 6.55 0.76 
11 6.45 0.98 24 5.90 1.20 
12 5.38 1.36 25 3.95 1.19 
13 6.03 0.83 


16-6 CONTROL CHARTS FOR INDIVIDUAL 


MEASUREMENTS 


In many situations, the sample size used for process control is n = 1; that is, the sample con- 
sists of an individual unit. Some examples of these situations are as follows: 


L. 


Automated inspection and measurement technology is used, and every unit manu- 
factured is analyzed. 


The production rate is very slow, and it is inconvenient to allow sample sizes of n > 1 
to accumulate before being analyzed. 


Repeat measurements on the process differ only because of laboratory or analysis 
error, as in many chemical processes. 
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4. In process plants, such as papermaking, measurements on some parameters such 
as coating thickness across the roll will differ very little and produce a standard 
deviation that is much too small if the objective 1s to control coating thickness 
along the roll. 


In such situations, the individuals control chart is useful. The control chart for individu- 
als uses the moving range of two successive observations to estimate the process variability. 
The moving range is defined as MR; = |X; — X; . 

An estimate of o is 

. MR MR 
- MI 


—— = 16-18 

d» 1.128 l ) 
because d; = 1.128 when two consecutive observations are used to calculate a moving range. 
It is also possible to establish a control chart on the moving range using D4 and D, for n — 2. 
The parameters for these charts are defined as follows. 


Individuals 
Control Chart The center line and upper and lower control limits for a control chart for individuals 
are 


UC ee ae 
AT. 


CL =x (16-19) 


Penis 
x 7 


and for a control chart for moving ranges 
UCL = Da4mr = 3.267 mr 


CL — mr 
LCL = D4mr = 0 





The procedure is illustrated in the following example. 


EXAMPLE 16-2 Table 16-3 shows 20 observations on concentration for the output of a chemical process. The 
observations are taken at one-hour intervals. If several observations are taken at the same 
time, the observed concentration reading will differ only because of measurement error. Since 
the measurement error is small, only one observation is taken each hour. 

To set up the control chart for individuals, note that the sample average of the 20 concen- 
tration readings is x = 99.1 and that the average of the moving ranges of two observations 
shown in the last column of Table 16-3 is mr = 2.59. To set up the moving-range chart, we 
note that D = 0 and D, = 3.267 for n = 2. Therefore, the moving-range chart has center line 
mr = 2.59, LCL = 0, and UCL = Dymr = (3.267)(2.59) = 8.46. The control chart is shown 
as the lower control chart in Fig. 16-11 on page 618. This control chart was constructed by 
Minitab. Because no points exceed the upper control limit, we may now set up the control chart 
for individual concentration measurements. If a moving range of n = 2 observations is used, 
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d, = 1.128. For the data in Table 16-3 we have 


UCL — x 4 3*2 991 + 3229. = 105.99 
di i 1.128 | 

CL = ¥ = 99.1 

tee =e = 991 951 
D Ms 1428 ^^ 


The control chart for individual concentration measurements is shown as the upper con- 
trol chart in Fig. 16-11. There is no indication of an out-of-control condition. 


The chart for individuals can be interpreted much like an ordinary X control chart. A shift 
in the process average will result in either a point (or points) outside the control limits, or a 
pattern consisting of a run on one side of the center line. 

Some care should be exercised in interpreting patterns on the moving-range chart. The 
moving ranges are correlated, and this correlation may often induce a pattern of runs or cycles 
on the chart. The individual measurements are assumed to be uncorrelated, however, and any 
apparent pattern on the individuals' control chart should be carefully investigated. 

The control chart for individuals is very insensitive to small shifts in the process mean. 
For example, if the size of the shift in the mean is one standard deviation, the average number 
of points to detect this shift is 43.9. This result is shown later in the chapter. While the per- 
formance of the control chart for individuals is much better for large shifts, in many situations 
the shift of interest is not large and more rapid shift detection is desirable. In these cases, we 
recommend the cumulative sum control chart (discussed in Section 16-10) or an exponentially 
weighted moving-average chart (Montgomery, 2001). 


Table 16-3 Chemical Process Concentration Measurements 


Concentration Moving Range 
Observation X mr 

1 102.0 

2 94.8 7.2 

d 98.3 229 

4 98.4 0.1 

5 102.0 3.6 

6 98.5 3.5 

7 99.0 0.5 

8 97:1 1.3 

9 100.0 2:3 
10 98.1 1.9 
11 101.3 3.2 
12 98.7 2.6 
13 101.1 2.4 
14 98.4 Zul 
15 97.0 1.4 
16 96.7 0.3 
17 100.3 3.6 
18 101.4 1.1 
19 97.2 4.2 
20 101.0 3.8 


x = 99.1 mr = 2.59 
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Some individuals have suggested that limits narrower than 3-sigma be used on the chart 
for individuals to enhance its ability to detect small process shifts. This is a dangerous sug- 
gestion, for narrower limits will dramatically increase false alarms such that the charts may be 
ignored and become useless. If you are interested in detecting small shifts, use the cumulative 
sum or exponentially weighted moving-average control chart referred to on the previous page. 


EXERCISES FOR SECTION 16-6 


—. 16-9. ‘Twenty successive hardness measurements are made (a) Using all the data, compute trial control limits for indi- 
Ss on a metal alloy, and the data are shown in the following table. vidual observations and moving-range charts. Construct 
the chart and plot the data. Determine whether the 
process is in statistical control. If not, assume assignable 


_ Observation Hardness Observation Hardness _ causes can be found to eliminate these samples and re- 
1 51 11 51 vise the control limits. 
2 52 12 57 (b) Estimate the process mean and standard deviation for the 
2 54 (2 58 in-control process. 
4 55 14 50 16-10. In a semiconductor manufacturing process CVD 
metal thickness was measured on 30 wafers obtained over ap- 
5 55 15 53 . . . 
proximately two weeks. Data are shown in the following table. 
D al 16 52 (a) Using all the data, compute trial control limits for indi- 
7 52 17 54 vidual observations and moving-range charts. Construct 
8 50 18 50 the chart and plot the data. Determine whether the 
9 51 19 56 process is in statistical control. If not, assume assignable 
10 56 20 53 causes can be found to eliminate these samples and re- 


vise the control limits. 
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(b) Estimate the process mean and standard deviation for the (a) Using all the data, compute trial control limits for individ- 
in-control process. ual observations and moving-range charts. Construct 

the control chart and plot the data. Determine whether 

the process is in statistical control. If not, assume assigna- 


Water = Water x ble causes can be found to eliminate these samples and 
1 16.8 16 15.4 revise the control limits. 
2 14.9 17 14.3 (b) Estimate the process mean and standard deviation for the 
3 18.3 18 16.1 in-control process. 
4 16.5 19 15.8 16-12. The viscosity of a chemical intermediate is meas- , 
5 17.1 20 15.9 ured every hour. Twenty samples each of size n = 1, are in the al 
6 17.4 21 15.2 following table. = 
7 15.9 22 16.7 EP a ne 
8 14.4 23 152 C —X1 2€ 
9 15.0 24 14.7 l 495 
10 15.7 25 17.9 2 491 
11 17.1 26 14.8 3 501 
12 15.9 27 17.0 4 501 
13 16.4 28 16.2 5 512 
14 15.8 29 15.6 6 540 
15 15.4 30 16.3 7 492 
8 504 
— 16-11. The diameter of holes is measured in consecutive 2 542 
order by an automatic sensor. The results of measuring 25 10 508 
~ holes are in the following table. 11 493 
12 507 
Sample Diameter Sample Diameter 13 503 
1 9.94 14 9.99 "i 1 
2 9:95 15 10.12 z T 
3 10.09 16 9.81 a bi 
4 9.08 17 9.73 E don 
5 10.11 18 10.14 = 200 
6 9.09 19 9.06 e 2 
7 10.11 20 10.06 m iiid 
8 9.84 21 10.11 (a) Using all the data, compute trial control limits for individ- 
9 9.82 22 9.95 ual observations and moving-range charts. Determine 
10 10.38 25 9.02 whether the process is in statistical control. If not, assume 
11 9.99 24 10.09 assignable causes can be found to eliminate these samples 
12 10.41 25 9.85 and revise the control limits. 
13 10.36 (b) Estimate the process mean and standard deviation for the 


in-control process. 


16-7 PROCESS CAPABILITY 


It is usually necessary to obtain some information about the process capability, that is, the 
performance of the process when it is operating in control. Two graphical tools, the toler- 
ance chart (or tier chart) and the histogram, are helpful in assessing process capability. 
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Figure 16-12 
Tolerance diagram of 
vane openings. 
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Sample number 


The tolerance chart for all 20 samples from the vane-manufacturing process is shown in 
Fig. 16-12. The specifications on vane opening are 0.5030 — 0.0010 in. In terms of the 
coded data, the upper specification limit is USL — 40 and the lower specification limit is 
LSL = 20, and these limits are shown on the chart in Fig. 16-12. Each measurement is plot- 
ted on the tolerance chart. Measurements from the same subgroup are connected with 
lines. The tolerance chart is useful in revealing patterns over time in the individual meas- 
urements, or it may show that a particular value of x or r was produced by one or two un- 
usual observations in the sample. For example, note the two unusual observations in sam- 
ple 9 and the single unusual observation in sample 8. Note also that it is appropriate to plot 
the specification limits on the tolerance chart, since it is a chart of individual measure- 
ments. It is never appropriate to plot specification limits on a control chart or to use 
the specifications in determining the control limits. Specification limits and control lim- 
its are unrelated. Finally, note from Fig. 16-12 that the process is running off-center from 
the nominal dimension of 30 (or 0.5030 inch). 

The histogram for the vane-opening measurements 1s shown in Fig. 16-13. The observa- 
tions from samples 6, 8, 9, 11, and 19 (corresponding to out of-control points on either the Ed 
or R chart) have been deleted from this histogram. The general impression from examining 
this histogram is that the process is capable of meeting the specification but that it is running 
off-center. 

Another way to express process capability is in terms of an index that is defined as 
follows. 


Figure 16-13 
Histogram for vane 
opening. 
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= 
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LSL 1 USL 
Nominal dimension 


Vane opening 


The process capability ratio (PCR) is 


USE es MESE 
HORE (16-20) 








The numerator of PCR is the width of the specifications. The limits 3o on either side of the 
process mean are sometimes called natural tolerance limits, for these represent limits that an 
in-control process should meet with most of the units produced. Consequently, 60 is often re- 
ferred to as the width of the process. For the vane opening, where our sample size is 5, we 
could estimate o as 


r 5.0 


= = 2.1 
d, 2326 3 





Therefore, the PCR is estimated to be 


USL — LSL = 
POR = = BT 155 
66 6(2.15) 


The PCR has a natural interpretation: (1/PCR)100 is just the percentage of the specifica- 
tions' width used by the process. Thus, the vane-opening process uses approximately 
(1/1.55)100 = 64.5% of the specifications’ width. 

Figure 16-14(a) shows a process for which the PCR exceeds unity. Since the process 
natural tolerance limits lie inside the specifications, very few defective or nonconform- 
ing units will be produced. If PCR = 1, as shown in Fig. 16-14(b), more nonconform- 
ing units result. In fact, for a normally distributed process, if PCR = 1, the fraction 
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PCR»1 


LSL u USL 
ix 3g — | 3g —— — 


(a) 







PCR= 1 
Nonconforming Nonconforming 
units b* units 
u 
—— 35 ——_>|x——- 3e — 
LSL USL 

(b) 

PCR < 1 


Nonconforming 
units 


Nonconforming 
units 


Figure 16-14 
LSL USL 

Process fallout and the H 

process capability ratio 30 — —-|4— — 30 tl 


(PCR). (c) 


nonconforming is 0.27%, or 2700 parts per million. Finally, when the PCR is less than 
unity, as in Fig. 16-14(c), the process is very yield-sensitive and a large number of non- 
conforming units will be produced. 

The definition of the PCR given in Equation 16-19 implicitly assumes that the process 
is centered at the nominal dimension. If the process is running off-center, its actual capa- 
bility will be less than indicated by the PCR. It is convenient to think of PCR as a meas- 
ure of potential capability, that is, capability with a centered process. If the process is not 
centered, a measure of actual capability is often used. This ratio, called PCR;, is defined 
below. 


PCR, 


USE n u = LSD 


PCR, = min (16-21) 


30 30 





In effect, PCR, is a one-sided process capability ratio that is calculated relative to the specifi- 
cation limit nearest to the process mean. For the vane-opening process, we find that the 


EXAMPLE 16-3 
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estimate of the process capability ratio PCR, is 


— . | USL —x X= LSL 
PCI = mn ~ 
36 36 
. | 40 — 33.19 33.19 — 20 
= min | ———--— = 1.06, —————— = 2.04 | = 1.06 
3(2.15) 3(2.15) 


__~ Note that if PCR = PCR,, the process is centered at the nominal dimension. Since 
PCR, = 1.06 for the vane-opening process and PCR = 1.55, the process is obviously run- 
ning off-center, as was first noted in Figs. 16-14 and 16-17. This off-center operation was ul- 
timately traced to an oversized wax tool. Changing the tooling resulted in a substantial im- 
provement in the process (Montgomery, 2001). 

The fractions of nonconforming output (or fallout) below the lower specification limit and 
above the upper specification limit are often of interest. Suppose that the output from a normally 
distributed process in statistical control is denoted as X. The fractions are determined from 


P(X < LSL) = P(Z<(LSL—p)/o) P(X > USL) = P(Z > (USL — p)/o) 


For an electronic manufacturing process a current has specifications of 100 + 10 mil- 
liamperes. The process mean u and standard deviation ø are 107.0 and 1.5, respectively. The 
process mean is nearer to the USL. Consequently, 


PCR = (110 — 90)/6-1.5) = 2.22 and PCR, = (110 — 107)/(3 * 1.5) = 0.67 


The small PCR, indicates that the process is likely to produce currents outside of the specifi- 
cation limits. From the normal distribution in Appendix Table I 


P(X < LSL) = P(Z < (90 — 107)/1.5) = P(Z < -11.33) = 0 
P(X > USL) = P(Z > (110 — 107)/1.5) = P(Z > 2) = 0.023 


For this example, the relatively large probability of exceeding the USL is a warning of po- 
tential problems with this criterion even if none of the measured observations in a preliminary 
sample exceed this limit. We emphasize that the fraction-nonconforming calculation assumes 
that the observations are normally distributed and the process is in control. Departures from 
normality can seriously affect the results. The calculation should be interpreted as an approx- 
imate guideline for process performance. To make matters worse, ju and o need to be esti- 
mated from the data available and a small sample size can result in poor estimates that further 
degrade the calculation. 

Montgomery (2001) provides guidelines on appropriate values of the PCR and a table re- 
lating fallout for a normally distributed process in statistical control to the value of PCR. 
Many U.S. companies use PCR = 1.33 as a minimum acceptable target and PCR = 1.66asa 
minimum target for strength, safety, or critical characteristics. Some companies require that 
internal processes and those at suppliers achieve a PCR, = 2.0. Figure 16-15 illustrates a 
process with PCR — PCR, — 2.0. Assuming a normal distribution, the calculated fallout for 
this process is 0.0018 parts per million. A process with PCR, = 2.0 is referred to as a six- 
sigma process because the distance from the process mean to the nearest specification is six 
standard deviations. The reason that such a large process capability is often required is that it 
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Figure 16-15 Mean 
of a six-sigma process 
shifts by 1.5 standard 

deviations. 
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is difficult to maintain a process mean at the center of the specifications for long periods of 
time. A common model that is used to justify the importance of a six-sigma process is illus- 
trated by referring to Fig. 16-15. If the process mean shifts off-center by 1.5 standard devia- 
tions, the PCR, decreases to 4.50/30 = 1.5. Assuming a normally distributed process, the 
fallout of the shifted process is 3.4 parts per million. Consequently, the mean of a 6-sigma 
process can shift 1.5 standard deviations from the center of the specifications and still main- 
tain a fallout of 3.4 parts per million. 

In addition, some U.S. companies, particularly the automobile industry, have adopted the 
terminology C, = PCR and C,, = PCR,. Because C, has another meaning in statistics (in 
multiple regression) we prefer the traditional notation PCR and PCR,. 

We repeat that process capability calculations are meaningful only for stable 
processes; that is, processes that are in control. A process capability ratio indicates 
whether or not the natural or chance variability in a process is acceptable relative to the 


specifications. 


EXERCISES FOR SECTION 16-7 


16-13. A normally distributed process uses 66.7% of the 
specification band. It is centered at the nominal dimension, lo- 
cated halfway between the upper and lower specification limits. 
(a) Estimate PCR and PCR,. Interpret these ratios. 

(b) What fallout level (fraction defective) is produced? 


16-14. Reconsider Exercise 16-1. Use the revised control 

limits and process estimates. 

(a) Estimate PCR and PCR,. Interpret these ratios. 

(b) What percentage of defectives is being produced by this 
process? 


16-15. Reconsider Exercise 16-2, where the specification 

limits are 14.50 — 0.50. 

(a) What conclusions can you draw about the ability of the 
process to operate within these limits? Estimate the per- 
centage of defective items that will be produced. 

(b) Estimage PCR and PCR,. Interpret these ratios. 


16-16. Reconsider Exercise 16-3. Using the process esti- 
mates, what is the fallout level if the coded specifications are 
10 = 5 mm? Estimate PCR and interpret this ratio. 


16-17. A normally distributed process uses 85% of the spec- 
ification band. It is centered at the nominal dimension, located 
halfway between the upper and lower specification limits. 


(a) Estimate PCR and PCR,. Interpret these ratios. 
(b) What fallout level (fraction defective) is produced? 


16-18.  Reconsider Exercise 16-5. Suppose that the quality 
characteristic is normally distributed with specification at 220 — 
40. What is the fallout level? Estimate PCR and PCR, and in- 
terpret these ratios. 

16-19.  Reconsider Exercise 16-6. Suppose that the variable 
is normally distributed with specifications at 220 = 50. What 
is the proportion out of specifications? Estimate and interpret 
PCR and PCR,. 

16-20. Reconsider Exercise 16-4(a). Assuming that both 
charts exhibit statistical control and that the process specifica- 
tions are at 20 + 5, estimate PCR and PCR, and interpret these 
ratios. 

16-21. Reconsider Exercise 16-8. Given that the specifica- 
tions are at 6.0 + 1.0, estimate PCR and PCR, and interpret 
these ratios. 

16-22. Reconsider 16-7(b). What are the natural tolerance 
limits of this process? 

16-23. Reconsider 16-12. The viscosity specifications are at 
500 + 25. Calculate estimates of the process capability ratios 
PCR and PCR, for this process and provide an interpretation. 
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16-8 ATTRIBUTE CONTROL CHARTS 
16-8.1 P Chart (Control Chart for Proportions) 


Often it is desirable to classify a product as either defective or nondefective on the basis of com- 
parison with a standard. This classification is usually done to achieve economy and simplicity in 
the inspection operation. For example, the diameter of a ball bearing may be checked by deter- 
mining whether it will pass through a gauge consisting of circular holes cut in a template. This 
kind of measurement would be much simpler than directly measuring the diameter with a device 
such as a micrometer. Control charts for attributes are used in these situations. Attribute control 
charts often require a considerably larger sample size than do their variable measurements coun- 
terparts. In this section, we will discuss the fraction-defective control chart, or P chart. 
Sometimes the P chart is called the control chart for fraction nonconforming. 

Suppose D is the number of defective units in a random sample of size n. We assume that 
D is a binomial random variable with unknown parameter p. The fraction defective 


P- 
of each sample is plotted on the chart. Furthermore, the variance of the statistic P is 


s Pp - p) 
Mir n 


Therefore, a P chart for fraction defective could be constructed using p as the center line and 
control limits at 


p(l — p) p(l — p) 


UCL pray) XA LCL p —3 7 (16-22) 


However, the true process fraction defective is almost always unknown and must be estimated 
using the data from preliminary samples. 

Suppose that m preliminary samples each of size n are available, and let D; be the number 
of defectives in the ith sample. The Ê, = D;/n is the sample fraction defective in the ith 
sample. The average fraction defective is 


P= n4 = mn 2, (16-23) 


P Chart 
The center line and upper and lower control limits for the P chart are 








you =p + 3 JP TENE: 


where p is the observed value of the average fraction defective. 
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EXAMPLE 16-4 


Table 16-4 Number of Defectives in Samples of 100 


Ceramic Substrates 


Sample No. of Defectives Sample No. of Defectives 
1 44 11 36 
2 48 12 52 
3 32 13 35 
4 50 14 41 
5 29 15 42 
6 31 16 30 
7 46 17 46 
8 52 18 38 
9 44 19 26 

10 48 20 30 


These control limits are based on the normal approximation to the binomial distribu- 
tion. When p is small, the normal approximation may not always be adequate. In such 
cases, we may use control limits obtained directly from a table of binomial probabilities. 
If p is small, the lower control limit obtained from the normal approximation may be a 
negative number. If this should occur, it is customary to consider zero as the lower control 
limit. 


Suppose we wish to construct a fraction-defective control chart for a ceramic substrate pro- 
duction line. We have 20 preliminary samples, each of size 100; the number of defectives in 
each sample is shown in Table 16-4. Assume that the samples are numbered in the sequence 
of production. Note that p = (800/2000) = 0.40; therefore, the trial parameters for the con- 
trol chart are 





(0.40)(0.60) 
UCL = 0.40 + 3,| —— > = 0.55 CL — 040 
100 
(0.40)(0.60) 
LCL = 0.40 — 34| — o = 025 


The control chart is shown in Fig. 16-16. All samples are in control. If they were not, we 
would search for assignable causes of variation and revise the limits accordingly. This chart 
can be used for controlling future production. 

Although this process exhibits statistical control, its defective rate (p = 0.40) is very 
poor. We should take appropriate steps to investigate the process to determine why such a 
large number of defective units is being produced. Defective units should be analyzed to de- 
termine the specific types of defects present. Once the defect types are known, process 
changes should be investigated to determine their impact on defect levels. Designed experi- 
ments may be useful in this regard. 


Computer software also produces an NP chart. This is just a control chart of nÊ = D, the 
number of defectives in a sample. The points, center line, and control limits for this chart are 
just multiples (times n) of the corresponding elements of a P chart. The use of an NP chart 
avoids the fractions in a P chart. 


Figure 16-16 P chart 
for a ceramic substrate. 
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16-8.2 U Chart (Control Chart for Defects per Unit) 


U Chart 


It is sometimes necessary to monitor the number of defects in a unit of product rather than 

the fraction defective. Suppose that in the production of cloth it is necessary to control the 

number of defects per yard or that in assembling an aircraft wing the number of missing riv- 

ets must be controlled. In these situations we may use the control chart for defects per unit, 

or the U chart. Many defects-per-unit situations can be modeled by the Poisson distribution. 
If each sample consists of n units and there are C total defects in the sample, 


is the average number of defects per unit. A U chart may be constructed for such data. 

If the number of defects in a unit is a Poisson random variable with parameter N, the mean and 
variance of this distribution are both A. Each point on the chart is U, the average number of defects 
per unit from a sample of n units. Therefore, the mean of U is À and the variance of U is \/n. 


A 
uch =2+34/* 
Ñ 
yet =a s A (16-25) 


If there are m preliminary samples, and the number of defects per unit in these samples are U}, 


U,,..., Um the estimator of the average number of defects per unit is 
px 16-26 
U = wi 2 U; (16-26) 


UCL =u + 3 = (16-27) 


where u is the average number of defects per unit. 
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EXAMPLE 16-5 


These control limits are based on the normal approximation to the Poisson distribution. 
When J is small, the normal approximation may not always be adequate. In such cases, we 
may use control limits obtained directly from a table of Poisson probabilities. If u is small, 
the lower control limit obtained from the normal approximation may be a negative number. 
If this should occur, it is customary to consider zero as the lower control limit. 


Printed circuit boards are assembled by a combination of manual assembly and automation. A 
flow solder machine is used to make the mechanical and electrical connections of the leaded 
components to the board. The boards are run through the flow solder process almost continu- 
ously, and every hour five boards are selected and inspected for process-control purposes. The 
number of defects in each sample of five boards is noted. Results for 20 samples are shown in 
Table 16-5. 

The center line for the U chart is 


and the upper and lower control limits are 


UcL - à e 3 S = 16 +3,/48 = 
Let = 0-3/2 = 16 - o JT <0 


The control chart is plotted in Fig. 16-17. Because LCL is negative, it is set to 0. From the con- 
trol chart in Fig. 16-17, we see that the process is in control. However, eight defects per group 
of five circuit boards are too many (about 8/5 = 1.6 defects/board), and the process needs 
improvement. An investigation needs to be made of the specific types of defects found on the 
printed circuit boards. This will usually suggest potential avenues for process improvement. 


a 


Computer software also produces a C chart. This is just a control chart of C, the total of 
defects in a sample. The points, center line, and control limits for this chart are just multiples 


Table 16-5 Number of Defects in Samples of Five Printed Circuit Boards 


Number of Defects per Number of Defects per 
Sample Defects Unit u; Sample Detects Unit u; 
1 6 1.2 11 9 1.8 
2 + 0.8 12 15 3.0 
3 8 1.6 13 8 1.6 
4 10 2.0 14 10 2.0 
5 9 1.8 15 8 1.6 
6 12 2.4 16 2 0.4 
7 16 9.2 I7 7 1.4 
8 2 0.4 18 I 0.2 
9 D 0.6 19 7 1.4 


mÓ 
© 
mÓ 
© 


2.0 20 13 2.6 


UCL = 3.3 A 


Defects per unit, u 
NO 
Z l 
NS > 
N 
/ 
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Figure 16-17 U chart 
of defects per unit on 
printed circuit boards. 
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(times n) of the corresponding elements of a U chart. The use of a C chart avoids the fractions 


that can occur in a U chart. 


EXERCISES FOR SECTION 16-8 


16-24. Suppose the following fraction defective has been 


found in successive samples of size 100 (read down): 
= 


0.09 0.03 0.12 
0.10 0.05 0.14 
0.13 0.13 0.06 
0.08 0.10 0.05 
0.14 0.14 0.14 
0.09 0.07 0.11 
0.10 0.06 0.09 
0.15 0.09 0.13 
0.13 0.08 0.12 
0.06 0.11 0.09 


(a) Using all the data, compute trial control limits for a 
fraction-defective control chart, construct the chart, and 
plot the data. 

(b) Determine whether the process is in statistical control. If not, 
assume assignable causes can be found and out-of-control 
points eliminated. Revise the control limits. 


16-25. The following represent the number of solder de- 

fects observed on 24 samples of five printed circuit boards: 7, 

6, 8, 10, 24, 6, 5, 4, 8, 11, 15, 8, 4, 16, 11, 12, 8, 6, 5, 9, 7, 14, 

8,21. 

(a) Using all the data, compute trial control limits for a U con- 
trol chart, construct the chart, and plot the data. 

(b) Can we conclude that the process is in control using a U 
chart? If not, assume assignable causes can be found, list 
points and revise the control limits. 


16-26. The following represent the number of defects 


per 1000 feet in rubber-covered wire: 1, 1, 3, 7, 8, 10, 5, 13, 
0, 19, 24, 6, 9, 11, 15, 8, 3, 6, 7, 4, 9, 20, 11, 7, 18, 10, 6, 4, 
0,9, 7, 3, 1, 8, 12. Do the data come from a controlled 
process? 

16-27. Consider the data in Exercise 16-25. Set up a C chart 
for this process. Compare it to the U chart in Exercise 16-25. 
Comment on your findings. 

16-28. The following are the numbers of defective sol- 
der joints found during successive samples of 500 solder 
Joints: 


Day | No. of Defectives Day No. of Defectives 
1 106 12 37 
2 116 13 25 
3 164 14 88 
4 89 15 101 
5 99 16 64 
6 40 17 51 
7 112 18 74 
8 36 19 71 
9 69 20 43 

10 74 21 80 
11 42 


(a) Using all the data, compute trial control limits for a 
fraction-defective control chart, construct the chart, and 
plot the data. 

(b) Determine whether the process is in statistical control. If not, 
assume assignable causes can be found and out-of-control 
points eliminated. Revise the control limits. 


CY 
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16-9 CONTROL CHART PERFORMANCE 


Specifying the control limits is one of the critical decisions that must be made in designing a 
control chart. By moving the control limits further from the center line, we decrease the risk 
of a type I error—that 1s, the risk of a point falling beyond the control limits, indicating an 
out-of-control condition when no assignable cause is present. However, widening the control 
limits will also increase the risk of a type II error—that is, the risk of a point falling between 
the control limits when the process is really out of control. If we move the control limits closer 
to the center line, the opposite effect is obtained: The risk of type I error is increased, while the 
risk of type II error is decreased. 

The control limits on a Shewhart control chart are customarily located a distance of plus 
or minus three standard deviations of the variable plotted on the chart from the center line. 
That is, the constant k in equation 16-1 should be set equal to 3. These limits are called 
3-sigma control limits. 

A way to evaluate decisions regarding sample size and sampling frequency is through the 
average run length (ARL) of the control chart. Essentially, the ARL is the average number of 
points that must be plotted before a point indicates an out-of-control condition. For any Shewhart 
control chart, the ARL can be calculated from the mean of a geometric random variable 
(Montgomery 2001). Suppose that p is the probability that any point exceeds the control limits. Then 


(16-28) 





Thus, for an X chart with 3-sigma limits, p — 0.0027 is the probability that a single point falls 
outside the limits when the process is in control, so 


1 1 
ARL = 5 = 000279 





is the average run length of the X chart when the process is in control. That is, even if the process 
remains in control, an out-of-control signal will be generated every 370 points, on the average. 

Consider the piston ring process discussed in Section 16-4.2, and suppose we are sampling 
every hour. Thus, we will have a false alarm about every 370 hours on the average. Suppose we 
are using a sample size of n — 5 and that when the process goes out of control the mean shifts to 
74.0135 millimeters. Then, the probability that X falls between the control limits of Fig. 16-3 is 
equal to 


P[73.9865 = X < 74.0135 when p = 74.0135] 

73.9865 — 74.0135 74.0135 — 74.0135 

= P| s Zs MM 
0.0045 0.0045 


= P|-6=Z50]=055 


IA 


Therefore, p in Equation 16-28 is 0.50, and the out-of-control ARL is 


ae 
Pp 05 


Figure 16-18 
Process mean shift 
of 20. 


16-9 CONTROL CHART PERFORMANCE 631 


Table 16-6 Average Run Length (ARL) for an X Chart with 3-Sigma 


Control Limits 


Magnitude of ARL ARL 
Process Shift n= n=4 
0 370.4 370.4 

0.50 155.2 43.9 
1.00 43.9 6.3 
1.50 15.0 2.0 
2.00 6.3 1.2 
3.00 2.0 1.0 


That is, the control chart will require two samples to detect the process shift, on the aver- 
age, so two hours will elapse between the shift and its detection (again on the average). 
Suppose this approach is unacceptable, because production of piston rings with a mean di- 
ameter of 74.0135 millimeters results in excessive scrap costs and delays final engine as- 
sembly. How can we reduce the time needed to detect the out-of-control condition? One 
method is to sample more frequently. For example, if we sample every half hour, only one 
hour will elapse (on the average) between the shift and its detection. The second possibil- 
ity is to increase the sample size. For example, if we use n = 10, the control limits in 
Fig. 16-3 narrow to 73.9905 and 74.0095. The probability of X falling between the control 
limits when the process mean is 74.0135 millimeters is approximately 0.1, so p = 0.9, and 
the out-of-control ARL is 


AB eue P coge 
. P 09 ^" 


Thus, the larger sample size would allow the shift to be detected about twice as quickly as the 
old one. If it became important to detect the shift in the first hour after it occurred, two con- 
trol chart designs would work: 


Design 1 Design 2 
sample size: n = 5 Sample size: n = 10 
Sampling frequency: every half hour Sampling frequency: every hour 


Table 16-6 provides average run lengths for an X chart with 3-sigma control limits. The aver- 
age run lengths are calculated for shifts in the process mean from 0 to 3.00 and for sample 
sizes of n = 1 and n = 4 by using 1/p, where p is the probability that a point plots outside of 
the control limits. Figure 16-18 illustrates a shift in the process mean of 2o. 


u ———> U+ 20 
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EXERCISES FOR SECTION 16-9 


16-29. Consider the X control chart in Fig. 16-3. Suppose 

that the mean shifts to 74.010 millimeters. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 


16-30. An X chart uses samples of size 4. The center line is 
at 100, and the upper and lower 3-sigma control limits are at 
106 and 94, respectively. 

(a) What is the process o? 

(b) Suppose the process mean shifts to 96. Find the 
probability that this shift will be detected on the next 
sample. 

(c) Find the ARL to detect the shift in part (b). 


16-31. Consider the revised X control chart in Exercise 16-1 

with G = 2.466, UCL = 37.404, LCL = 30.780, and n = 5. 

Suppose that the mean shifts to 36. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 

16-32. Consider the X control chart in Exercise 16-2(a) 

with r = 0.344, UCL = 14.708, LCL = 14.312, and n = 5. 

Suppose that the mean shifts to 14.6. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 


16-33. Consider the X control chart in Exercise 16-3(a) 
with r = 6.750, UCL = 15.630, LCL = 5.795, and n = 4. 
Suppose that the mean shifts to 13. 


(a) What is the probability that this shift will be detected on 
the next sample? 
(b) What is the ARL after the shift? 


16-34. Consider the X control chart in Exercise 16-4(a) 

with 6 = 1.40, UCL = 21.88, LCL = 18.12, and n = 5. 

Suppose that the mean shifts to 17. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 


16-35. Consider the X control chart in Exercise 16-5 with 

r = 34.286, UCL = 242.780, LCL = 203.220, and n = 5. 

Suppose that the mean shifts to 210. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 


16-36. Consider the revised X control chart in Exercise 16-7 

with 6 = 0.000924, UCL = 0.0635, LCL = 0.0624, and 

n = 5. Suppose that the mean shifts to 0.0625. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 


16-37. Consider the revised X control chart in Exercise 16-8 

with 6 = 0.669, UCL = 7.443, LCL = 5.125, and n = 3. 

Suppose that the mean shifts to 5.5. 

(a) What is the probability that this shift will be detected on 
the next sample? 

(b) What is the ARL after the shift? 


16-10 CUMULATIVE SUM CONTROL CHART 


In Sections 16-5 and 16-6 we have presented basic types of Shewhart control charts. A ma- 
jor disadvantage of any Shewhart control chart is that the chart is relatively insensitive to 
small shifts in the process, say, on the order of about 1.50 or less. One reason for this relatively 
poor performance in detecting small process shifts is that the Shewhart chart makes use of 
only the information in the last plotted point, and it ignores the information in the sequence of 
points. This problem can be addressed, to some extent by adding criteria such as the Western 
Electric rules to a Shewhart chart, but the use of these rules reduces the simplicity and ease 
of interpretation of the chart. These rules would also cause the in-control average run length 
of a Shewhart chart to drop below 370. This increase in the false alarm rate can have serious 
practical consequences. 

A very effective alternative to the Shewhart control chart is the cumulative sum control 
chart (or CUSUM). This chart has much better performance (in terms of ARL) for detecting 
small shifts than the Shewhart chart, but it does not cause the in-control ARL to drop signifi- 
cantly. This section will illustrate the use of the CUSUM for sample averages and individual 
measurements. 

The CUSUM chart plots the cumulative sums of the deviations of the sample values from 
a target value. For example, suppose that samples of size n = 1 are collected, and X, is the 
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average of the jth sample. Then if pọ is the target for the process mean, the cumulative sum 
control chart is formed by plotting the quantity 


S; — * (X; — wo) (16-29) 


i 


against the sample number 7. Now, S; is called the cumulative sum up to and including the 
ith sample. Because they combine information from several samples, cumulative sum charts 
are more effective than Shewhart charts for detecting small process shifts. Furthermore, 
they are particularly effective with samples of n — 1. This makes the cumulative sum con- 
trol chart a good candidate for use in the chemical and process industries where rational 
subgroups are frequently of size 1, as well as in discrete parts manufacturing with automatic 
measurement of each part and online control using a microcomputer directly at the work 
center. 

If the process remains in control at the target value pu, the cumulative sum defined in 
equation 16-29 should fluctuate around zero. However, if the mean shifts upward to some 
value u; > p, Say, an upward or positive drift will develop in the cumulative sum S. 
Conversely, if the mean shifts downward to some p. < uo, a downward or negative drift in S; 
will develop. Therefore, if a trend develops in the plotted points either upward or downward, 
we should consider this as evidence that the process mean has shifted, and a search for the 
assignable cause should be performed. 

This theory can easily be demonstrated by applying the CUSUM to the chemical process 
concentration data in Table 16-3. Since the concentration readings are individual measure- 
ments, we would take X, = X; in computing the CUSUM. Suppose that the target value for the 
concentration is [uy = 99. Then the CUSUM is 


i 


> (X% — 99) 


i= 


iA 
| 


i— l 

= (X, — 99) + 2 (X, = 99) 
= 

A 99) T Si 


Table 16-7 shows the computation of this CUSUM, where the starting value of the 
CUSUM, Sp, is taken to be zero. Figure 16-19 plots the CUSUM from the last column of Table 
16-7. Notice that the CUSUM fluctuates around the value of 0. 

The graph in Fig. 16-19 is not a control chart because it lacks control limits. There are 
two general approaches to devising control limits for CUSUMS. The older of these two 
methods is the V-mask procedure. A typical V mask is shown in Fig. 16-20(a). It is a 
V-shaped notch in a plane that can be placed at different locations on the CUSUM chart. The 
decision procedure consists of placing the V mask on the cumulative sum control chart with 
the point O on the last value of s; and the line OP parallel to the horizontal axis. If all the pre- 
vious cumulative sums, S4, S2, . . . , 5; ,, lie within the two arms of the V mask, the process is 
in control. However, if any s; lies outside the arms of the mask, the process is considered to 
be out of control. In actual use, the V mask would be applied to each new point on the 
CUSUM chart as soon as it was plotted. In the example shown in Fig. 16-20(b), an upward 
shift in the mean is indicated, since at least one of the points that have occurred earlier than 
sample 22 now lies below the lower arm of the mask, when the V mask is centered on the 
thirtieth observation. If the point lies above the upper arm, a downward shift in the mean is 
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Table 16-7 CUSUM Computations for the Chemical Process Concentration Data in Table 16-3 


Observation, i 35 Xo 5 SoS 9D) sS 
1 102.0 3.0 3.0 
2 94.8 —4.2 weal 
3 98.3 =) — 1.9 
4 98.4 —0.6 m 
5 102.0 3.0 0.5 
6 98.5 =D 0.0 
7 99.0 0.0 0.0 
8 Olt =h — 3 
9 100.0 1.0 E05 

10 98.1 —0.9 —L2 
11 101.3 PS 1.1 
12 98.7 =03 0.8 
13 101.1 2.1 2.9 
14 98.4 —0.6 2.3 
15 97.0 x 0.3 
16 96.7 x —20 
17 100.3 1.3 220 7 
18 101.4 2.4 L5 
19 DIM = 1,5 =ü. 
20 101.0 2.0 1:9 


indicated. Thus, the V mask forms a visual frame of reference similar to the control limits on 
an ordinary Shewhart control chart. For the technical details of designing the V mask, see 
Montgomery (2001). 

While some computer programs plot CUSUMS with the V-mask control scheme, 
we feel that the other approach to CUSUM control, the tabular CUSUM, is superior. 
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Figure 16-20 The cumulative sum control chart. (a) The V-mask and scaling. (b) The cumulative 
sum control chart in operation. 


The tabular procedure is particularly attractive when the CUSUM is implemented on a 
computer. 

Let $5(7) be an upper one-sided CUSUM for period i and 5,(i) be a lower one-sided 
CUSUM for period i. These quantities are calculated from 


St) = malor (Gig a ss (16-30) 


Se) — x Os Ma — Esq (16-31) 


where the starting values s;,(0) = s;(0) = 0. 





In Equations 16-30 and 16-31 K is called the reference value, which is usually chosen 
about halfway between the target jy) and the value of the mean corresponding to the out-of- 
control state, jj, = pọ + A. That is, K is about one-half the magnitude of the shift we are in- 
terested 1n, or 


i 
2 


Notice that S,,(i) and S; (7) accumulate deviations from the target value that are greater than 
K, with both quantities reset to zero upon becoming negative. If either 5S, (7) or S, (1) exceeds 
a constant H, the process is out of control. This constant H is usually called the decision 
interval. 
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EXAMPLE 16-6 


A Tabular CUSUM 


We will illustrate the tabular CUSUM by applying it to the chemical process concentration data 
in Table 16-7. The process target is 4) = 99, and we will use K = 1 as the reference value and 
H = 10 as the decision interval. The reasons for these choices will be explained later. 

Table 16-8 shows the tabular CUSUM scheme for the chemical process concentration 
data. To illustrate the calculations, note that 


sy(i) = max[0, x; — (mo + K) + sali — 1)) = max[0, x; — (99 + 1) + sy(i — 1)] 
= max[0, x; — 100 + sg(i — 1)| 
si (i) = n ^ — K) — x; + si(i — 1)] = max[0,(99 — 1) — x; + s;(i — 1)] 


= max|0, 98 — x; + sz(i — 1)] 
Therefore, for observation 1 the CUSUMS are 
sg(1) = max[0, x, — 100 + 55 (0)|] = max[0, 102.0 — 100 + 0] = 2.0 


and 
sj(1) = max[0, 98 — x, + s;(0)] = max[0, 98 — 102.0 + 0| = 0 


as shown in Table 16-8. The quantities ny and n; in Table 16-8 indicate the number of periods that 
the CUSUM s,,(i) or s;(i) have been nonzero. Notice that the CUSUMS in this example never ex- 
ceed the decision interval H = 10. We would therefore conclude that the process is in control. 


When the tabular CUSUM indicates that the process is out of control, we should search 
for the assignable cause, take any corrective actions indicated, and restart the CUSUMS at 


Table 16-8 The Tabular CUSUM for the Chemical Process Concentration Data 


OD aUo Upper CUSUM Lower CUSUM 
i X; X ~ 100 Sy(i) ng DOR si) n; 
1 102.0 2.0 2.0 1 —4.0 0.0 0 
p 94.8 —9.2 0.0 0 3.2 3.2 1 
3 98.3 —h 0.0 0 —0.3 29 2 
4 98.4 —1.6 0.0 0 —0.4 2.9 3 
5 102.0 2.0 2.0 1 —4.0 0.0 0 
6 98.5 13 0.5 2 —0.5 0.0 0 
7 99.0 = 120 0.0 0 —1.0 0.0 0 
8 97.7 =2.3 0.0 0 0.3 0.3 1 
9 100.0 0.0 0.0 0 —2.0 0.0 0 
10 98.1 —1.9 0.0 0 —0.1 0.0 0 
11 101.3 1.3 1.3 1 zz e 0.0 0 
12 98.7 = 0.0 0 —D 0.0 0 
13 101.1 1.1 1.1 1 =3.1 0.0 0 
14 98.4 —1.6 0.0 0 —0.4 0.0 0 
15 97.0 —3.0 0.0 0 1.0 1.0 1 
16 96.7 —3.3 0.0 0 1.3 2.9 2 
17 100.3 0.3 0.3 1 =23 0.0 0 
18 101.4 1.4 1.7 2 —3.4 0.0 0 
19 97.2 —2.8 0.0 0 0.8 0.8 1 
20 101.0 1.0 1.0 0 —3.0 0.0 0 


Figure 16-21 The 
CUSUM status chart 
for Example 15-6. 
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zero. It may be helpful to have an estimate of the new process mean following the shift. This 
can be computed from 


sii) 





ù = 16-32 
i oy. $45 TE eN? ! l 
Mo nr” if sz(i) 


It is also useful to present a graphical display of the tabular CUSUMS, which are some- 
times called CUSUM status charts. They are constructed by plotting s7(7) and s; (7) versus the 
sample number. Figure 16-21 shows the CUSUM status chart for the data in Example 16-6. 
Each vertical bar represents the value of 55 (7) and s;(i) in period i. With the decision interval 
plotted on the chart, the CUSUM status chart resembles a Shewhart control chart. We have 
also plotted the sample statistics x; for each period on the CUSUM status chart as the solid 
dots. This frequently helps the user of the control chart to visualize the actual process per- 
formance that has led to a particular value of the CUSUM. 

The tabular CUSUM is designed by choosing values for the reference value K and the de- 
cision interval H. We recommend that these parameters be selected to provide good average 
run-length values. There have been many analytical studies of CUSUM ARL performance. 
Based on these studies, we may give some general recommendations for selecting and K. 
Define H = hoy and K = koy, where o is the standard deviation of the sample variable used 
in forming the CUSUM (if n = 1, oy = oy). Using h = 4 or h = 5 and k = 1/2 will gener- 
ally provide a CUSUM that has good ARL properties against a shift of about loy (or 1o) in 
the process mean. If much larger or smaller shifts are of interest, set k = 5/2, where 8 is the size 
of the shift in standard deviation units. Some practitioners prefer to use a standardized variable 
y; = (x; — uo)y/o x as the basis of the CUSUM. In that case, Equations 16-30 and 16-31 become 


sy(i) = max[0, y; — K + syli — 1)] and . sj;(i) = max[0, K — y; + s;(i — 1)] 


106 6 
H=5 
105 5 
104 4 
103 3 
102 | 2 e e F 
Sy) e e 
101 1 o 
m e 

100 O e 

99 1 e i 

un ales ° 9 e 
98 | 2 e © 
e 
97 3 e 
96 4 
H=5 
95 5 
@ 
94 
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Table 16-9 Average Run Lengths for a CUSUM Control Chart 


With K = 1/2 
Shift in Mean 
(multiple of oy) h=A4 =5 

0 168 465 
0.25 74.2 139 
0.50 26.6 38.0 
0.75 133 17.0 
1.00 8.38 10.4 
1.50 4.75 5.13 
2.00 3.34 4.01 
2.50 2.62 3.11 
3.00 2.19 2.57 
4.00 1.71 2.01 


For this scheme, we would usually select K = 1/2 and H = 40r H = 5. 

To illustrate how well the recommendations of h = 4 or h = 5 with k = 1/2 work, con- 
sider these average run lengths in Table 16-9. Notice that a shift of 1oy would be detected in 
either 8.38 samples (with k = 1/2 and h = 4) or 10.4 samples (with k = 1/2 and h = 5). By 
comparison, Table 16-1 shows that an X chart would require approximately 43.9 samples, on 
the average, to detect this shift. 

These design rules were used for the CUSUM in Example 16-6. We assumed that the 
process standard deviation o = 2. (This is a reasonable value; see Example 16-2.) Then with 
k = 1/2 andh = 5, we would use 


K= ko = 'A(2) = 1 and H= ho = 5(2) = 10 


in the tabular CUSUM procedure. 

Finally, we should note that supplemental procedures such as the Western Electric rules 
cannot be safely applied to the CUSUM, because successive values of S,,(i) and S;(i) are not 
independent. In fact, the CUSUM can be thought of as a weighted average, where the weights 
are stochastic or random. In effect, all the CUSUM values are highly correlated, thereby caus- 
ing the Western Electric rules to give too many false alarms. 


EXERCISES FOR SECTION 16-10 


16-38. The purity of a chemical product is measured every (a) Set up a CUSUM control chart for this process. Use 


zx two hours. The results of 20 consecutive measurements are as o = 0.8 in setting up the procedure, and assume that the 
c= follows: desired process target is 90. Does the process appear to 
ERR a be in control? 
Sample —  — Punty Sample Purity _ (b) Suppose that the next five observations are 90.75, 90.00, 
l 89.11 11 88.55 91.15, 90.95, and 90.86. Apply the CUSUM in part (a) to 
2 90.59 12 90.43 these new observations. Is there any evidence that the 
3 21.03 13 21.04 process has shifted out of control? 
4 89.46 14 88.17 
5 39.78 15 91.23 16-39. The diameter of holes is measured in consecutive — 
6 90.05 16 90.92 order by an automatic sensor. The results of measuring 25 
7 90.63 17 88.86 holes follow. ind 
8 90.75 18 90.87 (a) Estimate the process standard deviation. 
E: a "i s (b) Set up a CUSUM control procedure, assuming that the 


target diameter is 10.0 millimeters. Does the process 


Sample Diameter Sample Diameter 
1 9.94 14 9.99 
2 9.93 15 10.12 
3 10.09 16 9.81 
4 9.98 17 9.73 
D 10.11 18 10.14 
6 9.99 19 9.96 
7 10.11 20 10.06 
8 9.84 21 10.11 
9 9.82 22 9.05 

10 10.38 23 9.92 
11 9.99 24 10.09 
I2 10.41 23 9.85 
13 10.36 


appear to be operating in a state of statistical control at the de- 
sired target level? 

16-40. The concentration of a chemical product is meas- 
ured by taking four samples from each batch of material. The 
average concentration of these measurements is shown for the 
last 20 batches in the following table: 
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(a) Suppose that the process standard deviation is o. = 8 and 
that the target value of concentration for this process is 
100. Design a CUSUM scheme for the process. Does the 
process appear to be in control at the target? 

(b) How many batches would you expect to be produced with 
off-target concentration before it would be detected by the 
CUSUM control chart if the concentration shifted to 104? 
Use Table 16-9. 


16-41. Consider a standardized CUSUM with H = 5 and 
K = 1/2. Samples are taken every two hours from the 
process. The target value for the process is [uy = 50 and 
o = 2. Use Table 16-9. 

(a) If the sample size is » = 1, how many samples would be 
required to detect a shift in the process mean to u = 51 on 
average? 

(b) If the sample size is increased to n = 4, how does this af- 
fect the average run length to detect the shift to w = 51 
that you determined in part (a)? 


16-42. A process has a target of py = 100 and a standard 
deviation of o = 4. Samples of size n = 1 are taken every two 
hours. Use Table 16-9. 

(a) Suppose the process mean shifts to y = 102. How many 
hours of production will occur before the process shift is 
detected by a standardized CUSUM with H = 5 and 
K = 1/2? 


Batch Concentration Batch Concentration (b) It is important to detect the shift defined in part (a) more 
1 104.5 11 95.4 quickly. A proposal is made to reduce the sampling 
2 99.9 2 94.5 frequency to 0.5 hour. How will this affect the CUSUM 
3 106.7 13 104.5 control procedure? How much more quickly will the shift 
be detected? 
: Boe E ii (c) Suppose that the 0.5 hour sampling interval in part (b) is 
5 94.8 15 97.7 adopted. How often will false alarms occur with this new 
6 94.6 16 97.0 sampling interval? How often did they occur with the old 
3 104.4 17 95.8 interval of two hours? 
g 99.4 18 97.4 (d) A proposal is made to increase the sample size ton = 4 and 
9 100.3 19 99.0 retain the two-hour sampling interval. How does this sug- 
gestion compare in terms of average detection time to the 
10 100.3 20 102.6 suggestion of decreasing the sampling interval to 0.5 hour? 
16-11 OTHER SPC PROBLEM-SOLVING TOOLS 


While the control chart is a very powerful tool for investigating the causes of variation in a 
process, it is most effective when used with other SPC problem-solving tools. In this section 
we illustrate some of these tools, using the printed circuit board defect data in Example 16-4. 

Figure 16-17 shows a U chart for the number of defects in samples of five printed circuit 
boards. The chart exhibits statistical control, but the number of defects must be reduced. The 
average number of defects per board is 8/5 = 1.6, and this level of defects would require ex- 


tensive rework. 


The first step in solving this problem is to construct a Pareto diagram of the individual de- 
fect types. The Pareto diagram, shown in Fig. 16-22, indicates that insufficient solder and solder 
balls are the most frequently occurring defects, accounting for (109/160) 100 = 68% of the 
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Figure 16-22 Pareto 
diagram for printed 
circuit board defects. 


Figure 16-23 Cause- 
and-effect diagram for 
the printed circuit 
board flow solder 
process. 
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observed defects. Furthermore, the first five defect categories on the Pareto chart are all solder- 
related defects. This points to the flow solder process as a potential opportunity for improvement. 

To improve the flow solder process, a team consisting of the flow solder operator, the 
shop supervisor, the manufacturing engineer responsible for the process, and a quality engi- 
neer meets to study potential causes of solder defects. They conduct a brainstorming session 
and produce the cause-and-effect diagram shown in Fig. 16-23. The cause-and-effect diagram 
is widely used to display the various potential causes of defects in products and their interre- 
lationships. They are useful in summarizing knowledge about the process. 

As a result of the brainstorming session, the team tentatively identifies the following vari- 
ables as potentially influential in creating solder defects: 


1. Flux specific gravity 


2. Solder temperature 


Wave turbulance Temperature Amount 


Exhaust Wave height 


Conveyor speed Specific gravity 


Contact time 
Solder 
defects 


Temperature 


Maintenance 


Conveyor angle Wave fluidity Type 


Alignment of pallet Orientation 
Solderability 


Pallet loading 
Contaminated lead 


Figure 16-24 Defect 
concentration diagram 
for a printed circuit 
board. 
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Front 


Region of insufficient solder 





Back 


Conveyor speed 
Conveyor angle 


Solder wave height 


d: 10 Es 


Preheat temperature 
7. Pallet loading method 


A statistically designed experiment could be used to investigate the effect of these seven vari- 
ables on solder defects. 

In addition, the team constructed a defect concentration diagram for the product. A de- 
fect concentration diagram is just a sketch or drawing of the product, with the most frequently 
occurring defects shown on the part. This diagram is used to determine whether defects occur 
in the same location on the part. The defect concentration diagram for the printed circuit board 
is shown in Fig. 16-24. This diagram indicates that most of the insufficient solder defects are 
near the front edge of the board, where it makes initial contact with the solder wave. Further 
investigation showed that one of the pallets used to carry the boards across the wave was bent, 
causing the front edge of the board to make poor contact with the solder wave. 

When the defective pallet was replaced, a designed experiment was used to investigate 
the seven variables discussed earlier. The results of this experiment indicated that several of 
these factors were influential and could be adjusted to reduce solder defects. After the results 
of the experiment were implemented, the percentage of solder joints requiring rework was re- 
duced from 146 to under 100 parts per million (0.01 96). 


16-12 IMPLEMENTING SPC 


The methods of statistical process control can provide significant payback to those companies 
that can successfully implement them. While SPC seems to be a collection of statistically 
based problem-solving tools, there is more to the successful use of SPC than simply learning 
and using these tools. Management involvement and commitment to the quality-improvement 
process is the most vital component of SPC's potential success. Management is a role model, 
and others in the organization will look to management for guidance and as an example. A 
team approach is also important, for it is usually difficult for one person alone to introduce 
process improvements. Many of the “magnificent seven” problem-solving tools are helpful in 
building an improvement team, including cause-and-effect diagrams, Pareto charts, and defect 
concentration diagrams. The basic SPC problem-solving tools must become widely known 
and widely used throughout the organization. Continuous training in SPC and quality im- 
provement is necessary to achieve this widespread knowledge of the tools. 

The objective of an SPC-based quality-improvement program is continuous improve- 
ment on a weekly, quarterly, and annual basis. SPC is not a one-time program to be applied 
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when the business is in trouble and later abandoned. Quality improvement must become part 
of the culture of the organization. 

The control chart is an important tool for process improvement. Processes do not natu- 
rally operate in an in-control state, and the use of control charts is an important step that must 
be taken early in an SPC program to eliminate assignable causes, reduce process variability, 
and stabilize process performance. To improve quality and productivity, we must begin to 
manage with facts and data, and not just rely on judgment. Control charts are an important part 
of this change in management approach. 

In implementing a company-wide SPC program, we have found that the following ele- 
ments are usually present in all successful efforts: 


Management leadership 
A team approach 
Education of employees at all levels 


Emphasis on continuous improvement 


SE Hes lee SS 


A mechanism for recognizing success 


We cannot overemphasize the importance of management leadership and the team approach. 
Successful quality improvement is a “top-down” management-driven activity. It is also im- 
portant to measure progress and success and to spread knowledge of this success throughout 
the organization. When successful improvements are communicated throughout the company, 
this can provide motivation and incentive to improve other processes and to make continuous 
improvement a normal part of the way of doing business. 

The philosophy of W. Edwards Deming provides an important framework for imple- 
menting quality and productivity improvement. Deming's philosophy is summarized in his 14 
points for management. The adherence to these management principles has been an important 
factor in Japan's industrial success and continues to be the catalyst in that nation's quality- and 
productivity-improvement efforts. This philosophy has also now spread rapidly in the West. 
Deming's 14 points are as follows. 


1. Create a constancy of purpose focused on the improvement of products and serv- 
ices. Constantly try to improve product design and performance. Investment in re- 
search, development, and innovation will have a long-term payback to the organization. 


2. Adopt a new philosophy of rejecting poor workmanship, defective products, or 
bad service. It costs as much to produce a defective unit as it does to produce a 
good one (and sometimes more). The cost of dealing with scrap, rework, and other 
losses created by defectives is an enormous drain on company resources. 


3. Do not rely on mass inspection to “control” quality. All inspection can do is sort 
out defectives, and at this point it is too late because we have already paid to pro- 
duce these defectives. Inspection occurs too late in the process, it is expensive, and 
it 1s often ineffective. Quality results from the prevention of defectives through 
process improvement, not inspection. 


4. Donotaward business to suppliers on the basis of price alone, but also consider 
quality. Price is a meaningful measure of a supplier's product only if it is consid- 
ered in relation to a measure of quality. In other words, the total cost of the item 
must be considered, not just the purchase price. When quality is considered, the 
lowest bidder is frequently not the low-cost supplier. Preference should be given to 
suppliers who use modern methods of quality improvement in their business and 
who can demonstrate process control and capability. 


10. 


11. 


12. 


13. 


14. 
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Focus on continuous improvement. Constantly try to improve the production and 
service system. Involve the workforce in these activities and make use of statistical 
methods, particularly the SPC problem-solving tools discussed in the previous sec- 
tion. 


Practice modern training methods and invest in training for all employees. 
Everyone should be trained in the technical aspects of their job, as well as in mod- 
ern quality- and productivity-improvement methods. The training should encourage 
all employees to practice these methods every day. 


Practice modern supervision methods. Supervision should not consist merely of 
passive surveillance of workers, but should be focused on helping the employees 
improve the system in which they work. The first goal of supervision should be to 
improve the work system and the product. 


Drive out fear. Many workers are afraid to ask questions, report problems, or point 
out conditions that are barriers to quality and effective production. In many organi- 
zations the economic loss associated with fear is large; only management can elim- 
inate fear. 


Break down the barriers between functional areas of the business. Teamwork 
among different organizational units is essential for effective quality and productiv- 
ity improvement to take place. 


Eliminate targets, slogans, and numerical goals for the workforce. A target such 
as "zero defects" is useless without a plan as to how to achieve this objective. In 
fact, these slogans and “programs” are usually counterproductive. Work to improve 
the system and provide information on that. 


Eliminate numerical quotas and work standards. These standards have histori- 
cally been set without regard to quality. Work standards are often symptoms of man- 
agement's inability to understand the work process and to provide an effective man- 
agement system focused on improving this process. 


Remove the barriers that discourage employees from doing their jobs. Management 
must listen to employee suggestions, comments, and complaints. The person who is 
doing the job is the one who knows the most about it, and usually has valuable ideas 
about how to make the process work more effectively. The workforce is an impor- 
tant participant in the business, and not just an opponent in collective bargaining. 


Institute an ongoing program of training and education for all employees. 
Education in simple, powerful statistical techniques should be mandatory for all 
employees. Use of the basic SPC problem-solving tools, particularly the control 
chart, should become widespread in the business. As these charts become wide- 
spread, and as employees understand their uses, they will be more likely to look for 
the causes of poor quality and to identify process improvements. Education is a way 
of making everyone partners in the quality-improvement process. 


Create a structure in top management that will vigorously advocate the first 13 
points. 


As we read Deming's 14 points, we notice two things. First, there is a strong emphasis on 
change. Second, the role of management in guiding this change process is of dominating im- 
portance. But what should be changed, and how should this change process be started? For ex- 
ample, if we want to improve the yield of a semiconductor manufacturing process, what 
should we do? It is in this area that statistical methods most frequently come into play. To im- 
prove the semiconductor process, we must determine which controllable factors in the process 
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influence the number of defective units produced. To answer this question, we must collect 
data on the process and see how the system reacts to changes in the process variables. 
Statistical methods, including the SPC and experimental design techniques in this book, can 


contribute to this knowledge. 


SUPPLEMENTAL EXERCISES 


16-43. The diameter of fuse pins used in an aircraft engine 
a application is an important quality characteristic. Twenty-five 
c—7 samples of three pins each are shown as follows: 


sample 
Number Diameter 
1 64.030 64.002 64.019 
2 63.995 63.992 64.001 
3 63.988 64.024 64.021 
4 64.002 63.996 63.993 
5 63.992 64.007 64.015 
6 64.009 63.994 63.997 
7 63.995 64.006 63.994 
8 63.985 64.003 63.993 
9 64.008 63.995 64.009 
10 63.998 74.000 63.990 
11 63.994 63.998 63.994 
E 64.004 64.000 64.007 
13 63.983 64.002 63.998 
14 64.006 63.967 63.994 
15 64.012 64.014 63.998 
16 64.000 63.984 64.005 
17 63.994 64.012 63.986 
18 64.006 64.010 64.018 
19 63.984 64.002 64.003 
20 64.000 64.010 64.013 
21 63.988 64.001 64.009 
22 64.004 63.999 63.990 
29 64.010 63.989 63.990 
24 64.015 64.008 63.993 
25 63.982 63.984 63.995 


(a) Set up X and R charts for this process. If necessary, revise 
limits so that no observations are out-of-control. 

(b) Estimate the process mean and standard deviation. 

(c) Suppose the process specifications are at 64 + 0.02. 
Calculate an estimate of PCR. Does the process meet a 
minimum capability level of PCR = 1.33? 

(d) Calculate an estimate of PCR,. Use this ratio to draw con- 
clusions about process capability. 


(e) To make this process a six-sigma process, the variance o^ 
would have to be decreased such that PCR, = 2.0. What 
should this new variance value be? 

(f) Suppose the mean shifts to 64.01. What is the probability 
that this shift will be detected on the next sample? What is 
the ARL after the shift? 

16-44. Rework Exercise 16-43 with X and S charts. 

16-45. Plastic bottles for liquid laundry detergent are 

formed by blow molding. Twenty samples of n = 100 bottles 

are inspected in time order of production, and the fraction de- 
fective in each sample is reported. The data are as follows: 


sample Fraction Defective 
1 0.12 
2 0.15 
3 0.18 
4 0.10 
5 0.12 
6 0.11 
7 0.05 
8 0.09 
9 0.13 
10 0.13 
11 0.10 
12 0.07 
13 0.12 
14 0.08 
15 0.09 
16 0.15 
17 0.10 
18 0.06 
19 0.12 
20 0.13 


(a) Set up a P chart for this process. Is the process in statisti- 
cal control? 


(b) Suppose that instead of n = 100, n = 200. Use the data 


given to set up a P chart for this process. Revise the con- 
trol limits if necessary. 


(c) Compare your control limits for the P charts in parts (a) and 


(b). Explain why they differ. Also, explain why your assess- 
ment about statistical control differs for the two sizes of n. 
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(a) Suppose that a shift in the process mean of magnitude 
o occurs. Verify that the ARL for detecting the shift is 
ARL = 43.9. 


16-46. Cover cases for a personal computer are manufac- 
T tured by injection molding. Samples of five cases are taken 
c—7 from the process periodically, and the number of defects is 


noted. Twenty-five samples follow: 


sample No. of Defects sample No. of Defects 
1 3 14 8 
2 2 15 0 
3 0 16 2 
4 ] 17 4 
5 4 18 3 
6 3 19 5 
7 2 20 0 
8 4 21 2 
9 1 22 1 
10 0 23 9 
11 2 24 3 
12 3 25 2 
13 2 


(a) Using all the data, find trial control limits for this U chart 
for the process. 

(b) Use the trial control limits from part (a) to identify out-of- 
control points. If necessary, revise your control limits. 

(c) Suppose that instead of samples of 5 cases, the sample 
size was 10. Repeat parts (a) and (b). Explain how this 
change alters your answers to parts (a) and (b). 

16-47. Consider the data in Exercise 16-46. 


(a) Using all the data, find trial control limits for a C chart for 
this process. 

(b) Use the trial control limits of part (a) to identify out-of- 
control points. If necessary, revise your control limits. 

(c) Suppose that instead of samples of 5 cases, the sample 
was 10 cases. Repeat parts (a) and (b). Explain how this 
alters your answers to parts (a) and (b). 

16-48. Suppose that a process is in control and an X chart is 

used with a sample size of 4 to monitor the process. Suddenly 

there is a mean shift of 1.5o. 

(a) If 3-sigma control limits are in use on the X chart, what is 
the probability that this shift will remain undetected for 
three consecutive samples? 

(b) If 2-sigma control limits are in use on the X chart, what is 
the probability that this shift will remain undetected for 
three consecutive samples? 

(c) Compare your answers to parts (a) and (b) and explain 
why they differ. Also, which limits you would recommend 
using and why? 

16-49. Consider the control chart for individuals with 

3-sigma limits. 


(b) Find the ARL for detecting a shift of magnitude 2o in the 
process mean. 

(c) Find the ARL for detecting a shift of magnitude 3o in the 
process mean. 

(d) Compare your answers to parts (a), (b), and (c) and ex- 
plain why the ARL for detection is decreasing as the mag- 
nitude of the shift increases. 


16-50. Consider a control chart for individuals, applied to a 
continuous 24-hour chemical process with observations taken 
every hour. 

(a) If the chart has 3-sigma limits, verify that the in-control 
ARL is ARL = 370. How many false alarms would occur 
each 30-day month, on the average, with this chart? 

(b) Suppose that the chart has 2-sigma limits. Does this re- 
duce the ARL for detecting a shift in the mean of magni- 
tude c? (Recall that the ARL for detecting this shift with 
3-sigma limits is 43.9.) 

(c) Find the in-control ARL if 2-sigma limits are used on the 
chart. How many false alarms would occur each month 
with this chart? Is this in-control ARL performance satis- 
factory? Explain your answer. 

16-51. The depth of a keyway 1s an important part quality 

characteristic. Samples of size n — 5 are taken every four hours 

from the process and 20 samples are summarized as follows: 


Sample X r 
1 139.7 1.1 
2 139.8 1.4 
3 140.0 1.3 
4 140.1 1.6 
5 139.8 0.9 
6 139.9 1.0 
7 139.7 1.4 
8 140.2 1.2 
9 139.3 1.1 

10 140.7 1.0 
11 138.4 0.8 
12 138.5 0.9 
13 137.9 1.2 
14 138.5 1.1 
15 140.8 1.0 
16 140.5 1.3 
17 139.4 1.4 
18 139.9 1.0 
19 15155 1.5 
20 139.2 1.3 
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(a) Using all the data, find trial control limits for X and R 
charts. Is the process in control? 

(b) Use the trial control limits from part (a) to identify out-of- 
control points. If necessary, revise your control limits. 
Then, estimate the process standard deviation. 

(c) Suppose that the specifications are at 140 — 2. Using the 
results from part (b), what statements can you make about 
process capability? Compute estimates of the appropriate 
process capability ratios. 

(d) To make this process a “6-sigma process,” the variance o? 
would have to be decreased such that PCR, — 2.0. What 
should this new variance value be? 

(e) Suppose the mean shifts to 139.7. What is the probability 
that this shift will be detected on the next sample? What is 
the ARL after the shift? 


16-52. A process is controlled by a P chart using samples of 

size 100. The center line on the chart is 0.05. 

(a) What is the probability that the control chart detects a shift 
to 0.08 on the first sample following the shift? 

(b) What is the probability that the control chart does not de- 
tect a shift to 0.07 on the first sample following the shift 
but does detect it on the second sample? 

(c) Suppose that instead of a shift in the mean to 0.07, the 
mean shifts to 0.10. Repeat parts (a) and (b). 

(d) Compare your answers for a shift to 0.07 and for a shift to 
0.10. Explain why they differ. Also, explain why a shift to 
0.10 is easier to detect. 

16-53. Suppose the average number of defects in a unit is 

known to be 8. If the mean number of defects in a unit shifts to 

16, what is the probability that it will be detected by the U 

chart on the first sample following the shift 

(a) if the sample size is n — 4? 

(b) if the sample size is n = 10? 

Use a normal approximation for U. 


16-54. Suppose the average number of defects in a unit is 


known to be 10. If the mean number of defects in a unit shifts 

to 14, what is the probability that it will be detected by the U 

chart on the first sample following the shift 

(a) if the sample size is n — 1? 

(b) if the sample size is n — 4? 

Use a normal approximation for U. 

16-55. Suppose that an X control chart with 2-sigma lim- 

its is used to control a process. Find the probability that a 

false out-of-control signal will be produced on the next sam- 

ple. Compare this with the corresponding probability for the 

chart with 3-sigma limits and discuss. Comment on when 

you would prefer to use 2-sigma limits instead of 3-sigma 

limits. 

16-56. Consider the X control chart with 2-sigma limits in 

Exercise 16-50. 

(a) Find the probability of no signal on the first sample but a 
signal on the second. 

(b) What is the probability that there will not be a signal in 
three samples? 


16-57. Suppose a process has a PCR = 2, but the mean is 
exactly three standard deviations above the upper specifica- 
tion limit. What is the probability of making a product outside 
the specification limits? 


16-58. Consider the hardness measurement data in Exercise 
16-9. Set up a CUSUM scheme for this process using u = 50 
ando = 2,sothat K = 1 and H = 10. Is the process in control? 


16-59. Consider the data in Exercise 16-10. Set up a 
CUSUM scheme for this process assuming that ju = 80 is the 
process target. Explain how you determined your estimate of 
o and the CUSUM parameters K and H. 

16-60. Reconsider the data in Exercise 16-12. Construct a 
CUSUM control chart for this process using jug = 500 as the 


process target. Explain how you determined your estimate of 
c and the CUSUM parameters H and K. 


MIND-EXPANDING EXERCISES 


16-61. Suppose a process is in control, and 3-sigma 
control limits are in use on the X chart. Let the mean 
shift by 1.50. What is the probability that this shift will 
remain undetected for three consecutive samples? What 
would its probability be if 2-sigma control limits were 
used? The sample size is 4. 


16-62. Consider an X control chart with k-sigma con- 
trol limits. Develop a general expression for the proba- 
bility that a point will plot outside the control limits 
when the process mean has shifted by 6 units from the 
center line. 


16-63. Suppose that an X chart is used to control a 
normally distributed process and that samples of size n 
are taken every n hours and plotted on the chart, which 
has k-sigma limits. 

(a) Find a general expression for the expected number 
of samples and time that will be taken until a false 
action signal is generated. 

(b) Suppose that the process mean shifts to an out-of- 
control state, say u; = jug + do. Find an expres- 
sion for the expected number of samples that 
will be taken until a false action is generated. 
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MIND-EXPANDING EXERCISES 


(c) Evaluate the in-control ARL for k = 3. How, does 
this change if k = 2? What do you think about the 
use of 2-sigma limits in practice? 

(d) Evaluate the out-of-control ARL for a shift of 1 
sigma, given that n = 5. 

16-64. Suppose a P chart with center line at p with k- 

sigma control limits is used to control a process. There is 

a critical fraction defective p, that must be detected with 

probability 0.50 on the first sample following the shift to 

this state. Derive a general formula for the sample size 
that should be used on this chart. 


16-65. Suppose that a P chart with center line at p and 
k-sigma control limits is used to control a process. What 
is the smallest sample size that can be used on this control 
chart to ensure that the lower control limit is positive? 


16-66. A process is controlled by a P chart using sam- 
ples of size 100. The center line on the chart is 0.05. 
What is the probability that the control chart detects a 
shift to 0.08 on the first sample following the shift? 
What is the probability that the shift is detected by at 
least the third sample following the shift? 


16-67. Consider a process where specifications on a 
quality characteristic are 100 — 15. We know that the stan- 
dard deviation of this normally distributed quality charac- 
teristic is 5. Where should we center the process to mini- 
mize the fraction defective produced? Now suppose the 
mean shifts to 105 and we are using a sample size of 4 on 
an X chart. What is the probability that such a shift will be 
detected on the first sample following the shift? What is the 
average number of samples until an out-of-control point 
occurs? Compare this result to the average number of ob- 
servations until a defective occurs (assuming normality). 


16-68. The NP Control Chart. An alternative to the 
control chart for fraction defective 1s a control chart based 
on the number of defectives, or the VP control chart. The 
chart has centerline at np, and the control limits are 


UCL = np + 3V np(1 — p) 
LCL = npm DUE) 
and the number of defectives for each sample is plotted 


on the chart. 
(a) Verify that the control limits given above are correct. 


Sample, i 1 D 3 4 
n; 20 25 20 25 
Pi 0.2 0.16 0.25 0.08 


(b) Apply this control chart to the data in Example 16-4. 
(c) Will this chart always provide results that are equiv- 
alent to the usual P chart? 


16-69. The EWMA Control Chart. The exponen- 
tially weighted moving average (or EWMA) is defined 
as follows: 


Z,=AX,+ (1—A)Z., 


where 0 < X < 1, and the starting value of the EWMA at 
time ¢ = 0 is Zo = wo (the process target). An EWMA 
control chart is constructed by plotting the Z, values on a 
chart with center line at pọ and appropriate control limits. 
(a) Verify that E(Z,) = po 

(b) Let o2 be V(Zj), and show that 


n= Sn -a-am 


(c) Use the results of part (b) to determine the control 
limits for the EWMA chart. 

(d) As A —> 1, the EWMA control chart should perform 
like a standard Shewhart X chart. Do you agree with 
this statement? Why? 

(e) As A — 0, the EWMA control chart should perform 
like a CUSUM. Provide an argument as to why this 
1S SO. 

(f) Apply this procedure to the data in Example 16-2. 

16-70. Standardized Control Chart. Consider the P 

chart with the usual 3-sigma control limits. Suppose that 

we define a new variable: 


Ê, — P 
P(1- P) 
n 


Zi = 


as the quantity to plot on a control chart. It is proposed that 
this new chart will have a center line at O with the upper 
and lower control limits at +3. Verify that this standard- 
ized control chart will be equivalent to the original p chart. 
16-71. Unequal Sample Sizes. One application of the 
standardized control chart introduced in Exercise 16-70 
is to allow unequal sample sizes on the control chart. 
Provide details concerning how this procedure would be 
implemented and illustrate using the following data: 


5 6 7 8 9 10 
50 30 25 25 25 20 
0.3 0.1 SETA 0.16 (IP 0:15 
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IMPORTANT TERMS AND CONCEPTS 


In the E-book, click on any 
term or concept below to 
go to that subject. 

ARL 

Assignable causes 

Attributes control 
charts 

Average run length 

C chart 

Cause-and-effect 
diagram 

Center line 

Chance causes 

Control chart 


Control limits 
Cumulative sum control 
chart 
Defect concentration 
diagram 
Defects-per-unit chart 
Deming's 14 points 
False alarm 
Fraction-defective 
control chart 
Implementing SPC 
Individuals control 
chart 
Moving range 


NP chart 

P chart 

Pareto diagram 

PCR 

PCR, 

Problem-solving 
tools 

Process capability 

Process capability 
ratio 

Quality control 

R chart 

Rational subgroup 

Run rule 


S chart 

Shewhart control chart 

Six-sigma process 

Specification limits 

Statistical process 
control 

Statistical quality 
control 

U chart 

V mask 

Variables control charts 

Warning limits 

Western Electric rules 

X chart 
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Table I Summary of Common Probability Distributions 


Name 


Discrete 
Uniform 


Binomial 


Geometric 


Negative binomial 


Hypergeometric 


Poisson 


Continuous 


Uniform 


Normal 


Exponential 
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Weibull 


Lognormal 
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Table II Cumulative Standard Normal Distribution 

Z — 0.09 — 0.08 — 0.07 — 0.06 —0.05 — 0.04 20:03 —0.02 —0.01 — 0.00 
—3.9 0.000033 0.000034 0.000036 0.000037 0.000039 0.000041 0.000042 0.000044 0.000046 0.000048 
—3.8 0.000050 0.000052 0.000054 0.000057 0.000059 0.000062 0.000064 0.000067 0.000069 0.000072 
—3.7 0.000075 0.000078 0.000082 0.000085 0.000088 0.000092 0.000096 0.000100 0.000104 0.000108 
—3.6 0.000112 0.000117 0.000121 0.000126 0.000131 0.000136 0.000142 0.000147 0.000153 0.000159 
—3.5 0.000165 0.000172 0.000179 0.000185 0.000193 0.000200 0.000208 0.000216 0.000224 0.000233 
—3.4 0.000242 0.000251 0.000260 0.000270 0.000280 0.000291 0.000302 0.000313 0.000325 0.000337 
—3.3 0.000350 0.000362 0.000376 0.000390 0.000404 0.000419 0.000434 0.000450 0.000467 0.000483 
—3.2 0.000501 0.000519 0.000538 0.000557 0.000577 0.000598 0.000619 0.000641 0.000664 0.000687 
—3.] 0.000711 0.000736 0.000762 0.000789 0.000816 0.000845 0.000874 0.000904 0.000935 0.000968 
—3.0 0.001001 0.001035 0.001070 0.001107 0.001144 0.001183 0.001223 0.001264 0.001306 0.001350 
—2.9 0.001395 0.001441 0.001489 0.001538 0.001589 0.001641 0.001695 0.001750 0.001807 0.001866 
—2.8 0.001926 0.001988 0.002052 0.002118 0.002186 0.002256 0.002327 0.002401 0.002477 0.002555 
—2.7 0.002635 0.002718 0.002803 0.002890 0.002980 0.003072 0.003167 0.003264 0.003364 0.003467 
—2.6 0.003573 0.003681 0.003793 0.003907 0.004025 0.004145 0.004269 0.004396 0.004527 0.004661 
—2.5 0.004799 0.004940 0.005085 0.005234 0.005386 0.005543 0.005703 0.005868 0.006037 0.006210 
—2.4 0.006387 0.006569 0.006756 0.006947 0.007143 0.007344 0.007549 0.007760 0.007976 0.008198 
—2.3 0.008424 0.008656 0.008894 0.009137 0.009387 0.009642 0.009903 0.010170 0.010444 0.010724 
—2.2 0.011011 0.011304 0.011604 0.011911 0.012224 0.012545 0.012874 0.013209 0.013553 0.013903 
—2.1 0.014262 0.014629 0.015003 0.015386 0.015778 0.016177 0.016586 0.017003 0.017429 0.017864 
—2.0 0.018309 0.018763 0.019226 0.019699 0.020182 0.020675 0.021178 0.021692 0.022216 0.022750 
—].9 0.023295 0.023852 0.024419 0.024998 0.025588 0.026190 0.026803 0.027429 0.028067 0.028717 
—].8 0.029379 0.030054 0.030742 0.031443 0.032157 0.032884 0.033625 0.034379 0.035148 0.035930 
—].7 0.036727 0.037538 0.038364 0.039204 0.040059 0.040929 0.041815 0.042716 0.043633 0.044565 
—].6 0.045514 0.046479 0.047460 0.048457 0.049471 0.050503 0.051551 0.052616 0.053699 0.054799 
—].5 4 0.055917 0.057053 0.058208 0.059380 0.060571 0.061780 0.063008 0.064256 0.065522 0.066807 
—].4 0.068112 0.069437 0.070781 0.072145 0.073529 0.074934 0.076359 0.077804 0.079270 0.080757 
—].3 0.082264 0.083793 0.085343 0.086915 0.088508 0.090123 0.091759 0.093418 0.095098 0.096801 
—].2 0.098525 0.100273 0.102042 0.103835 0.105650 0.107488 0.109349 0.111233 0.113140 0.115070 
—1.1 0.117023 0.119000 0.121001 0.123024 0.125072 0.127143 0.129238 0.131357 0.133500 0.135666 
—].0 0.137857 0.140071 0.142310 0.144572 0.146859 0.149170 0.151505 0.153864 0.156248 0.158655 
—0.9 0.161087 0.163543 0.166023 0.168528 0.171056 0.173609 0.176185 0.178786 0.181411 0.184060 
—0.8 0.186733 0.189430 0.192150 0.194894 0.197662 0.200454 0.203269 0.206108 0.208970 0.211855 
—0.7 0.214764 0.217695 0.220650 0.223627 0.226627 0.229650 0.232695 0.235762 0.238852 0.241964 
—0.6 0.245097 0.248252 0.251429 0.254627 0.257846 0.261086 0.264347 0.267629 0.270931 0.274253 
—0.5 0.277595 0.280957 0.284339 0.287740 0.291160 0.294599 0.298056 0.301532 0.305026 0.308538 
—0.4 0.312067 0.315614 0.319178 0.322758 0.326355 0.329969 0.333598 0.337243 0.340903 0.344578 
—0.3 0.348268 0.351973 0.355691 0.359424 0.363169 0.366928 0.370700 0.374484 0.378281 0.382089 
—0.2 0.385908 0.389739 0.393580 0.397432 0.401294 0.405165 0.409046 0.412936 0.416834 0.420740 
—0.1 0.424655 0.428576 0.432505 0.436441 0.440382 0.444330 0.448283 0.452242 0.456205 0.460172 

0.0 0.464144 0.468119 0.472097 0.476078 0.480061 0.484047 0.488033 0.492022 0.496011 0.500000 


654 


APPENDIX A 


STATISTICAL TABLES AND CHARTS 


Table II Cumulative Standard Normal Distribution (continued) 


Z 


0.0 
0.1 
0.2 
0.5 
0.4 
0.5 
0.6 
0.7 
0.8 
0.9 
1.0 
1.1 
12 
1.3 
1.4 
1:5 
1.6 
1.7 
1.8 
1;9 
2.0 
2l 
2. 
2:9 
2.4 
2,9 
2.6 
2.1 
2:5 
2.9 
3.0 
3.1 
22 
33 
3.4 
3.5 
3.6 
9o 
3.8 
3.9 


0.00 


0.500000 
0.539828 
0.579260 
0.617911 
0.655422 
0.691462 
0.725747 
0.758036 
0.788145 
0.815940 
0.841345 
0.864334 
0.884930 
0.903199 
0.919243 
0.953195 
0.945201 
0.955435 
0.964070 
0.971283 
0.977250 
0.982136 
0.986097 
0.989276 
0.991802 
0.993790 
0.995339 
0.996533 
0.997445 
0.998134 
0.998650 
0.999032 
0.999313 
0.999517 
0.999663 
0.999767 
0.999841 
0.999892 
0.999928 
0.999952 


0.01 


0.503989 
0.543795 
0.583166 
0.621719 
0.659097 
0.694974 
0.729069 
0.761148 
0.791030 
0.818589 
0.843752 
0.866500 
0.886860 
0.904902 
0.920730 
0.934478 
0.946301 
0.956367 
0.964852 
0.971933 
0.977784 
0.982571 
0.986447 
0.989556 
0.992024 
0.993963 
0.995473 
0.996636 
0.997523 
0.998193 
0.998694 
0.999065 
0.999336 
0.999533 
0.999675 
0.999776 
0.999847 
0.999896 
0.99993] 
0.999954 


0.02 


0.507978 
0.547758 
0.587064 
0.625516 
0.662757 
0.698468 
0.73237] 
0.764238 
0.793892 
0.821214 
0.846136 
0.868643 
0.888767 
0.906582 
0.922196 
0.935744 
0.947384 
0.957284 
0.965621 
0.972571 
0.978308 
0.982997 
0.986791 
0.989830 
0.992240 
0.994132 
0.995604 
0.996736 
0.997599 
0.998250 
0.998736 
0.999096 
0.999359 
0.999550 
0.999687 
0.999784 
0.999853 
0.999900 
0.999933 
0.999956 


0.03 


0.511967 
0.551717 
0.590954 
0.629300 
0.666402 
0.701944 
0.735653 
0.767305 
0.796731 
0.823815 
0.848495 
0.870762 
0.890651 
0.908241 
0.923641 
0.936992 
0.948449 
0.958185 
0.966375 
0.973197 
0.978822 
0.983414 
0.987126 
0.990097 
0.992451 
0.994297 
0.995751 
0.996833 
0.997673 
0.998305 
0.998777 
0.999126 
0.999381 
0.999566 
0.999698 
0.999792 
0.999858 
0.999904 
0.999936 
0.999958 


0.04 


0.515953 
0.555760 
0.594835 
0.633072 
0.670031 
0.705401 
0.738914 
0.770350 
0.799546 
0.826391 
0.850830 
0.872857 
0.892512 
0.909877 
0.925066 
0.938220 
0.949497 
0.959071 
0.967116 
0.973810 
0.979525 
0.983823 
0.987455 
0.990358 
0.992656 
0.994457 
0.995855 
0.996928 
0.997744 
0.998359 
0.998817 
0.999155 
0.999402 
0.999581 
0.999709 
0.999800 
0.999864 
0.999908 
0.999938 
0.999959 





0.05 


0.519959 
0.559618 
0.598706 
0.636831 
0.673645 
0.708840 
0.742154 
0.773373 
0.802338 
0.828944 
0.853141 
0.874928 
0.894350 
0.911492 
0.926471 
0.939429 
0.950529 
0.959941 
0.967843 
0.974412 
0.979818 
0.984222 
0.987776 
0.990613 
0.992857 
0.994614 
0.995975 
0.997020 
0.997814 
0.998411 
0.998856 
0.999184 
0.999423 
0.999596 
0.999720 
0.999807 
0.999869 
0.999917 
0.99994] 
0.999961 


0.06 


0.532922 
0.563559 
0.602568 
0.640576 
0.677242 
0.712260 
0.745373 
0.776373 
0.805106 
0.831472 
0.855428 
0.876976 
0.896165 
0.913085 
0.927855 
0.940620 
0.951543 
0.960796 
0.968557 
0.975002 
0.980301 
0.984614 
0.988089 
0.990863 
0.993053 
0.994766 
0.996093 
0.997110 
0.997882 
0.998462 
0.998893 
0.999211 
0.999443 
0.999610 
0.999730 
0.999815 
0.999874 
0:999915 
0.999943 
0.999963 


0.07 


0.527903 
0.567495 
0.606420 
0.644309 
0.680822 
0.715661 
0.748571 
0.779350 
0.807850 
0.833977 
0.857690 
0.878999 
0.897958 
0.914657 
0.929219 
0.941792 
0.952540 
0.961636 
0.969258 
0.975581 
0.980774 
0.984997 
0.988396 
0.991106 
0.993244 
0.994915 
0.996207 
0.997197 
0.997948 
0.998511 
0.998930 
0.999238 
0.999462 
0.999624 
0.999740 
0.999821 
0.999879 
0.999918 
0.999946 
0.999964 


0.08 


0.531881 
0.571424 
0.610261 
0.648027 
0.684386 
0.719043 
0.751748 
0.782305 
0.810570 
0.836457 
0.859929 
0.881000 
0.899727 
0.916207 
0.930563 
0.942947 
0.953521 
0.962462 
0.969946 
0.976148 
0.981237 
0.985371 
0.988696 
0.991344 
0.993431 
0.995060 
0.996319 
0.997252 
0.998012 
0.998559 
0.998965 
0.999264 
0.999481 
0.999638 
0.999749 
0.999828 
0.999883 
0.999922 
0.999948 
0.999966 


0.09 


0.535856 
0.575345 
0.614092 
0.651732 
0.687933 
0.722405 
0.754903 
0.785236 
0.813267 
0.838913 
0.862143 
0.882977 
0.901475 
0.917736 
0.931888 
0.944083 
0.954486 
0.963273 
0.970621 
0.976705 
0.981691 
0.985738 
0.988989 
0.991576 
0.993613 
0.995201 
0.996427 
0.997365 
0.998074 
0.998605 
0.998999 
0.999289 
0.999499 
0.999650 
0.999758 
0.999835 
0.999888 
0.999925 
0.999950 
0.999967 


Table III Percentage Points x; , of the Chi-Squared Distribution 


v a .995 
l .00+ 
2 .01 
3 .07 
4 21 
5 41 
6 .68 
7 .99 
8 1.34 
9 1.73 

10 2.16 
11 2.60 
12 3.07 
13 3:94 
14 4.07 
15 4.60 
16 5.14 
17 5.70 
18 6.26 
19 6.84 
20 7.43 
21 8.03 
22 8.64 
23 9.26 
24 9.89 
23 10.52 
26 11.16 
27 11.81 
28 12.46 
29 13.12 
30 13.79 
40 20.71 
50 27.99 
60 35.53 
70 43.28 
80 51.17 
90 59.20 
100 67.33 


v = degrees of freedom. 


.990 


.00+ 
.02 
al 
30 
29 
.87 
1.24 
1.65 
2.09 
2.56 
3.05 
3.5y 
4.11 
4.66 
3.23 
5.61 
6.41 
7.01 
7.63 
8.26 
8.90 
9.54 
10.20 
10.86 
11.52 
12.20 
12.88 
15:57 
14.26 
14.95 
22.16 
29.71 
37.48 
45.44 
53.54 
61.75 
70.06 


975 


00+ 
.05 
al 
48 
.83 
1.24 
1.69 
2.18 
2.70 
3.25 
3.02 
4.40 
5.01 
5.63 
6.27 
6.91 
7.56 
8.23 
8.91 
9.59 
10.28 
10.98 
11.69 
12.40 
13.12 
13.84 
14.57 
15.31 
16.05 
16.79 
24.43 
32.36 
40.48 
48.76 
34.15 
65.65 
74.22 


.950 


.00 + 


.10 
39 
71 
1.15 
1.64 
2l 
2.419 
3.33 
3.94 
4.57 
35.29 
3.09 
6.57 
7.26 
7.96 
8.67 
9.39 
10.17 
10.85 
11.59 
12.34 
15:09 
13.85 
14.61 
15.38 
16.15 
16.93 
LEM 
18.49 
26.51 
34.76 
43.19 
51.74 
60.39 
69.13 
77.93 


900 


.02 
SA 
.58 
1.06 
1.61 
2.20 
2,53 
3.49 
4.17 
4.87 
5.58 
6.30 
7.04 
1:19 
8.55 
9.51 
10.09 
10.87 
11.65 
12.44 
13.24 
14.04 
14.85 
15.66 
16.47 
17.29 
18.11 
18.94 
19.77 
20.60 
29.05 
37.69 
46.46 
55:59 
64.28 
419.29 
82.36 


.500 


.45 
1.39 
2:94 
3.36 
4.35 
3.95 
6.35 
7.34 
8.34 
9.34 

10.34 
11.34 
12.34 
13.34 
14.34 
15.34 
16.34 
17.34 
18.34 
19.34 
20.34 
21.34 
22.34 
23.34 
24.34 
25.34 
26.34 
27.34 
28.34 
29.34 
39.34 
49.33 
59.33 
69.33 
79.33 
59.35 
99.33 


.100 


Zu 
4.61 
6.25 
7.78 
9.24 
10.65 
12.02 
13.36 
14.68 
[5:99 
17.28 
18.55 
19.81 
21.06 
22.31 
23.54 
24.77 
23.99 
27.20 
28.41 
29.62 
30.81 
32.01 
33.20 
34.28 
35.56 
36.74 
37.92 
39.09 
40.26 
51.81 
63.17 
74.40 
53.59 
96.58 
107.57 
118.50 


.050 


3.84 
5:99 
7.81 
9.49 
11.07 
12.59 
14.07 
15.51 
16.92 
18.31 
19.68 
21.03 
22.36 
23.68 
25.00 
26.30 
21.59 
28.87 
30.14 
31.41 
32.07 
35,97 
35.17 
36.42 
37.65 
38.89 
40.11 
41.34 
42.56 
43.77 
55.76 
67.50 
79.08 
90.53 
101.88 
113.14 
124.34 


APPENDIX A 
025 .010 
5.02 6.63 
7.38 9:21 
9.35 11.34 
11.14 13.28 
12.83 15.09 
14.45 16.81 
16.01 18.48 
17.59 20.09 
19.07 21.67 
20.48 Z2 21 
21.92 24.72 
23.34 26.22 
24.74 27.69 
26.12 29.14 
27.49 30.58 
28.85 32.00 
30.19 33.41 
31.53 34.81 
32.85 36.19 
34.17 20593 
35.48 38.93 
36.78 40.29 
38.08 41.64 
39.36 42.98 
40.65 44.3] 
41.92 45.64 
43.19 46.96 
44.46 48.28 
45.72 49.59 
46.98 50.89 
59.34 63.69 
71.42 76.15 
83.30 88.38 
95.02 100.42 
106.63 112.33 
118.14 124.12 
129.56 135.61 


655 


.005 


7.88 
10.60 
12.84 
14.86 
16.75 
18.55 
20.28 
21.96 
23.99 
25.19 
26.76 
28.30 
29.82 
31.32 
32.80 
34.27 
35.72 
37.16 
38.58 
40.00 
41.40 
42.80 
44.18 
45.56 
46.93 
48.29 
49.65 
50.99 
52.34 
33.67 
66.77 
79.49 
91.95 

104.22 
116.32 
128.30 
140.17 


656 APPENDIX A STATISTICAL TABLES AND CHARTS 


0 Lo, v 
Table IV Percentage Points t, , of the t-Distribution 

v d .A0 25 .10 .05 .025 .01 .005 .0025 .001 .0005 

1 20 1.000 3.078 6.314 12.706 31.821 63.657 127.32 318.31 636.62 
2 .289 .816 1.886 2.920 4.303 6.965 9.925 14.089 23.326 31.598 
3 2I .765 1.638 2.353 3.182 4.541 5.841 7.453 10.213 12.924 
4 .271 .741 1.533 2.132 2.116 3.747 4.604 5.598 TADS 8.610 
5 .267 12.7 1.476 2.015 2.941 3.365 4.032 4.773 5.893 6.869 
6 .265 .718 1.440 1.943 2.447 3.143 3.707 4.317 5.208 5.959 
7 .263 711 1.415 1.895 2.365 2.998 3.499 4.029 4.785 5.408 
8 .262 .706 1.397 1.860 2.306 2.896 3.355 3.833 4.501 5.041 
9 .261 .703 1.383 1.833 2.262 2.821 3.250 3.690 4.297 4.781 
10 .260 .700 1,572 1.812 2.228 2.764 3.169 3.581 4.144 4.587 
11 .260 .697 1.363 1.796 2.201 2.718 3.106 3.497 4.025 4.437 
12 .259 .695 1.356 1.782 2.179 2.681 3.055 3.428 3.930 4.318 
13 .259 .694 1:350 1.771 2.160 2.650 3.012 3.372 3.852 4.221 
14 .258 .692 1.345 1.761 2.145 2.624 2.977 3.326 3.787 4.140 
I5 .258 .691 1.341 1.753 2.131 2.602 2.947 3.286 3.733 4.073 
16 .258 .690 1.337 1.746 2.120 2.583 2.921 3.292 3.686 4.015 
17 PeT .689 1.333 1.740 2.110 2.567 2.898 3.222 3.646 3.965 
18 29] .688 1.330 1.734 2.101 2532 2.878 3.197 3.610 3.922 
19 29l .688 1.328 1.729 2.093 2.539 2.861 3.174 3.579 3.883 
20 251 .687 1.325 1.725 2.086 2.528 2.845 3.153 3.552 3.850 
21 ol .686 1.323 1:721 2.080 2.518 2.831 3.135 3,021 3.819 
22 .256 .686 1.321 1.717 2.074 2.508 2.819 3.119 3.505 3.792 
23 .256 .685 1.319 1.714 2.069 2.500 2.807 3.104 3.485 3.767 
24 .256 .685 1.318 1.711 2.064 2.492 2.797] 3.091 3.467 3.745 
25 .256 .684 1.316 1.708 2.060 2.485 2.191 3.078 3.450 3.725 
26 .256 .684 1.315 1.706 2.056 2.479 2.779 3.067 3.435 3.707 
2 .256 .684 1.314 1.703 2.052 2.473 2.111 3.057 3.42] 3.690 
28 .256 .683 1.313 1.701 2.048 2.467 2.763 3.047 3.408 3.674 
29 .256 .683 1.311 1.699 2.045 2.462 2.756 3.038 3.396 3.659 
30 .256 .683 1.310 1.697 2.042 2.457 2.750 3.030 3.385 3.646 
40 .255 .681 1.303 1.684 2.021 2.423 2.704 2.971 3.307 3.551 
60 .254 .679 1.296 1.671 2.000 2.390 2.660 2.915 3.232 3.460 
120 .254 .677 1.289 1.658 1.980 2.358 2.617 2.860 3.160 3.373 
oo 259 .674 1.282 1.645 1.960 2.326 2.016 2.807 3.090 3.29] 


v — degrees of freedom. 


LSO 


fa, V4 Vo 
Table V Percentage Points fà», of the F-Distribution 


fo25. 


Degrees of freedom for the numerator (v,) 
v l 2 3 4 5 6 7 8 9 10 12 15 


5.83 7.50 8.20 8.58 8.82 8.98 9.10 9.19 9.26 9.32 9.41 9.49 
221 3.00 315 3:29 3.28 3:31 3.34 3195 A37 3.95 3.39 3.41 
2.02 2.28 2.36 2.39 2.41 2.42 2.43 2.44 2.44 2.44 2.45 2.46 
1.81 2.00 2.05 2.06 2.07 2.08 2.08 2.08 2.08 2.08 2.08 2.08 
1.69 1.85 1.88 1.89 1.89 1.89 1.89 1.89 1.89 1.89 1.89 1.89 
1.62 1.76 1.78 1.79 1.79 1.78 1.78 1.78 1-37 LT77 L3 1.76 
1:37 1.70 1.72 1.72 LI LI 1.70 1.70 1.70 1.69 1.68 1.68 
1.54 1.66 1.67 1.66 1.66 1.65 1.64 1.64 1.63 1.63 1.62 1.62 
9 1.51 1.62 1.63 1.63 1.62 1.61 1.60 1.60 1.59 1.59 1.58 1:57 
10 1.49 1.60 1.60 1.59 1.59 1.58 1:57 1.56 1.56 1.55 1.54 1.53 
11 1.47 1.58 1.58 1:57 1.56 1.55 1.54 1:53 1.53 1:52 1.51 1.50 
12 1.46 1.56 1.56 1:55 1.54 1.53 1:52 1.51 1.51 1.50 1.49 1.48 
13 1.45 1.53 1,35 1:55 1.22 151 1.50 1.49 1.49 1.48 1.47 1.46 
14 1.44 1:22 1:53 1:52 L1 1.50 1.49 1.48 1.47 1.46 1.45 1.44 
15 1.43 1.32 1:52 1:51 1.49 1.48 1.47 1.46 1.46 1.45 1.44 1.43 
16 1.42 Ld LSI 1.50 1.48 1.47 1.46 1.45 1.44 1.44 1.43 1.41 
17 1.42 L5 1.50 1.49 1.47 1.46 1.45 1.44 1.43 1.43 1.41 1.40 
18 1.41 1.50 1.49 1.48 1.46 1.45 1.44 1.43 1.42 1.42 1.40 1:39 
1.41 1.49 1.49 1.47 1.46 1.44 1.43 1.42 1.41 1.41 1.40 1.38 
20 1.40 1.49 1.48 1.47 1.45 1.44 1.43 1.42 1.41 1.40 1:39 1.37 
21 1.40 1.48 1.48 1.46 1.44 1.43 1.42 1.41 1.40 1.39 1.38 1.57 
22 1.40 1.48 1.47 1.45 1.44 1.42 1.41 1.40 1.39 1.39 1.57 1.36 
23 1:39 1.47 1.47 1.45 1.43 1.42 1.41 1.40 1.39 1.38 1.57 1,35 
24 1.39 1.47 1.46 1.44 1.43 1.41 1.40 1.39 1.38 1:38 1.36 1.35 
25 1.39 1.47 1.46 1.44 1.42 1.41 1.40 1.39 1.38 1.37 1.36 1.34 
26 1.38 1.46 1.45 1.44 1.42 1.41 1.39 1.38 Re L37 1:35 1.34 
27 1.38 1.46 1.45 1.43 1.42 1.40 1:39 1.38 1:37 1.36 1:35 1,33 
28 1.38 1.46 1.45 1.43 1.41 1.40 1.39 1.38 1:37 1.36 1.34 1.33 
29 1.38 1.45 1.45 1.43 1.41 1.40 1.38 1.37 1.36 1.35 1.34 1352 
30 1.38 1.45 1.44 1.42 1.41 1.39 1,38 1.37 1.36 1.35 1.34 1.32 
40 1.36 1.44 1.42 1.40 139 1.37 1.36 1135 1.34 L39 1:31 1:30 
60 er) 1.42 1.41 1.38 1.37 1,35 1.39 1.32 Lai 1.30 1.29 1.27 
120 1.34 1.40 1,29 1:37 1:35 1.33 lai 1.30 1:29 1.28 1.26 1.24 

oo 1.32 1,39 1:37 1.35 1.33 L3 1:29 1.28 127 1:23 1.24 1.22 


co —-109 tA BWN — 


Degrees of freedom for the denominator (v;) 
o 


20 


9,58 
3.43 
2.46 
2.08 
1.88 
1.76 
1.67 
1.61 
1.56 
1.52 
1.49 
1.47 
1.45 
1.43 
1.41 
1.40 
1.39 
1.38 
1:37 
1.36 
1.35 
1.34 
1.34 
1:25 
1.33 
132 
1.22 
lal 
1.31 
1.30 
1:28 
125 
1.22 
1.19 


24 


9.65 
3.43 
2.46 
2.08 
1.88 
1.45 
1.67 
1.60 
1.56 
1:37 
1.49 
1.46 
1.44 
1.42 
1.41 
1.39 
1.35 
1.37 
1.36 
1.35 
1.34 
1:33 
1.33 
1:22 
132 
1.31 
1.31 
1.30 
1.30 
1.29 
1.26 
1.24 
1,21 
1.18 


30 


9.67 
3.44 
2.47 
2.08 
1.88 
1.75 
1.66 
1.60 
1.55 
ISi 
1.48 
1.45 
1.43 
1.41 
1.40 
1.38 
1:37 
1.36 
1:35 
1.34 
1.33 
1,32 
1:52 
1.31 
L3 
1.30 
1.30 
1.29 
1:29 
1.28 
1:25 
1.22 
TL. I9 
1.16 


40 


2,71 
3.45 
2.47 
2.08 
1.88 
L73 
1.66 
1,59 
1.54 
1.51 
1.47 
1.45 
1.42 
1.41 
1.39 
1.37 
1.36 
1.35 
1.34 
193 
1,22 
1.31 
1.31 
1.30 
1.29 
1.29 
1.28 
1.28 
1:27 
L27 
1.24 
121 
1.18 
1.14 


60 


9.16 
3.46 
2.47 
2.08 
1.87 
1.74 
1.65 
1.59 
1.54 
1.50 
1.47 
1.44 
1.42 
1.40 
1.38 
1.36 
1.35 
1.34 
1.35 
1.32 
Lol 
1.30 
1.30 
1:29 
1.28 
1.25 
1.27 
1:27 
1.26 
1:20 
1:22 
1:19 
1.16 
1.12 


120 


9.80 
3.47 
2.47 
2.08 
1.87 
1.74 
1.65 
1.58 
1:53 
1.49 
1.46 
1.43 
1.41 
1.39 
1:37 
1.35 
1.34 
1.33 
1.22 
1.31 
1.30 
1.29 
1.28 
1.28 
1.27 
1.26 
1.26 
k23 
1.25 
1.24 
L2 
TT 
1.13 
1.08 


869 


Table V Percentage Points of the F-Distribution (continued) 


l 39.86 49.50 53.59 55.83 57.24 58.20 58.91 59.44 
2 8.53 9.00 9.16 9.24 9.29 9.39 9.33 9.37 
3 5.54 5.46 5.39 5.34 5.91 5.28 3.27 229 
4 4.54 4.32 4.19 4.11 4.05 4.01 3.98 3.95 
5 4.06 3.78 3.62 3.22 3.45 3.40 3.97 3.34 
6 3.78 3.46 3.29 3.18 IA 3.05 3.01 2.98 
7 3.99 3.26 3.07 2,96 2.88 2,83 2:158 2.19 
8 3.46 3.H 2.92 2.61 2.73 2.67 2.02 2:59 
9 3190 3.01 2.8] 2.69 2.61 2i 2:91 2.47 
10 3.29 2192 219 2.61 Dus 2.46 2.41 2.38 


~ i 3.23 286 266 2.54 245 239 234 2.30 
£ 12 3.18 281 261 248 2.39 2.33 2.28 2.24 
E 13 3.14 276 256 9436 235 228 223 259 
S 14 310 2.73 252 239 231 224 219 2.15 
* 15 307 290. 2490 936 227 9251 216 Xi 
8 16 305 2.67 246 2.33 2.24 218 2.13 2.09 
S 17 303 2.64 244 2.31 222 215 210 2.06 
E 18 3.01 2.62 242 2.29 220 2.13 2.08 2.04 
Ş 19 299 261 240 227 218 211 206 2.02 
E 20 2.97 2.9 238 225 216 209 204 2.00 
= 2 2.96 2.57 236 223 214 208 202 1.98 
2 22 2.95 256 235 2.22 213 2.06 201 1.97 
2 23 294 255 234 221 2.11 205 199 195 


24 2.93 2.54 233 219 210 2.04 198 1.94 
25 290 2.53 232 218 2.09 202 197 1.93 
26 291 2.52 231 217 208 201 196 192 
27 290 2.51 230 217 207 200 195 191 
28 2.89 2.50 2.29 2.16 206 200 194 1.90 
29 2.89 2.50 228 215 2.06 199 193 1.89 
30 2.88 2.49 228 2.14 2.03 1.98 193 1.88 
40 284 244 223 2.09 200 193 187  L83 
60 2.79 2.39 218 2.04 1.95 187 182 177 

120 275 2.35 213 41.99 190 182 177 1.72 

oc 271 2.30 2.08 1.94 185 177 172 167 


Joioxs; 


Degrees of freedom for the numerator (v,) 
V5 1 2 3 4 5 6 7 8 9 10 2 15 


59.86 
9.38 
5.24 
3.94 
23:22 
2.96 
2.12 
2.56 
2.44 
2.25 
2.247 
2.2] 
2.10 
2:12 
2.09 
2.06 
2.03 
2.00 
1.98 
1.96 
1.95 
1.93 
1:92 
1:91 
1.89 
1.88 
1.87 
1.87 
1.86 
1.85 
1:79 
1.74 
1.68 
1.63 


60.19 
9.39 
2023 
3.97 
3.30 
2.94 
2.70 
2.54 
2.42 
2.32 
2.25 
2.19 
2.14 
2.10 
2.06 
2.03 
2.00 
1.98 
1.96 
1.94 
1.92 
1.90 
1.89 
1.88 
1.87 
1.86 
1.35 
1.84 
1.83 
1.82 
1.76 
Ly 
1.65 
1.60 


60.71 
9.41 
22 
3.90 
3.27 
2.90 
2467 
2.50 
2.38 
2.28 
2.2] 
2,15 
2.10 
2.05 
2.02 
1,99 
1.96 
1.93 
1.91 
1.89 
1.87 
1.86 
1.84 
1.83 
1.82 
1.81 
1.80 
1:79 
1.78 
1.77 
L7 
1.66 
1.60 
1,35 


61.22 
9.42 
5.20 
3.87 
3.24 
2.87 
2.63 
2.46 
2.34 
2.24 
247 
2.10 
2.05 
2.01 
1.97 
1.94 
1.91 
1.89 
1.86 
1.84 
1.83 
1.81 
1.80 
1.78 
1.77 
1.76 
1.75 
1.74 
1.73 
1.72 
1.66 
1.60 
1.55 
1.49 


61.74 
9.44 
5.18 
3.84 
351 
2.84 
2.59 
2.42 
2.30 
2.20 
2:39 
2.06 
2.01 
1.96 
1.92 
1.89 
1.86 
1.84 
1.81 
1.79 
1.78 
1.76 
1.74 
1.73 
i72 
1.71 
1.70 
1.69 
1.68 
1.67 
1.61 
1.54 
1.48 
1.42 


62.00 
9.45 
5.18 
3.83 
3.19 
2.82 
2.58 
2.40 
228 
2.18 
2.10 
2.04 
1.95 
1.94 
1.90 
1.87 
1.84 
1.81 
1.79 
LTT 
1.75 
1:73 
1:72 
1.70 
1.69 
1.68 
1.67 
1.66 
1.65 
1.64 
L57 
[51 
1.45 
1.38 


62.26 
9.46 
5.17 
3.82 
317 
2.80 
2.56 
2.38 
2.25 
2.16 
2.08 
2.01 
1.96 
1.91 
1.87 
1.84 
1.81 
1.78 
1.76 
1.74 
1: 
1.70 
1.69 
1.67 
1.66 
1.65 
1.64 
1.63 
1.62 
1.61 
1.54 
1.48 
1.41 
1.34 


62,53 
9.47 
5.16 
3.80 
3.16 
2:18 
2.54 
2.36 
222 
2:13 
2.05 
1.99 
1:93 
1.89 
1.85 
1.81 
1.78 
1.75 
Le 
LT 
1.69 
1:67 
1.66 
1.64 
1.63 
1.61 
1.60 
1.59 
1.58 
1.57 
Ll 
1.44 
1.37 
1.30 


62.79 
9.47 
5.15 
3.79 
3.14 
2.76 
2.5] 
2.34 
2:01 
PA 
2.03 
1.96 
1.90 
1.86 
1.82 
1.78 
1.75 
175 
1.70 
1.68 
1.66 
1.64 
1.62 
1.61 
1.59 
1.58 
1.57 
1.56 
1.55 
1.54 
1.47 
1.40 
1.32 
1.24 


699 


Table V Percentage Points of the F-Distribution (continued) 


1 | 1614 
2| 1851 
3 | 10.13 
4 7.71 
5 6.61 
6 5.99 
7 5.59 
8 5.32 
9 5.12 
~ 10 4.96 
go 4.84 
3 12 4.75 
* 13 4.67 
= 14 4.60 
> i5 4.54 
S 16 4.49 
S 17| 445 
E 18 441 
Z 19| 438 
E 20 4.35 
p 2l 4.32 
2 59 4.30 
en 
È 23 4.28 
24 4.26 
25 4.24 
26 4.23 
37 4.21 
28 4.20 
29 4.18 
30 4.17 
40 4.08 
60 4.00 
120 3.92 
oc 3.84 


199.5 
19.00 
9.55 
6.94 
9.79 
5.]4 
4.74 
4.46 
4.26 
4.10 
3.98 
3.89 
3.61 
3.74 
3.68 
3.63 
3,59 
3195 
3152 
3.49 
3.47 
3.44 
3.42 
3.40 
3.39 
3.57 
3.95 
3.34 
3.39 
21942 
3,23 
315 
3.07 
3.00 


215.7 
19.16 
9.28 
6.59 
5.41 
4.76 
4.35 
4.07 
3.86 
3.71 
3,59 
3.49 
3.41 
3.34 
3.29 
3.24 
3.20 
3.16 
3:13 
3.10 
3.07 
3.05 
3.05 
3.01 
2.09 
2.98 
2.96 
2:95 
2.93 
2.92 
2.84 
2.76 
2.68 
2.60 


224.6 
19.25 
9.12 
6.39 
5.19 
4.53 
4.12 
3.84 
3.63 
3.48 
3.36 
3.20 
3.18 
aol 
3.06 
3:01 
2.96 
2.93 
2.90 
2.57 
2.84 
2,02 
2.80 
2.78 
2.16 
2.74 
2.13 
2.71 
2.70 
2,69 
2.61 
2,93 
2.45 
2.37 


230.2 
19.30 
9.01 
6.26 
5.05 
4.39 
3.97 
3.69 
3.48 
3:59 
3.20 
AI 
3.03 
2.96 
2.90 
2.85 
2.81 
2.17 
2.74 
2.71 
2.68 
2.66 
2.64 
2.02 
2.60 
2.59 
Zo) 
2.56 
2:99 
2:53 
2.45 
PSY 
219 
2.21 


234.0 
19.33 
8.94 
6.16 
4.95 
4.28 
3.87 
3.58 
23.97 
242 
3.09 
3.00 
2.92 
2.82 
2.19 
2.74 
2.70 
2.66 
2,63 
2.60 
2.57 
2.55 
253 
2.51 
2.49 
2.47 
2.46 
2.45 
2.43 
2.42 
2.34 
2.23 
2. T7 
2,10 


236.8 
19.55 
8.89 
6.09 
4.88 
4.21 
3.79 
3.50 
329 
3.14 
3.01 
2.91 
2.83 
2.76 
2:4 
2.66 
2.61 
2.58 
2.54 
2.91 
2.49 
2.46 
2.44 
2.42 
2.40 
2.39 
2,37 
2.36 
2.35 
2.33 
2,23 
2.17 
2.09 
2.01 


Po.05,»,v, 


Degrees of freedom for the numerator (v,) 


8 


238.9 
19.37 
8.85 
6.04 
4.82 
4.15 
3.73 
3.44 
2425 
3.07 
2.95 
2.85 
2.1) 
2.10 
2.64 
2.99 
2199 
2.21 
2.48 
2.45 
2.42 
2.40 
2.37 
2.36 
2.34 
2.32 
Z1 
2,29 
2.28 
227 
2.18 
2.10 
2.02 
1.94 


9 


240.5 
19.38 
8.81 
6.00 
4.77 
4.10 
3.68 
3.39 
3.18 
3.02 
2.90 
2.80 
2.71 
2,05 
2.59 
2.54 
2.49 
2.46 
2.42 
2.39 
2.37 
2.34 
22 
2.30 
2.28 
zo 
2.253 
2.24 
2,22 
2.21 
2.12 
2.04 
1.96 
1.88 


10 


241.9 
19.40 
8.79 
5.96 
4.74 
4.06 
3.64 
3.35 
3.14 
2.98 
2.85 
2:13 
2.07 
2.60 
2.54 
2.49 
2.45 
2.4] 
2:38 
2.99 
Doe 
2.30 
2.21 
2.45 
2.24 
224 
2,20 
2,19 
2.18 
2.16 
2.08 
1.99 
1.91 
1.83 


I» 


243.9 
19.41 
8.74 
5.91 
4.68 
4.00 
3.97 
3.28 
3.07 
2.91 
2,19 
2.69 
2.60 
2.53 
2.48 
2.42 
2.38 
2.34 
2.2] 
2.28 
2.23 
2.23 
2.20 
2,18 
2.16 
2. I5 
2,13 
242 
2.10 
2.09 
2.00 
1.92 
1.83 
1:25 


I5 


245.9 
19.43 
8.70 
5.86 
4.62 
3.94 
3.51 
224 
3.01 
2.85 
2:2 
2.62 
2:23 
2.46 
2.40 
2:35 
2,21 
221 
2,23 
2.20 
2.18 
2,15 
2,123 
2:11 
2.09 
2.07 
2.06 
2.04 
2.03 
2.01 
1:92 
1.84 
1.75 
1.67 


20 


248.0 
19.45 
8.66 
5.80 
4.56 
3.87 
3.44 
2415 
2.94 
2.17 
2.65 
2.54 
2.46 
2:39 
2.33 
2.28 
2.23 
2.19 
2.16 
2.12 
2.10 
2.07 
2.05 
2.03 
2.01 
1:99 
1.97 
1.96 
1.94 
1.93 
1.84 
1.75 
1.66 
1:57 


24 


249.1 
19.45 
8.64 
oe 
4.53 
3.84 
3.41 
2,17 
2.90 
2.74 
2.61 
2:31 
2.42 
2,35 
2,20 
2.24 
2.19 
2,19 
2,11 
2.08 
2.05 
2.03 
2.01 
1.95 
1.96 
1.95 
1.93 
1.91 
1.90 
1:89 
1:79 
1.70 
1,61 
1:52 


30 


250.1 
19.46 
8.62 
em 
4.50 
3.81 
3.38 
3.08 
2.86 
2.70 
2.57 
2.47 
2.38 
221 
2,29 
2,19 
2,159 
2411 
2.07 
2.04 
2,01 
1.98 
1.96 
1.94 
192 
1.90 
1.88 
1.87 
1.85 
1.84 
1.74 
1.65 
1:33 
1.46 


40 


251.1 
19.47 
8.59 
25.72 
4.46 
27 
3.34 
3.04 
2.83 
2.66 
2223 
2.43 
2.34 
2.47 
2.20 
2.15 
2.10 
2.06 
2.03 
1.99 
1.96 
1.94 
1.91 
1.89 
1:87 
1.85 
1.84 
1.82 
1.81 
1:79 
1.69 
1.59 
1.55 
1.39 


60 


250.2 
19.48 
8.57 
5.69 
4.43 
3.74 
3.30 
3.01 
2.79 
2402 
2.49 
2.38 
2.30 
2.42 
2.16 
eine al 
2.06 
2,02 
1.98 
1,95 
1:92 
1.89 
1.86 
1.84 
1.82 
1.80 
1:79 
Ld 
175 
1.74 
1.64 
1:53 
1.43 
1.32 


120 


2533 
19.49 
8.55 
5.66 
4.40 
3.70 
3247 
2,97 
2:12 
2.58 
2.45 
2.34 
2.23 
2.18 
2.1 
2.06 
2.01 
1.97 
1.93 
1.90 
1.87 
1.84 
1.81 
1.79 
1:77 
1:35 
1.13 
LE 
1.70 
1.68 
1.58 
1.47 
1.35 
1.22 


oo 


254.3 
19.50 
8.53 
5.63 
4.36 
3.67 
3.23 
2,93 
24 
2.54 
2.40 
2.30 
2l 
2,13 
2.07 
2.01 
1.96 
1.92 
1.88 
1.84 
1.81 
1.78 
1.76 
1:73 
L71 
1.69 
1.67 
1.65 
1.64 
1.62 
1.5] 
1.39 
1.25 
1.00 


099 


Table V Percentage Points of the F-Distribution (continued) 


Na 2 3 4 5 6 7 


Degrees of freedom for the denominator (v;) 


O oo uuna BWN KF 


647.8 
38.51 
17.44 
12.22 
10.01 

8.81 
8.07 
dod 
7.21 
6.94 
6.72 
6.55 
6.41 
6.30 
6.20 
6.12 
6.04 
5.98 
5.92 
3:07 
5.83 
5.79 
Fia 
312 
5.69 
5.66 
5.63 
5.61 
5.59 
5457 
5.42 
3.29 
5,15 
5.02 


799,5 
39.00 
16.04 
10.65 

8.43 
7.26 
6.54 
6.06 
5,71 
5.46 
5.26 
5.10 
4.97 
4.86 
4.77 
4.69 
4.62 
4.56 
4.5] 
4.46 
4.42 
4.38 
4.35 
4.32 
4.29 
4.27 
4.24 
4.22 
4.20 
4.18 
4.05 
2.93 
3.80 
3.69 


864.2 
39.17 
15.44 

9.98 
7.76 
6.60 
5.89 
5.42 
5.08 
4.83 
4.63 
4.47 
4.35 
4.24 
4.15 
4.08 
4.01 
3.95 
3.90 
3.86 
3.82 
3.78 
3.75 
3.72 
3.69 
3.67 
3.65 
3.63 
3.61 
3.59 
3.46 
3.34 
3:23 
3.12 


899.6 
49.25 
15.10 

9.60 
7.39 
6.23 
Sud 
5.05 
4.72 
4.47 
4.28 
4.12 
4.00 
3.89 
3.80 
3.79 
3.66 
3.61 
3.56 
3.51 
3.48 
3.44 
3.41 
3.38 
3.35 
2:93 
3.91 
3.29 
34 
3.25 
3.13 
3.01 
2.89 
2.79 


921.8 
39.30 
14.88 

9.36 
7.15 
5.99 
5.29 
4.82 
4.48 
4.24 
4.04 
3.89 
3.17 
3.66 
3.58 
3.50 
3.44 
3.38 
3,99 
3.29 
3.25 
24d 
3.18 
3.15 
3.13 
3.10 
3.08 
3.06 
3.04 
3.03 
2.90 
2,19 
2.67 
2.57 


957.1 
39,33 
14.73 

9.20 
6.98 
5.82 
5.12 
4.65 
4.32 
4.07 
3.88 
2019 
3.60 
3.50 
3.41 
3.34 
3128 
dud 
3.17 
3.19 
3.09 
3.05 
3.02 
2.99 
2.97 
2.94 
2:94 
2.90 
2.88 
2.8) 
2.74 
2.63 
222 
2.4] 


948.2 
39.36 
14.62 

9.07 
6.85 
5.70 
4.99 
4.53 
4.20 
3.95 
3.76 
3.61 
3.48 
3.38 
24249 
2:22 
3.16 
3.10 
3:05 
3.01 
2.97 
2.93 
2.90 
2,87 
2.85 
2.82 
2.80 
2:18 
2.76 
2.19 
2.02 
21 
2.39 
2,29 


0.025,» v, 


Degrees of freedom for the numerator (v,) 


8 


956.7 
39.957 
14.54 

8.98 
6.76 
5.60 
4.90 
4.43 
4.10 
3.85 
3.66 
391 
3.59 
3.29 
3.20 
2.12 
3.06 
3.01 
2.96 
291 
2.87 
2.84 
281 
2.78 
2.13 
2.13 
2.41 
2.69 
2.67 
2103 
233 
2.4] 
2.30 
2,09 


9 


963.3 
39.39 
14.47 

8.90 
6.68 
3:92 
4.82 
4.36 
4.03 
3.78 
3:59 
3.44 
2:31 
2.21 
2,12 
3.05 
2.98 
2:93 
2.88 
2.84 
2.80 
2.76 
2.13 
2.70 
2.68 
2.65 
2.63 
2.61 
2.59 
2:37 
2.45 
2133 
2,22 
2,11 


10 


968.6 
39.40 
14.42 

8.84 
6.62 
5.46 
4.76 
4.30 
3.96 
3.72 
3132 
Ja 
3:25 
3,15 
3.06 
2.99 
2,92 
2.87 
2.82 
ald 
2:13 
2.70 
2,07 
2.64 
2451 
229 
2:93 
2.59 
2:93 
Zal 
24,39 
22) 
2.16 
2.05 


ie 


976.7 
39.41 
14.34 

8.75 
6.52 
3497 
4.67 
4.20 
3.87 
3.062 
3.43 
3.28 
3,15 
3.05 
2,96 
2.89 
2.82 
2.77 
2:9 
2.68 
2.64 
2.60 
Zal 
2.54 
251 
2.49 
2.47 
2.45 
2.43 
2.41 
229 
2,17 
2.05 
1.94 


I5 


984.9 
39.43 
14.25 

8.66 
6.43 
32.27 
4.57 
4.10 
3d 
3.52 
3.33 
3.18 
3.05 
2.93 
2.86 
2.19 
22 
2.67 
2,02 
2:37 
2:55 
2.50 
2.47 
2.44 
2.4] 
2.39 
2.306 
2.34 
2.54 
2,91 
2.18 
2.06 
1.94 
1.83 


20 


993.1 
39.45 
14.17 

8.56 
6.33 
IA 
4.47 
4.00 
3.67 
3.42 
3.23 
3.07 
2,95 
2.84 
2.76 
2.68 
2.62 
2.56 
2:31 
2.46 
2.42 
2.39 
2.30 
2.33 
2.30 
2.28 
2,23 
2,23 
221 
220) 
2.07 
1.94 
1.82 
1.71 


24 


997.2 
39.46 
14.12 

8.51 
6.28 
9412 
4.42 
3.95 
3.61 
3:97 
3.17 
3.02 
2.89 
2.79 
2.70 
2,63 
2.56 
2.50 
2.45 
2.41 
2,31 
2.29 
2.30 
2.27 
2.24 
2.22 
2.19 
2.17 
2.15 
2.14 
2.01 
1.88 
1.76 
1.64 


30 


1001 
39.46 
14.08 

8.46 
6.23 
5.07 
4.36 
3.89 
3.56 
3.91 
2.12 
2.96 
2.84 
213 
2.64 
da 
2.50 
2.44 
2.39 
2,39 
Zo 
Day 
2.24 
2,21 
2,18 
2.16 
2,13 
2:11 
2.09 
2.07 
1.94 
1.82 
1.69 
1.57 


40 


1006 
39.47 
14.04 

8.41 
6.18 
5.01 
4.3] 
3.84 
3:5] 
3.26 
3.06 
2.91 
2.18 
2.67 
2.99 
2.91 
2.44 
2.38 
2,33 
2.20 
2:23 
22) 
2,19 
2 13 
2.12 
2.09 
2,07 
zu 
2,03 
2.01 
1.88 
1.74 
1.61 
1.48 


60 


1010 
39.48 
13.99 

8.36 
6.12 
4.96 
4.25 
3.78 
3.45 
3.20 
3.00 
2.85 
2.72 
2.61 
202 
2.45 
2,35 
2.32 
22) 
22 
2,18 
2.14 
2,11 
2.08 
2.05 
2.03 
2.00 
1.98 
1.96 
1.94 
1.80 
1.67 
1.53 
1.39 


120 


1014 
39.49 
13.95 

8.31 
6.07 
4.90 
4.20 
30/2 
3:39 
3.14 
2.94 
2119 
2.66 
2.99 
2.46 
2.38 
2:94 
2:26 
2.20 
2.16 
PAN 
2.08 
2.04 
2.01 
1.98 
1:95 
1.95 
1:91 
1.89 
1.87 
172 
1.58 
1.43 
1,27 


oO 


1018 
392.50 
13.90 

8.26 
6.02 
4.85 
4.14 
3.67 
3:39 
3.08 
2.88 
2.12 
2.60 
2.49 
2.40 
2.32 
229 
2.19 
2,13 
2.09 
2.04 
2.00 
1:97 
1.94 
1.9] 
1.88 
1.85 
1.83 
1.81 
1.79 
1.64 
1.48 
1.31 
1.00 


Table V Percentage Points of the F-Distribution (continued) 


Joni 


Degrees of freedom for the numerator (v,) 
V5 i| 2 3 4 5 6 7 8 E 10 12 15 


1 | 4052 4999.5 5403 5625 5764 5859 5928 5982 6022 6056 £6106 6157 
2 98.50 99.00 99.17 9925 99.30 99.35 9936 99.37 9939 9940 9942 99.43 
3 34.12 30.82 2946 28.71 2824 27.91 27.67 2749 27.35  Á 27.23 27.05 26.87 
4 21.20 18.00 1669 15.98 15.52 15.21 1498 1480 1466 1455 1437 14.20 
5 16.26 13.27 12.06 11.39 10.97 10.67 1046 110.29 10.16 10.05 9.89 9.72 
6 13.75 10.92 9.78 O15 ms 8.47 8.26 8.10 7.98 7.87 7.72 7.56 
7 12,25 955 8.45 TBS 7.46 7,19 6.99 6.84 6.72 6.62 6.47 6.31 
8 11.26 8.65 7159 7,01 6.63 6.37 6.18 6.03 591 5.81 2457 5502 
9 10.56 8.02 6.99 6.42 6.06 5.80 5.61 5.47 3.39 5.26 S1 4.96 
10 10.04 7.56 6.55 5.99 5.64 5.39 5.20 5.06 4.94 4.85 4.7] 4.56 


11 9.65 7.21 6.22 5.67 3.92 5.07 4.89 4.74 4.63 4.54 4.40 4.25 
12 9.33 6.93 5.95 5.41 5.06 4.82 4.64 4.50 4.39 4.30 4.16 4.01 
13 9.07 6.70 5.74 5.21 4.86 4.62 4.44 4.30 4.19 4.10 3.96 3.82 
14 8.86 6.51 330 5.04 4.69 4.46 4.28 4.14 4.03 3.94 3.80 3.66 
15 8.68 6.36 5.42 4.89 4.36 4.32 4.14 4.00 3.89 3.80 3.67 3492 


16 8.53 6.23 3.29 4.77 4.44 4.20 4.03 3.89 3.78 3.69 3.23 3.41 
17 8.40 6.11 5.18 4.67 4.34 4.10 3.93 3.79 3.68 3.59 3.46 3.31 
18 5.29 6.01 5.09 4.58 4.25 4.01 3.84 3.71 3.60 3.51 3197 3.23 
19 8.18 3.95 5.01 4.50 4.17 3.94 3.77 3.63 22 3.43 3.30 3.15 
20 8.10 5:65 4.94 4.43 4.10 3.87 3.70 3.56 3.46 3.37 3.23 3.09 


Degrees of freedom for the denominator (v;) 


2] 8.02 5.78 4.87 4.37 4.04 3.81 3.64 3.51 3.40 3.31 3.17 3.03 
22 7.95 5.12 4.82 4.31 3.99 3.76 3.59 3.45 3.33 3:28 3.12 2.98 
23 7.88 5.66 4.76 4.26 3.94 3.71 3.54 3.41 3.30 322] 3.07 2.93 


24 7.82 5.61 4.72 4.22 3.90 3.67 2:50 3.36 3.26 317 3.03 2.89 
25 TI 23.97 4.68 4.18 3.85 3.63 3.46 2292 3.22 3.13 299 2.85 
20 572 3193 4.64 4.14 3.82 2:99 3.42 3.29 3,18 3.09 2.96 2.81 
27 7.68 5.49 4.60 4.11 3.78 3.56 3:59 3.26 3.15 3.06 293 2.78 
28 7.64 5.45 4.57 4.07 3.75 3.39 3.36 3.23 3.12 3.03 2.90 2.15 
29 7.60 5.42 4.54 4.04 3.73 3.50 3:259 3.20 3.09 3.00 2.87 2:13 
30 7.56 3.39 4.51 4.02 3.70 3.47 3.30 SULT 3.07 2.98 2.84 2.70 
40 7.91 5.18 4.31 383 Jol 2.29 3.12 2,09 2,09 2.80 2.66 2:92 
60 7.08 4.98 4.13 3.65 3.34 3.12 2.95 2:82 212 2.63 2.50 2.39 
120 6.85 4.79 3.95 3.48 3.17 2.96 2.19 2.66 2.56 2.47 2.34 2.19 

oo 6.63 4.61 3.78 3.32 3.02 2.80 2.64 2.5] 2.41 2.92 2.18 2.04 


199 


20 


6209 
99.45 
26.69 
14.02 

9.55 
7.40 
6.16 
5.36 
4.81 
4.41 
4.10 
3.86 
3.66 
Jal 
3:37 
3.26 
3.16 
3.08 
3.00 
2.94 
2.88 
2.83 
2.78 
2.74 
2.10 
2.66 
2.63 
2.60 
Zio) 
2:235 
2.27 
2.20 
2.03 
1.88 


24 


6235 
99.46 
26.00 
135.93 

9.47 
7,91 
6.07 
525 
4.73 
4.33 
4.02 
3:78 
3.59 
3.43 
3.29 
3.18 
3.08 
3.00 
2:04 
2.86 
2.80 
2.75 
2.70 
2.66 
2.62 
2.58 
2.55 
24 
2.49 
2.47 
2.49 
2:12 
1:95 
1.79 


30 


6261 
99.47 
26.50 
13.84 

9.35 
71.23 
5.99 
5.20 
4.65 
4.25 
3.94 
3.70 
3.51 
3.29 
ERA 
3.10 
3.00 
2.02 
2.84 
2.78 
2.42 
2:67 
2.62 
2.58 
2.54 
2.50 
2.47 
2.44 
2.41 
2,30 
2.20 
2.03 
1.86 
1.70 


40 


6287 
99.47 
26.41 
13.75 

9.29 
7.14 
2.91 
2412 
4.57 
4.17 
3.86 
3.62 
3.43 
3:21 
3.13 
3.02 
2.92 
2.84 
2:16 
2:09 
2.64 
2.58 
2.54 
2.49 
2.45 
2.42 
2.38 
2.93 
24,23 
2.30 
2, HI 
1.94 
1.76 
1.59 


60 


6313 
99.48 
26,32 
13.65 

9.20 
7.06 
32082 
5403 
4.48 
4.08 
3.78 
3.54 
3.34 
3.18 
3.05 
2.93 
2.83 
2.15 
2:407 
2.61 
2,55 
2.50 
2.45 
2.40 
2.36 
2.99 
2.20 
2:20 
2:23 
2:241 
2:02 
1.84 
1.66 
1.47 


120 


6339 
99.49 
26.22 
13.56 

9.11 
6.97 
5.74 
4.95 
4.40 
4.00 
3.69 
3.45 
3.25 
3.09 
2.96 
2.84 
2.12 
2.66 
2.58 
2.52 
2.46 
2.40 
2,35 
2.21 
2.2] 
2,23 
2.20 
27 
2.14 
2.11 
1:92 
1:73 
1.55 
1:22 


CO 


6366 
99.50 
26.13 
13.46 

9.02 
6.88 
3.65 
4.46 
4.31 
3.91 
3.60 
3.36 
3.17 
3.00 
2.87 
PA E 
2,65 
21:37 
2439 
2.42 
2.36 
2:1 
2.26 
2 4l 
2.17 
2.13 
2.10 
2.06 
203 
2.01 
1.80 
1.60 
1.38 
1.00 
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Chart VI Operating Characteristic Curves 
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(a) O.C. curves for different values of n for the two-sided normal test for a level of significance a = 0.05. 
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(b) O.C. curves for different values of 7 for the two-sided normal test for a level of significance a = 0.01. 


Source: Charts Vla, e, f, k, m, and q are reproduced with permission from “Operating Characteristics 
for the Common Statistical Tests of Significance," by C. L. Ferris, F. E. Grubbs, and C. L. Weaver, 
Annals of Mathematical Statistics, June 1946. 

Charts VID, c, d, g, h, i, j, l n, o, p, and r are reproduced with permission from Engineering Statistics, 
2nd Edition, by A. H. Bowker and G. J. Lieberman, Prentice-Hall, 1972. 
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(c) O.C. curves for different values of n for the one-sided normal test for a level of significance a = 0.05. 
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(d) O.C. curves for different values of n for the one-sided normal test for a level of significance a = 0.01. 
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Chart VI Operating Characteristic Curves (continued) 
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(f) O.C. curves for different values of n for the two-sided t-test for a level of significance a = 0.01. 
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Chart VI Operating Characteristic Curves (continued) 
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(h) O.C. curves for different values of n for the one-sided t-test for a level of significance a = 0.01. 
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Chart VI Operating Characteristic Curves (continued) 
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(j) O.C. curves for different values of n for the two-sided chi-square test for a level of significance a = 0.01. 
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Chart VI Operating Characteristic Curves (continued) 


Probability of accepting Ho 





(k) O.C. curves for different values of n for the one-sided (upper tail) chi-square test for a level of 
significance a — 0.05. 


Probability of accepting Ho 





(I) O.C. curves for different values of n for the one-sided (upper tail) chi-square test for a level of 
significance a — 0.01. 
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Chart VI Operating Characteristic Curves (continued) 
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(m) O.C. curves for different values of n for the one-sided (lower tail) chi-square test for a level of 
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Probability of accepting Ho 





(n) O.C.curves for different values of n for the one-sided (lower tail) chi-square test for a level of 
significance a — 0.01. 
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Chart VI Operating Characteristic Curves (continued) 
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(p) O.C. curves for different values of n for the two-sided F-test for a level of significance a = 0.01. 
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Table VII Critical Values for the Sign Test 


















Two-sided tests 
One-sided tests 


0.01 
0.005 


0.10 
0.05 


0.05 
0.025 





0.10 0.05 0.01 Two-sided tests 
n 0.05 0.025 0.005 One-sided tests 


0 7 6 4 
0 0 7 6 3 
0 0 7 7 5 
l 0 0 8 7 6 
l | 0 8 7 6 
l l 0 9 8 6 
2 l 0 9 8 7 
2 2 l 10 9 7 
3 9, l 10 9 7 
3 2 l 10 9 8 
3 3 2 11 10 8 
4 3 2 11 10 9 
4 4 2 12 11 9 
5 i 3 12 11 9 
5 4 3 13 12 10 
B 5 3 13 12 10 
6 5 4 13 12 11 
6 5 4 14 13 11 





Table VIII Critical Values for the Wilcoxon Signed-Rank Test 


Wo 


0.10 0.05 0.02 0.01 Two-sided tests 
n* 0.05 0.025 0.01 0.005 One-sided tests 


4 
5 0 

6 2 0 

7 3 2 0 

8 5 3 1 0 
9 8 5 3 1 
10 10 8 5 3 
11 13 10 7 5 
12 17 13 9 7 
13 21 17 12 9 
14 25 21 15 12 
15 30 25 19 15 
16 35 29 23 19 
17 41 34 21 23 
18 47 40 32 21 
19 53 46 37 32 
20 60 52 43 37 
21 67 58 49 42 
22 79 65 55 48 
23 83 73 62 54 
24 9] 81 69 61 
25 100 80 76 68 


* If n — 25, W^ (or W ) is approximately normally distributed with mean n(n + 1)/4 
and variance n(n + 1)(2n + 1)/24. 
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Table IX Critical Values for the Wilcoxon Rank-Sum Test 


W0.05 
n» x 4 5 6 7j 8 9 10 

4 10 

5 11 17 

6 12 18 26 

7 13 20 27 36 

8 14 21 29 38 49 

9 15 22 31 40 51 63 

10 15 23 32 42 53 65 78 
11 16 24 34 44 38 68 81 
12 LT 26 35 46 58 71 85 
13 18 27 37 48 60 73 88 
14 19 28 38 50 63 76 9] 
15 20 29 40 52 65 79 94 
16 21 31 42 54 67 82 97 
17 21 32 43 56 70 84 100 
18 22 33 45 58 72 87 103 
19 23 34 46 60 74 90 107 
20 24 35 48 62 77 93 110 
2] 25 37 50 64 79 95 
22 26 38 51 66 82 
23 27 39 53 68 

24 28 40 55 

25 28 42 

26 29 

27 

28 


*For n; andn, > 8, W, is approximately normally distributed with mean jn nj + m + l)and variance njny(n, + n, + 1)/12. 


Table IX Critical Values for the Wilcoxon Rank-Sum Test (continued) 


Wo.01 
nj | 4 5 6 7 8 9 10 

5 15 

6 10 16 23 

7 10 17 24 32 

8 11 17 25 34 43 

9 11 18 26 35 45 56 

10 12 19 27 37 47 58 71 
11 12 20 28 38 49 61 74 
12 13 21 30 40 51 63 76 
13 14 22 31 41 53 65 79 
14 14 22 32 43 54 67 81 
15 15 23 33 44 56 70 84 
16 15 24 34 46 58 T2 86 
17 16 25 36 47 60 74 89 
18 16 26 37 49 62 76 92 
19 17 27 38 50 64 78 94 
20 18 28 39 52 66 81 97 
21 18 29 40 53 68 83 
22 19 29 42 55 70 
23 19 30 43 57 
24 20 31 44 
25 20 32 
26 21 
27 


Tul 


96 

99 
103 
106 
110 
114 
117 
121 
124 


Ti 


12 


115 
119 
123 
127 
131 
135 
139 


1 


106 
109 
112 
115 
119 
122 
125 


13 


137 
141 
145 
150 
154 


T3 


125 
129 
133 
137 
140 


14 


160 
164 
169 


14 


147 
151 
135 


I5 


185 


[s 


171 


Table X Factors for Constructing Variables Control Charts 


n* A, 
2 3.760 
3 2.394 
4 1.880 
5 1.596 
6 1.410 
7 1.277 
8 1.175 
9 1.094 

10 1.028 

11 .973 

12 .925 

13 .884 

14 .848 

15 .816 

16 .788 

17 .762 

18 .738 

19 .717 

20 .697 

21 .679 

27 .662 

23 .647 

24 .632 

25 .619 


*n > 25: A, = 3/ Vin where n = number of observations in sample. 


X Chart 


A, 
1.880 
1.023 

129 

277 

.483 

.419 

373 

.337 

.308 

.285 

.266 

.249 

2435 

22D 

EA 

.203 

.194 

.187 

.180 

.173 

.167 

.162 

.157 

A93 


Factor for Control Limits 


dy 
1.128 
1.693 
2.059 
2.326 
2.534 
2.704 
2.847 
2.970 
3.078 
3.173 
3.235 
3.336 
3.407 
3.472 
3.592 
3.588 
3.640 
3.689 
3:193 
3.778 
3.819 
3.858 
3.095 
3.951 





S Chart 


C4 
0.7979 
0.8862 
0.9213 
0.9400 
0.9515 
0.9594 
0.9650 
0.9693 
0.9727 
0.9754 
0.9776 
0.9794 
0.9810 
0.9823 
0.9835 
0.9845 
0.9854 
0.9862 
0.9869 
0.9876 
0.9882 
0.9887 
0.9892 
0.9896 


S 


= = 
— O ooco-J1X€0€(Ut BW WN 


NO h2 NN WN NN RRR RRR FR n 
A BPWNrF TODO WAND MN FW WN 
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Table XI Factors for Tolerance Intervals 


Values of k for Two-Sided Intervals 


Confidence 
Level 0.90 0.95 0.99 
Percent 
Coverage 0.90 0.95 0.99 0.90 0.95 0.99 0.90 0.95 0.99 

2 15.978 18.800 24.167 32.019 37.674 48.430 160.193 188.491 242.300 

3 5.847 6.919 8.974 8.380 9.916 12.861 18.930 22.401 29.055 

4 4.166 4.943 6.440 5.369 6.370 8.299 9.398 11.150 14.527 

5 3.949 4.152 5.423 4.275 5.079 6.634 6.612 7.855 10.260 

6 3.131 3.723 4.870 3.712 4.414 5.775 5.537 6.345 8.301 

7 2.902 3.452 4.521 3.369 4.007 5.248 4.613 5.488 7.187 

8 2.743 3.264 4.278 3.136 3.732 4.891 4.147 4.936 6.468 

9 2.626 3.125 4.098 2.967 3:542 4.631 3.822 4.550 5.966 
10 2.539 3.018 3.959 2.839 3.379 4.433 3.582 4.265 5.594 
11 2.463 2.933 3.849 2797 3.259 4.277 3.397 4.045 5.308 
17 2.404 2.863 3.758 2.655 3.162 4.150 3.250 3.870 5.079 
13 2.355 2.805 3.682 2.587 3.081 4.044 3.130 3:727 4.893 
14 2.314 2.756 3.618 2.529 3.012 3.955 3.029 3.608 4.737 
I5 2.278 2:115 3.562 2.480 2.954 3.878 2.945 3.507 4.605 
16 2.246 2.676 3.514 2.437 2.903 3.812 2.872 3.421 4.492 
I7 2.219 2.643 3.471 2.400 2.858 3.754 2.808 3.345 4.393 
18 2.194 2.614 3.433 2.366 2.819 3.702 2.155 3.279 4.307 
19 2.172 2.588 3.399 2.937 2.784 3.656 2.703 3.221 4.230 
20 ZAG? 2.564 3.368 2.310 2d 3.615 2.659 3.168 4.161 
21 2.135 2.543 3.340 2.286 2.123 3.577 2.620 3.121 4.100 
22 2.118 2.524 3.315 2.264 2.697 3.543 2.584 3.078 4.044 
23 2.103 2.506 3.292 2.244 2.673 3.512 2:35] 3.040 3.993 
24 2.089 2.489 3.270 2225 2.651 3.483 2:524 3.004 3.947 
25 2:077 2.474 A23] 2.208 2.631 3.457 2.494 2.972 3.904 
30 2.025 2.413 3.170 2.140 2.529 3.350 2.385 2.841 3.733 
40 1.959 2.334 3.066 2.052 2.445 3.213 2.247 2.677 3.518 
50 1.916 2.284 3.001 1.996 2.379 3.126 2.162 2.5776 3.385 
60 1.887 2.248 2.955 1.958 2.333 3.066 2.103 2.506 3.293 
70 1.865 2.222 2.920 1.929 2.299 3.021 2.060 2.454 3725 
80 1.848 2.202 2.894 1.907 2212 2.986 2.026 2.414 3.173 
90 1.834 2.185 2872 1.889 2.231 2.958 1.999 2.382 3.130 


100 1.822 2.172 2.854 1.874 2.2325 2.934 1.977 2.939 3.096 
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Table XI Factors for Tolerance Intervals (continued) 


Values of k for One-Sided Intervals 


Confidence 
Level 0.90 0.95 0.99 
Percent 
Coverage 0.90 0.95 0.99 0.90 0.95 0.99 0.90 0.95 0.99 

2 10.253 13.090 18.500 20.581 26.260 37.094 103.029 131.426 185.617 

3 4.258 5.311 7.340 6.155 7.656 10.553 13.995 17.370 23.896 

4 3.188 3.957 5.438 4.162 5.144 7.042 7.380 9.083 12.387 

5 2.742 3.400 4.666 3.407 4.203 5.741 5.362 6.578 8.939 

6 2.494 3.092 4.243 3.006 3.708 5.062 4.411 5.406 7.335 

7 2.333 2.894 3.972 2:159 3.399 4.642 3.859 4.128 6.412 

8 2.219 2.754 3.783 2.582 3.187 4.354 3.497 4.285 5.812 

9 2.133 2.650 3.641 2.454 3.031 4.143 3.240 3.912 5.389 
10 2.066 2.568 3:532 2.355 2.911 3.981 3.048 3.738 5.074 
11 2.011 2.503 3.443 2219 2.815 3.852 2.898 3.556 4.829 
12 1.966 2.448 3.371 2.210 2.736 3.747 2.117 3.410 4.633 
13 1.928 2.402 3.309 2.155 2.671 3.659 2.677 3.290 4.472 
14 1.895 2.363 3.251 2.109 2.614 3.585 2.593 3.189 4.337 
I3 1.867 2.329 3.212 2.068 2.566 3.520 2:921 3.102 4.222 
16 1.842 2.299 3.172 2.033 2.524 3.464 2.459 3.028 4.123 
17 1.819 2.212 3.137 2.002 2.486 3.414 2.405 2.963 4.037 
18 1.800 2.249 3.105 1.974 2.453 3.370 2:394 2.905 3.960 
19 1.782 2:221 3.077 1.949 2.423 3.331 2.314 2.854 3.892 
20 1.765 2.028 3.052 1.926 2.396 3.295 2.276 2.808 3.832 
21 1.750 2.190 3.028 1.905 2.371 3.263 2.241 2.766 3.777 
22 1.737 2.174 3.007 1.886 2.349 3.233 2.209 2.729 3.727 
23 1.724 2.159 2.987 1.869 2.328 3.206 2.180 2.694 3.681 
24 1.712 2.145 2.969 1.853 2.309 3.181 2.154 2.662 3.640 
25 1.702 2:132 2.952 1.838 2.292 3.158 2.129 2.633 3.601 
30 1.657 2.080 2.884 1.777 2.220 3.064 2.030 2515 3.447 
40 1.598 2.010 2.793 1.697 2.125 2.941 1.902 2.364 3.249 
50 1.559 1.965 2.735 1.646 2.065 2.862 1.821 2.269 3.125 
60 1.532 1.933 2.694 1.609 2.022 2.807 1.764 2.202 3.038 
70 1.511 1.909 2.662 1.581 1.990 2.765 1.722 2.153 2.974 
80 1.495 1.890 2.638 1.559 1.964 2.733 1.688 2.114 2.924 
90 1.481 1.874 2.618 1.542 1.944 2.706 1.661 2.082 2.883 


100 1.470 1.861 2.601 1.523 1.927 2.684 1.639 2.056 2.850 
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CHAPTER 2 


Section 2-1 


2-1. Let a, b denote a part above, 
below the specification. 
S — (aaa, aab, aba, abb, baa, 

bab, bba, bbb} 

2-7. Sis the sample space of 100 pos- 
sible two-digit integers. 

2-0. S= {0,1,2,...,}1n ppb. 

c = connect, b = busy, S = 

{c, bc, bbc, bbbc, bbbbc, . . .* 

(a) S = 10. 1,2. 3.5 

(b) S (c) {12, 13, 14, 15} 

(D 10.1.2, 211b (eS 

(£) 20,16 2.08.57 QD 

h © (4) {8,9, 10,...3 

Let d denote a distorted bit and let 

o denote a bit that is not distorted. 

dddd, dodd, oddd, oodd, 


dddo, dodo, oddo, oodo, 
ddod, dood, odod, oood, 


ddoo, dooo, odoo, oooo 
(b) No 
(c) {dddd, dodd, dddo, dodo, 
ddod, dood, ddoo, dooo} 


2-23. 


(a) S= 


Appendix C 
Answers to 


Selected 


Exercises 


(d) {oddd, oodd, oddo, oodo, 
odod, oood, odoo, oooo} 
(e) {dddd} 
(f) {dddd, dodd, dddo, oddd, 
ddod, oodd, ddoo} 
2? = 4096 
(a) A'N B = 10, B' = 10, 
AUB = 92 
(a) A’ = {x |x = 72.5} 
(b) B’ = {x|x < 52.5} 
(c) AN B= {x|52.5<x<72.5 
(d AUB = {x|x > 0} 
Let g denote a good board, m a 
board with minor defects, and j a 
board with major defects. 
(a) S = (gg, gm, gj, mg, mm, 
mj, jg, jm, jj) 
(b) 5 = igg, gm, gj, mg, mm, 
mj, jg, jm} 


2-25. 
2-27. 


2-29. 


2-31. 


Section 2-2 

2-35. (a) 0.4 (b) 0.8 (c) 0.6 
(d) 1 (e) 0.2 

(a) S = {1, 2,3, 4,5, 6, 7, 8} 
(b) 2/8 (c) 6/8 

(a) 0.7 (b) 0.8 


2-37. 


2-39. 


2-41. 
2-43. 
2-45. 


(a) 0.25 (b) 0.75 
5.7 X 10° 

(a) 0.86 (b) 0.79 
(d) 0.70 (e) 0.95 
(a) 0.30 (b) 0.77 
(d) 0.22 (e) 0.85 


(c) 0.14 
(f) 0.84 
(c) 0.70 
(f) 0.92 


2-47. 


Section 2-3 
2-49. (a) 0.7 (b) 0.4 (c) 0.1 
(d) 0.2 (e) 0.6 from part (b) 
(f) 0.8 
2-51. No 
(a) 350/370 (b) 362/370 
(c) 358/370 (d) 345/370 
(a) 13/130 (b) No 


Section 2-4 

2-57. (a) 86/100 (b) P(B) = 
79/100 (c) 70/79 

(d) 70/86 

(a) 345/357 (b) 5/13 

(a) 0.15 (b) 0.153 

(c) 0.72. (d) 0.733 

(e) 0.11 (f) 0.76 

(a) 15/40 (b) 14/39 

(c) 0.135 (d) 0.599 
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2-65. (a) 4/499 — 0.0080 
(b) (5/500)(4/499) = 0.000080 
(c) (495/500)(494/499) = 0.98 
(a) 0.813 (b) 0.632 


(c) 0.764 


2-67. 


Section 2-5 


2-71. 022 
2-73. 0.023 

2-75. 0.028 

2-77. (a) 0.0225 (b) 0.125 
2-79. (a) 0.20 (b) 0.20 


Section 2-6 


2-81. No 

2-83. No 

2-85. (a) No (b) 0.733 

2-87. (a) 0.59 (b) 0.328 (c) 0.41 
2-89. (a) 0.00003 (b) 0.00024 

(c) 0.00107 

0.9702 

(a) No (b) Yes 


2-91. 
2-93. 


Section 2-7 


2-95. 0.003 
2-97. (a) 0.615 (b) 0.618 
(c) 0.052 


2-99. (a) 0.9847 (b) 0.1184 


Supplemental 
2-101. The sample space S = {4, A'D}, 
A'D,, A'D}, A'D4, A'D4i 
(a) 0.19 (b) 0.15 (c) 0.99 
(d) 0.80 (e) 0.158 
2-105. (a) No (b) No (c) 40/240 
(d) 200/240 (e) 234/400 
(f) 1 
2-107. (a) 0.282 (b) 0.718 
2-109. 0.996 
2-111. (a) 0.0037 (b) 0.811 
2-113. (a) 0.0778 (b) 0.00108 
(c) 0.947 
2-115. (a) 0.9764 (b) 0.680 
2-117. (a) 0.207 (b) 0.625 (c) 0.6 
2-119. (a) 0.453 (b) 0.261 
(c) 0.881 (d) 0.547 
(e) 0.783 (f) 0.687 
2-121. 1.58 X 1077 


2-103. 


CHAPTER 3 

Section 3-1 

3-1. {0,1,2,..., 1000} 
3-3. {0,1,2,...,99999} 


APPENDIX C ANSWERS TO SELECTED EXERCISES 


Section 3-2 


3-13. f,(0) = 1/3, f,(1.5) = 1/3, 
fxQ) = 1/6, f(3) = 1/6 
(a) 1 (b) 7/8 (c) % 
(d) 7) 

(a) 9/25 (b) 4/25 

(c) 12/25 (d) 1 

P(X = 10 million) = 0.3, 
P(X = 5 million) = 0.6, 
P(X = 1 million) = 0.1 
P(X = 0) = 8 X 10°, 
P(X = 1) = 0.0012, 

P(X = 2) = 0.0576, 

P(X = 3) = 0.9412 

P(X = 15 million) = 0.6, 
P(X = 5 million) = 0.3, 
P(X = —0.5 million) = 0.1 
P(X = 0) = 0.00001, 
P(X = 1) = 0.00167, 

P(X = 2) = 0.07663, 

P(X = 3) = 0.92169 


3-15. 


3-21. 


3-25. 


Section 3-3 

3-27. (a) 7/8 (b) 1 
(d) 3/8 

F(x) = 0 for x < 1 million; 0.1 
for 1 million = x < 5 million; 
0.7 for 5 million = x < 10 mil- 
lion; 1 for 10 million = x 

F(x) = 0 for x « 0; 0.008 for 
0sxe«lIUlo4torlsxe«2 
0.488 for2 =x <3;1for3 =x 
(a 1 (b) 0.5 (c) 0.5 

(d) 0.5 

(a) 1 (b) 0.75 (c) 0.25 

(d) 0.25 (e)O (f) 0 


(c) Ty 


3-29. 


3-31. 


Section 3-4 

3399 EO = 2, VX) =2 
3-39. E(X) 2 0, (X) = 1.5 
3-41. E(X) = 6.1 million, 
V(X) = 7.89 million? 
EX) = 2.4, V(X) = 0.48 
x = 24 


3-43. 
3-45. 


Section 3-5 
3-47. E(X) = 2, V(X) = 0.667 
3-49. E(X) = 0.17, V(X) = 0.0002 


3-51. E(X) = 590.45, V(X) = 0.0825 
Section 3-6 
3-57. (a) 0.2461 (b) 0.0547 


(c) 0.0107 (d) 0.3223 


3-59. (a) 24 X 10? 
(c) 9.91 x 10 !5 
(d) 1.138 x 107 
F(x) = 0 for x < 0; 0.4219 for 
0=x< 1; 0.8438 for 1 =x « 2; 
0.9844 for 2 =x < 351for3 x 
(a) 0.3681 (b) 0.6323 

(c) 0.9198 (d) E(X) = 1, 
V(X) = 0.999 

(a) n = 50, p = 0.1 

(b) 0.1117 (c) 4.51 xX 10 ^ 
(a) 0.9961 (b) 0.9886 

(a) 0 (b) 0.2137 


(b) 0.99989 


3-61. 


3-63. 


3-65. 


3-67. 
3-69. 


Section 3-7 

3-71. (a) 0.5 (b) 0.0625 

(c) 0.0039 (d) 0.75 (e) 0.25 
(a) 0.0064 (b) 0.9984 

(c) 0.008 

(a) 0.0167 (b) 0.9224 (c) 50 
(a) 3.91 X 10. (b) 200 

(0) 2:56 x10" 

(a 5 (b) 5 

(a) 20 (b) 0.0436 
(c) 0.0459 (d) 0.0411 
(a) 3000 (b) 1431.18 


3-73. 


3-75. 
3-77. 


3-79. 
3-81. 
(e) 19 
3-83. 


Section 3-8 


3-87. (a) 0.4623 (b) 0.0002 
(c) 0.9866 (d) E(X) = 0.8, 
WX) = 0.539 
F(X) = 0 forx «0; 1/6 for 0 < 
x <1; 2/3 for 1 =x « 2; 29/30 
for 2 =x < 3; 1 for3 =x 
3-91. (a) 0.1201 (b) 0.8523 
3-93. (a) 0.7069 (b) 0.0607 

(c) 0.2811 


3-89. 


Section 3-9 


3-97. (a) 0.0183 (b) 0.2381 
(c) 0.1954 (d) 0.0298 
3-99. E(X) = 2.996, V(X) = 2.996 
3-101. (a) 0.0045 (b) 0.3679 
(c) 0.1353 (d) 0.2642 
3-103. (a) 4.54 X 1075 (b) 0.6321 
3-105. (a) 0.6065 (b) 8.9 x 10? 
(c) 0.00146 


Supplemental 


3-107. 0.3714 

3-109. (a) 0.0117 (b) 1.3333 

3-111. (a) 0.1755 (b) 0.0858 
(c) 0.2873 

3-113. 0.9810 


3-115. 
x 2 3 4 5 6 
f(x) 0.0025 0.01 0.03 0.065 0.13 
x 7 & 9 10 


f(x) 0.18 0.2225 0.2 0.16 

3-117. 299 

3-119. (a) 4.1 x 102 
(c) 0.9995 

3-121. (a) 0.6 (b) 0.8 (c) 0.7 
(d) 3.9 (e) 3.09 


(b) 10 


3-123. (a) 0.2408 (b) 0.4913 
3-125. (a) 0.3233 (b) 0.0916 
3-127. 0.0738 
3-129. (a) 0.3679 (b) 50.51 
(c) 0.9234 
CHAPTER 4 
Section 4-2 
4-1. (a) 0.3670 (b) 0.2858 (c) 0 
(d) 0.9817 (e) 0.0498 
4-3. (a) 0.4375 (b) 0.7969 
(c) 0.5625 (d) 0.7031 
(e) 0.5 
4-5. (a) 0.5 (b) 0.4375 (c) 0.125 
(d) O (e) 1 (f) 0.9655 


4-7. (a) 0.5 (b) 49.8 
4-9. (a) 0.10 (b) 2.5 


Section 4-3 

4-11. (a) 0.56 (b) 0.7 (c) O (d) 0 
4-13. 1—e 'forx^0 

4315. 1—e *? forx 4 

4-17. (a) 1.25x — 93.25 for 

74.6 « x « 75.4 (b) 0.5 

F(x) = 0 for x < 0; 0.25x? for 
Üsx2-l11orx $2 


4-21. 


Section 4-4 
4-23. E(X) = 2.6667, V(X) = 0.8889 
4-25. E(X) = 4.083, V(X) = 0.3291 
4-27. (a) E(X) = 109.39 pm, 

WX) = 33.19 um? 
(b) $54.70 
(a) E(X) = 5.1 mm, W(X) = 
0.01 mm? (b) 0.3679 


4-29. 


Section 4-5 

4-31. (a) E(X) = 3.5, WX) = 1.33, 
oy = 1.155 (b) 0.25 

(a) E(X) = 50, V(X) = 0.0208, 
Oy = 0.144 (b) F(x) = 2x 
—99.5 for 49.75 « x « 50.25 
(c) 0.7 


4-33. 


4-35. (a) E(X) = 1.85 min, V(X) = 
0.0408 min? (b) 0.7143 

(c) F(x) = (x — 1.5)/0.7 for 
13 == 2.2 

(b) 0.25 (c) 0.2140 

(d) E(X) = 0.2100 jum, 
V(X) = 0.00000833 um? 


4-37. 


Section 4-6 


4-39. (a) 0.90658 (b) 0.99865 
(c) 0.07353 (d) 0.98422 
(e) 0.95116 

(a) 128 (b) 0 (c) 1.28 
(d) —1.28 (e) 1.33 

(a) 0.93319 (b) 0.69146 
(c) 0.9545 (d) 0.00135 
(e) 0.15866 
(a) 0.93319 
(c) 0.38292 
(e) 0.54674 
(a) 0.99379 
(c) 5835 
(a) 0.0082 (b) 0.72109 
(c) 0.564 

(a) 0.00135 (b) 0.15866 
(c) 71.6 min 

(a) 0.02275 (b) 0.47725 
(c) 0.336 
(a) 0.15866 
(c) 99.7390 
(a) 0.15245 
(a) 0.06681 
(c) 0.000214 


4-41. 


4-43. 


4-45. (b) 0.89435 


(d) 0.80128 
4-47. (b) 0.13591 
4-49. 
4-51. 
4-53. 
4-55. (b) 90.0 


4-57. 
4-59. 


(b) 125.6 
(b) 0.86638 


Section 4-7 
4-61. (a) 0.075 (b) 0.85 
4-63. (a) 0.129 (b) 0.488 
4-65. 0.013 
4-67. 0.966 
4-71. (b) 330 (c) 0.0089 


Section 4-9 


4-73. (a) 0.3679 (b) 0.1353 

(c) 0.0498 (d) 29.96 

(a) 0.333 min (b) 0.333 min 
(c) 0.9986 


4-75. 


4-77. 


4-79. 
4-81. 
4-83. 
4-85. 


4-87. 


(a) 0.1353 
(c) 0.2031 
(a) 0.0498 
(a) 0.0025 
(a) 0.1353 
(a) 0.2212 
(c) 0.2212 
0.8488 


(b) 0.4866 
(d) 34.54 
(b) 0.8775 
(b) 0.6321 
(b) 0.2707 
(b) 0.2865 


(c) 5 
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4-93. (a) 5 (b) 0.1353 (c) No 


(d) 11.51 


Section 4-10 


4-97. (a) 0.1755 (b) 0.2643 

4-99. (a) 50,000 (b) 0.677 

4-101. (a) 500,000 (b) 223607 
(c) 0.0803 

4-103. (a) 0.1429 (b) 0.1847 

4-105. (a) 120 (b) 1.32934 
(c) 11.6317 


Section 4-11 
4-109. (a) 12,000 (b) 3.61 x 10'? 
4-111. (a) 0.5273 (b) 8862.3 
(c) 0.00166 
4-113. (a) 0.275 (b) 0.685 
4-115. (a) 443.11 (b) 53650.5 
(c) 0.2212 


Section 4-12 


4-117. (a) 0.9332 (b) 209522 
(c) E(X) = 133597, 

V(X) = 1.45 x 10” 
4-119. (a) 0.983 (b) 0.45 
4-121. 0 = 3.45, œ = 225 


Supplemental 


4-125. 025x* ^x + 1for2« x «4 
4-127. (a) 0.3935 (b) 0.3834 
(c) 23.03 

(a) 0.423 (b) 50 

(a) 0 = 3.43, œ? = 0.96 
(b) 0.946 

(a) 0.6915 (b) 0.683 
(c) 1.86 

(a) 0.0062 (b) 0.0124 
(c) 5.33 

0.0008 to 0.0032 

w = 11,398 

(a) 0.5633 (b) 737.5 
(a) 0.984 (b) 0.834 


4-129. 
4-133. 


4-135. 
4-137. 


4-139. 
4-141. 
4-143. 
4-145. 


CHAPTER 5 

Section 5-1 

5-l. f(xy)e0 2 fy) = 1 

5-3. E(X) = 1.8125, 
E(Y) = 2.875 

5-5. c= 1/36 

5-7. E(X) = 2.167, W(X) = 0.639, 
E(Y) = 2.167, V(Y) = 0.639 
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5-9. f(x,y) 20, S f(x,y) = 1 
5-11. E(X) = 1/8, E(Y) = 1/4 
5-13. X¥=0,Y=OandX¥+Y=4 
5-15. (b) /,(0) = 0.2338, (1) = 
0.4188, f,(2) = 0.2679, 
fx (3) = 0.0725, f. (4) = 
0.0070 
(c) E(X) = 1.2 
(d) fyjs(0) = 0.857, 
fy) = 0.143, 
(e) E(Y | X = 3) = 0.143 
Section 5-2 
5-17. (a) 0.5 (b) 0.35 (c) 0.5 
(d) 0.8 (e) 1.5 
5-19. P(X¥=1/Y=1,Z=2) =04, 
P(X =2/Y=1,Z=2)= 0.6 
5-21. (a) 0 (b) 0.072 (c) 0.736 
(d) 1 
5-23. (a) x20,vy20,z=0, 
xty+z=4 
(b) No 
5-25. (a) 0.1758 (b) 0.2198 
(c) E(X) = 1.067, 
V(X) = 0.6146 
5-27. (a) Yes (b) 0.1944 
(c) 0.0001 
5-29. (a) 0.7347 (b) 0 
(c) P(X = 0|Y = 2) = 0.0204, 
P(X = 1|Y = 2) = 0.2449, 
P(X = 2| Y = 2) = 0.7347 
(c) E(X| Y = 2) = 1.7142 
5-31. (a) binomial p = 0.03, n = 3 
E(X) = 0.03, V(X) = 0.0297 
(b) P(X =0| Y= 2) = 0.98958, 
P(X = 1 | Y = 2) = 0.01042 
(c) E(X | Y = 2) = 0.01042, 
WX |Y = 2) = 0.01031 
Section 5-3 
5-35. (a) 0.4444 (b) 0.6944 
(c) 0.5833 (d) 0.3733 (e) 2 
(f) 0 
5-37. 1/24 
5-39. (a) (2x + 1)/12 for0 <x «3 
(b) (y + D/6for1 «y «3 
(c) 2.111 
(d) 0.4167 
(e) (2-F 30/6 for 0 « x «2 
5-43. 10 
5-45. (a) 10(e ? — e ?y/3 for0 <x 


(b) 3.157€ 7? forü y « 1 
(c) 0.089 (d) 2e "^ for2 <x 


5-49. 
5-51. 


2/15 

(a) œ + 1)/7.5 for0<x< T, 
2/7.5forl «x «4 

(b) 0.5 forü0 c y «2 

(c) 1 (d) 0.25 

(a) 0.0439, 0.0019 

(b) 0.065 


5-53. 


Section 5-4 

5-55. (a) 0.25 (b) 0.0625 (c) 1 

(d) 1 (e) 2/3 

(a) 2xfor0 € x « 1 

(b) 0.25 

5-61. 6 

5-63. (a) 3x — 1)? for0c x «I 

(b) 6(1 — x — y) for 0 <x, 
O<yandx+y< 1 

(c) 1 forx = 0 

(d) 4(1 — 2x) for x < 0.5 

(a) 0.032 (b) 0.0267 


5-57. 


5-65. 


Section 5-5 


5-67. Oxy = 0.703, pyy = 0.885 
5-69. c = 1/36, Oy = —1/36, 
Pyy = —0.0435 

Syy = —1/3, pyy = —1/2 

c = 9.5, Oyy = 1.852, 

pxy = 0.928 

X and Y are independent and 


Oxy = Pyr = 0 


5-71. 
5-73. 


5-75. 


Section 5-6 
5-81. 0.827 


Section 5-7 

5-87. (a) 30 (b) 97 (c) 0.5 
(d) 0.846 

(a) E(T) = 4, 0, = 0.1414 
(b) 0.017 

(a) E(D) = 1/8, cp = 0.140 
(b) 0.187 (c) 0.187 

(a) 0.05 (b) 0.023 

(c) 12.129 

(d) 0.388 (e) 136 

(a) 0.023 (b) 4558 


5-89. 
5-91. 


5-93. 


5-95. 


Supplemental 
5-97. (a) 3/8 (b) 3/4 (c) 3/4 
(d) 3/8 
(e) E(X) = 7/8, VX) = 
39/64, E(Y) = 7/8, 
VY) = 39/64 


5-99. (a) 0.0631 (b) 0.122 
(c) E(X) = 2, WX) = 1.8 
(d) F199) = 060/7, 
fxp(1) = 0.333 
(e) 1/3 
5-103. (b) 1/3 (c) No 
5-105. (a) 0.0093 (b) 0.5787 
(c) 0.75 (d) 0.2199 
(e) 2.25 (T) 1.3333 
5-107. (a) 1/2 (b) 1/4 (c) 1/ for 
xX + y <1 
(d) 2V1 — x^/m for -1«x«1 
5-109. 3/4 
5-111. (a) 0.085 (b) Bin(10, 0.3) 
(c) 3 
5-113. (a) 0.499 (b) 0.5 
5-115. (a) 0.057 (b) 0.057 
5-119. (a) E(T) = 1.5, V(T) = 0.078 


(b) =0 
(c) E(P) = 4, V(P) = 0.568 
EN = 5,0 = V3 
CHAPTER 6 
Section 6-1 
6-1. x = 74.0044, s = 0.00473 
6-3. x = 7068.1, s = 226.5 
6-5. x = 43.975, s = 12.294 
6-7. w= 544 
6-11. (a) x = 7.184 
(b) s = 0.02066 
6-13. (a) x = 65.85, 5 = 12.16 
(c) x = 66.86, s = 10.74 
Section 6-3 
6-19. 
Variable N Median QI Q3 
cycles 70 1436.5 1097.8 1735.0 
6-21. 
Variable N Median QI Q3 
yield 90 89.250 86.100 93.125 
6-25. x = 260.7, s = 13.03, and 
X = 261.15 
6-27. (b) x = 89,5 = 2.8, and 
X —90 (c) 22/40 
Section 6-5 
6-43. (a) x — 40 (b) s = 0.867, 
s — 0.931 
6-45. (a) x = 952.44, $ = 9.55, 
s — 3.09 


(b) X = 953, largest value can 
increase by any amount 


6-47. 


(a) X = 48.125, ¥ = 49 
(b) s? = 7.246, s = 2.692 


Supplemental 


6-73. 


6-75. 


6-79. 


(a) x = 65.083 

(b) s? = 1.86869, s = 1.367 

(a) Sample 1: range — 4; 
Sample 2: range = 4 

(b) Sample 1: s = 1.604; 
Sample 2: s — 1.852 

(b) x = 9.325, s = 4.48586 


CHAPTER 7? 
Section 7-2 


7-1. 
7-3. 
7-5. 
7-7. 


7-11. 


7-13. 


7-15. 
7-17. 


Estimator 1 
Estimator 2 
29 
Estimator 3 is most efficient; 
estimator 2 is the best “unbiased” 
estimator. 
(a) 75.427 (b) 75.1 
(c) o? = 2.214, 0 = 1.488 
(d) 0.292 (e) 0.0385 
l 

(a) Yes (b) v5 o 

GT. o 


(b) se = m + i 
(b) 


jn» 


any 


am (D 010 


CO) a=- 


Section 7-3 


7-31. 


(a) 423.33, 82.4464 


Section 7-5 


7-33. 
7-35. 
7-37. 
7-39. 
7-41. 
7-43. 


7-45. 


0.8385 
0.43055 
0.3472 

12 

0.2313 

(a) 0.5885 
(b) 0.1759 
0.983 


Supplemental 


7-49. 
7-51. 
7-53. 
7-55. 


X, — X% ~ N(—5, 0.2233) 
0.8664 

1 (approximately) 

0 (approximately) 


on; (1 — alan, 


CHAPTER 8 


Section 8-2 


8-1. 97.9396, 99.36%, and 96.78% 

8-3. (a) 1.29 (b) 1.65 (c) 2.33 

8-5. 3 

8-7. (a) Longer 
(c) Yes 

8-9. (89.471, 91.489) 

8-11. (a) (74.0353, 74.0367) 

(b) (74.0355, ©) 

(a) (3232.11, 3267.89) 

(b) (3226.5, 3273.5) 

8-15. 267 

8-17. 4 


(b) No 


8-13. 


Section 8-3 


8-19. (oss = 2.131, fons io = 1.812, 
to.1020 = 1.325, f99o555 = 2.787, 
to.00130 = 3.385 

(a) fo05,14 = 1.761 

(b) foo119 = 2.359 

(€) to.001,24 = 3.467 

(1.108, oo) 

(a) Yes (b) (16.455, 17.505) 
(a) Yes (b) (8.216, 8.244) 
(4.023, cc) 

(1.093, 1.106) 


8-21. 


8-23. 
8-25. 
8-27. 
8-29. 
8-31. 


Section 8-4 


8-33. X 0.0510 = 18.31, 
X 0.025,15 E 


X 00142 = 20:22. 
X 0.00525 = 46.93, 
X 0.9520 = 1055, 
X oss = 7.01; 


X 0.995,16 — 2444 
0.00003075 < c? 


7,975,727.09 < o? 
0.31 < o < 0.46 
3.8 s< 


8-35. 
8-37. 
8-39. 
8-41. 


Section 8-5 


8-43. 622 
8-45. 666 
8-47. 5759 


Section 8-6 


8-49. 52131.1 < X,,, S 68148.3 
8-51. 263.5 < X, +1 S 370.9 
8-53. 2193.5 < X,,, = 2326.5 
8-55. 3.19 < X,,, 4.19 

8-57. 2.565 X,+; 322 

8-59. 228.1 X,,, S 2352 
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Section 8-7 


8-61. (0.408, 2.092) 
8-63. (15.14, 18.82) 
8-65. (8.16, 8.30) 
8-67. (3.91, ~) 

8-69. (1.06, 1.14) 


Supplemental 

8-71. (a) 0.1 S P-value = 0.25 
(b) 0.05 < P-value = 0.1 
(c) P-value = 0.00539 

(a) 40 (b) 23 

2178.51 = X,4, = 2341.49 
(a) Yes 

(b) 0.618 = u = 0.630 

(c) 0.588 = X,,, = 0.660 
(d) (0.583, 0.665) 

(a) Yes 

(b) 2.270 S p S 4.260 

(c) —1.297 = 4,43 = 7.827 
(d) (—3.113, 10.363) 

(a) 0.0021 = p = 0.0088 
(b) Yes 

(a) 0.210 < p = 0.274 
(b) 0.204 < p x 0.280 


8-75. 
8-77. 
8-79. 


8-81. 


CHAPTER 9 
Section 9-1 


9-1. (a) Yes (b) No (c) No 
(d) No (e) No 

9-3. (a) 0 (b) 0.02275 

9-5. 11.5875 

9-7. (a) 0.09296 (b) 0.04648 
(c) 0.00005 

9-9. (a) Reject Ho 

9-11. 

9-13. 

9-17. 

9-19. 


(b) 0.00889 
(a) 182.9 (b) 0.00776 
(a) 0.0164 (b) 0.21186 
(a) 0.08535 (b) 0 

(a) 0.29372 (b) 0.25721 


Section 9-2 

9-21. (a) Zz) = 0.36, do not reject Ho 
(b) P-value = 0.71884 (c) 5 
(d) 0.68054 (e) (87.85, 93.11) 
(a) Zo = —1.69, do not reject Ho 
(b) 0.091028 (c) 0 (d) 1 

(a) zy; = —14.43, reject Ho 

(b) 0 (c) (3232.11, 3267.89) 
(a) zy = 1.55, do not reject Ho 
(b) 0.02938 (c) 1 
(d) (99.888, oc) 

(a) Zo = 1.77, reject Ho 
(c) 35 (d) (4.003, o») 


9-23. 
9-25. 


9-27. 


9-29. (b) 1 
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Section 9-3 

9-31. (a) t = —3.48, reject Ho, 
P-value — 0.002 

(b) 1 (c) 35 

(d) (98.065, 98.463) 

(e) Yes 

(a) ty — —1.46, do not reject 
Ho, P-value = 0.156 

(b) Yes (c) 0.85 (d) 51 

(e) (129.406, 130.100) 

fy = 4.47, reject Ho, 

P-value « 0.0005 


9-33. 


9-35. 


9-37. (a) ty = —5.35, do not reject Ho 
(b) P-value > 0.4 
(c) 0.75 (d) 38 

9-39. (a) ty = 2.806, reject Ho 
(b) P-value = 0.004 (c) 1 
(d) 3 

9-41. (a) ty = 3.018, reject Ho 
(b) P-value — 0.0038 (c) 0.8 
(d) 38 

Section 9-4 

9-43. (a) x5 = 8.96, do not reject Ho 
(b) 0.5 « P-value « 0.9 
(c) 50 

9-45. (a) x5 = 4984.83, reject Hy 
(b) P-value « 0.005 

9-47. (a) xs = 109.52, reject Hy 
(b) 0.31 < o « 0.46 

9-49, 30 

Section 9-5 

9-51. 0.639, 118 

9-53. (a) zy = —0.53, do not reject Ho 
(b) P-value — 0.29806 

9-55. (a) Zo = 0.452, do not reject Ho 
(b) P-value = 0.67364 

9-57. (a) a = 0.0853 (b) B&0 

Section 9-7 

9-59. (a) x6 = 7.2, do not reject Ho 
(b) 0.05 « P-value « 0.10 

9-61. (a) xs = 1.72, do not reject Hy 
(b) 0.5 « P-value « 0.9 

9-63. (a) xs = 1.053, do not reject Hy 
(b) 0.1 « P-value « 0.5 

Section 9-8 

9-65. (a) xs = 11.65, do not reject Ho 
(b) 0.05 « P-value « 0.10 

9-67. (a) xg = 25.55, reject Hy 
(b) P-value « 0.005 

9-69. (a) xs = 10.71, do not reject Ho 


(b) 0.05 « P-value « 0.10 
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Supplemental 


9-71. (a) p(1 — p)/50 

(b) p(1 — p)/80 

(c) p(1 — p)/100 

(a) B = 0.564 

(b) B = 0.161 

(c) B = 0.116 

(a) 0.61026 (b) 0.995 

(c) 0.9988 

(a) x6 = 5.546, reject Hy 

(b) 0.01 « P-value « 0.025 
(a) xo = 1.75, do not reject Hj 
(a) x, = 17.929, reject Ho 

(b) P-value — 0.0123 

(a) xo = 63.36, reject Ho 

(a) zy; = —7.32, reject Ho 

(b) P-value = 0 

(c) xó = 12.0, reject Hy 

(b) to = 1.608, do not reject Ho 
(c) 0.1 < P-value < 0.2 


9-73. 


9-75. 
9-79, 


9-81. 
9-83. 


9-85. 
9-87. 


9-89. 


CHAPTER 10 


Section 10-2 


10-1. (a) Yes, cannot reject Ho 
(b) P-value = 0.3222 
(c) 0.9967 
(d) (—0.0098, 0.00298) 
(e) 9 
10-3. 1, Yes 
10-5. (a) (0.0987, 0.2813) 
(b) (0.0812, 0.299) 
(c) (~œ, 0.2813) 
(a) (—3.684, —2.116) 
(b) z = —7.254 reject Ho 
(c) P-value = 0 
10-9. 11 
10-11. (a) (—5.83, —0.57) (b) Yes 
10-13. z) = —2.385, reject Ho 
10-15. Yes 


10-7. 


Section 10-3 


10-17. (a) t, = 0.230, do not reject Ho 
(b) P-value > 0.80 
(c) (—0.394, 0.494) 
10-19. (a) 4 = —3.11, reject Ho 
(b) (—5.688, —0.3122) 
10-21. (a) t = —2.83, reject Ho 
(b) 0.010 « P-value « 0.020 
(c) (0.111, 0.749) 
10-23. (17.235, 44.765) 
10-25. t = —5.499, P-value < 0.0010 
10-27. (a) ty = 3.03, reject Ho 
(b) 0.005 « P-value « 0.010 
(c) ty = 3.03, reject Ho 


10-29. (—14.34, 21.94) 

10-31. (b) t = 2.558, reject Ho 
(c) P-value = 0.020 
(d) 0.05 (e)n= 5l 
(f) (1.86, 18.94) 


Section 10-4 


10-33. (0.1694, 0.3778) 

10-35. t = 0.357, cannot reject Ho 
10-37. (—727.46, 2464.21) 

10-39. f = 5.465, reject Ho 
10-41. % = 8.387, reject Ho 
10-43. t = 3.45, reject Ho 


Section 10-5 
10-45. (a) 1.59 (b) 2.28 (c) 2.64 
(d) 0.529 (e) 0.524 
(f) 0.311 
10-47. f, = 0.657, cannot reject Ho 
10-49. No 
10-51. (a) (0.08775, 3.594) 
(b) (0.0585, 5.3) 
(c) (0.137, œ) 
10-53. f, = 0.297, cannot reject Ho 
10-55. f, = 0.2575, cannot reject Ho 
10-57. (0.3369, 2.640) 
10-59. f = 0.640, cannot reject Ho 


Section 10-6 


10-61. z, = 1.49, cannot reject Ho 
10-63. (a) 0.81859 (b) 383 
10-65. (a) zy = 3.42, reject Ho, 

P-value — 0.00062 
10-67. (0.0434, 0.1616) 
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10-69. (1.40, 8.36) 
10-71. (a) ty = 2.554, reject Ho 
(b) fy = 2.554, cannot reject Ho 
(c) ty = —1.986, cannot reject Ho 
(d) t = —1.986, cannot 
reject Ho 
10-73. (a) zy = 6.55, reject Ho 
(b) zy = 6.55, reject Ho 
10-75. (a) (—0.0335, 0.0329) 
(b) (—0.0282, 0.0276) 
(c) (—0.0238, 0.0232), 
(—0.0203, 0.0200) 
10-79. 60 
10-81. 26 
10-83. (a) No (b) Yes 
(d) (18.124, 294.35) 
10-85. (b) t = —6.06, reject Ho 
10-87. (b) t, = —0.512, cannot 
reject Họ (c) 16 


10-89. (b) 4, = —2.74, reject Hy 
(c) 0.8 (d) 26 


CHAPTER 11 


Section 11-2 
11-1. (a) By = 48.013, B, = —2.330 
(b) 37.99 (c) 39.39 (d) 6.71 
11-3. (a) By = 0.4631476, 
B, — 0.0074902 
(b) B, = 0.00749 
11-5. (a) By = 13.3202, 
B, = 3.32437, 
ô? = 8.76775 
(b) 38.253 (c) —2.0273 
11-7. (a) By = 33.5348, 
B, — —0.0353971, 
ó? — 13.392 
(b) 28.226 (c) 1.50048 
11-9. (b) By = —9.8131, 


B, = 0.171484, 
ô? = 1.9818 
(c) 4.76301 
11-11. (b) By = 0.470467, 
B, = 20.5673, 
ô? = 13.81 


(c) 21.038 (d) 1.6629 
11-17. ĝo = 0, 8; = 21.031461 


Section 11-5 
11-19. (a) fy = 73.95, P-value = 
0.000001, reject Ho 
(b) se(B,) = 0.0004839, 
se(Bo) = 0.04091 
11-21. (a) t = 8.518, reject Ho 
(b) fo = 72.5563, reject Ho 
(c) se(B,) = 0.3902, 
se(Bo) = 2.5717 
(d) ty = 5.2774, reject Ho 
11-23. (a) fo = 4.53158, do not reject 
Ho, P-value = 0.04734 
(b) se(B,) = 0.0166281, 
se(Bo) = 2.61396 
(c) ty = 0.87803, P-value = 
0.804251, do not reject Ho 
(d) ty = 12.8291, P-value = 0, 
reject Ho 
11-25. (a) fo = 44.6567, reject Ho 
P-value = 0.000003 
(b) se(B,) = 0.0256613, 
se(Bo) = 2.13526 
(c) ty = —4.59573, reject Ho, 
P-value = 0.00022 
11-27. (a) fo = 155, reject Ho 
P-value < 0.00001 


(b) se(B,) = 0.0256613, 
se(Bo) = 2.13526 

(c) t = —4.59573, reject Ho 
P-value — 0.00022 

(d) ty = 57.8957, reject Ho 
P-value « 0.00001 

(e) ty = 2.7651, reject Ho 
P-value = 0.0064 


Section 11-6 and Section 11-7 


11-31. (a) (—2.9175, —1.7421) 

(b) (46.7145, 49.3114) 

(c) (41.3293, 43.0477) 

(d) (38.4289, 46.1281) 
11-33. (a) (—0.00961, —0.00444) 

(b) (16.2448, 27.3318) 

(c) (7.91433, 10.37167) 

(d) (4.07214, 14.21386) 
11-35. (a) (9.10130, 9.31543) 

(b) (—11.6219, — 1.04911) 

(c) (498.72024, 501.52776) 

(d) (495.57344, 504.67456) 
11-37. (a) (0.03689, 0.010183) 

(b) (—47.0877, 14.0691) 

(c) (44.0897, 49,1185) 

(d) (37.8298, 55.3784) 
11-39. (a) (201.552, 226.590) 

(b) (—4.67015, —2.346960) 

(c) (111.8339, 145.7941) 
11-41. (a) (—43.1964, —30.7272) 

(b) (2530.09, 2720.68) 

(c) (1823.7833, 1948.5247) 

(d) (1668.9013, 2103.4067) 


Section 11-8 

11-43. (d) R? = 76.73% 

11-45. (a) R? = 20.112196 
(c) Yes 

11-47. (a) R? = 71.27% 

11-49. (a) R? = 85.22% 


Section 11-10 


11-55. (a) By = —0.0280411, 
B, = 0.990987 
(b) fo = 79.838, reject Ho 
(c) 0.903 (d) t = 8.9345, 
reject Ho 
(e) zy = 3.879, reject Ho 
(f) (0.7677, 0.9615) 
11-57. (a) r = —0.738027 
(b) tj = —5.577, reject Ho, 
P-value = 0.00000738 
(c) (—0.871, —0.504) 
(d) zy; = —0.394, do not reject 
Ho, P-value = 0.6936 
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11-59. (a) t = 5.47, reject Ho, 
P-value = 0 
(b) (0.3358, 0.8007) (c) Yes 
11-61. (a) r = 0.933203 
(b) t = 10.06, reject Ho 
(c) By = 0.72538, 
B, — 0.498081, 
fo = 101.16, reject Ho 
(d) t) = 0.468345, do not 
reject Ho 


Supplemental 
11-65. (a) By = 93.34, B, = 15.64 
(b) fo = 12.872, reject Ho 
(c) (7.961, 23.322) 
(d) (74.758, 111.923) 
(e) (126.18, 138.70) 
11-67. (b) By = —0.8819, 
B, = 0.00385 
(c) fo = 122.03, reject Ho 
(d) No. (e) By = 0.5967, 
à* = 0.00097 
11-69. $ = 0.7916x 
11-71. (b) By = —193, 8, = 15.296 
(c) (—4.912, 35.504) 
11-75. (b) By = 66, 8, = 0.930 
(c) fo = 19.79, reject Ho 
R= 71.2% 
(d) tj = —0.1953, cannot 
reject Ho 


CHAPTER 12 


Section 12-1 
171.054 
3.713 
— 1.126 
(c) 189.481 
12-3. (b) 2 
12-5. (a) ĵ= 33.4491 — 0.05435x, + 
1.07822x; 
(b) 8.03 (c) 19.30 
12-7. (a) $ = 383.80 — 3.6381», — 
0.1119x; 
(b) 6? = 153.0, se(Bo) = 36.22, 
se(B,) = 0.5665, 
se(B5) = 0.04338 
(c) 180.95 
(d) $ = 484.0 — 7.656 
x, — 0.222 
xj — 0.0041 x12 
(e) 6? = 147.0, se(Bo) = 101.3, 
) = 3.846, 
) = 0.113, 
se(B1,) = 0.0039 
(f) —31.3 


12-1. (b B = 


se(B, 
se(B» 
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12-9. (a) ĵ = 47.174 — 9.7352x, 
+ 0.4283x; + 18.2375x; 
(b) 12. (c) se(Bo) = 49.5815, 
se(B,) = 3.6916, 
se(B,) = 0.2239, 
se(B4) = 1.312. (d) 91.43 
12-11. p = —8.0119 + 0.494x, 
+ 0.0018x; + 0.0023x, 
+ 0.0383x4 — 0.2068x; 
— 0.0128x5 + 0.030x; 
+ 0.0407xg — 0.2083x5 


ô? = 2.32, se(Bo) = 16.18, 
se(B,) = 0.0481, 

se(ĝ2) = 0.0064, 

se(B3) = 0.0209, 

se(B4) = 0.0515, 

se(B;) = 0.2611, 

se(Bc) = 0.0266, 

se(B7) = 0.038, 

se(Bg) = 0.1483, 

se(Bo) = 0.1110 


Section 12-2 
12-13. (a) 260.09, reject Ho 
(b) reject Ho both significant 
12-15. (a) f = 25.7465, reject Ho 
P-value « 0.000001 
(b) Reject Ho, all coefficients 
are significant 
12-17. (a) fp = 53.3162, reject Ho 
(b) Only Q, is significant 
12-19. (a) fo = 10.08, P-value = 0.005 
(b) Only B; is significant 
12-21. (a) f = 67.92, reject Ho 
(b) fo = 1.07, do not reject Ho 
(c) 147.0 
12-23. (a) fy = 850.55, reject Ho 
(b) Regression coefficients for 
x, and x; are significant 
12-25. (a) fy = 101.79, reject Ho 
(b) Only regression coefficient 
for “PTS” is significant 
(c) y= —5.531 + 0.497xprs — 
0.004xppc, fo = 510.12 
reject Ho only regressor 
“PTS” is significant 


Section 12-3 and Section 12-4 


12-27. (a) (—0.00657, —0.00122) 
(b) 0.497648 (c) (7.16, 9.22) 
12-29. (a) 0.0972 = B, = 1.4174, 
— 1.9646 = B, = 17.0026, 
— 1.7953 = B; = 6.7613, 
— 1.7941 = B, = 0.8319 
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(b) (272.44, 308.44) 
(c) (257.25, 323.64) 
12-31. (a) —0.595 = p, x 0.535, 
0.229 = B, = 0.812, 
—0.216 = B, = 0.013, 
—7298 x B, x 2.977 
(b) (7.982, 10.009) 
(c) (6.8481, 11.143) 
12-33. (a) —0.00003 = B, = 
0.00012, 0.00203 = B... 
< 0.00288, —0.02306 
E Duodioó = 0.05976, 
0.00501 = Borime Z 
0.01056, —0.01969 = 
Bpigpe 0.01342 
(b) (0.0206, 0.0234) 
12-35. (a) (0.3882, 0.5998) 
(b) y = —5.767703 + 
0.496501 xp. 
(c) (0.4648, 0.5282) 


Section 12-5 


12-37. (a) 0.82897 (d) No 
12-39. (a) 0.985 (b) 0.99 
12-41. (b) 0.9937 
12-43. (a) 0.9582 (c) 32 
12-45. (a) 0.12 

(b) 17 and 18 


Section 12-6 


12-47. (a) y = —1.633 + 1.232x — 
1.495x° 
(b) fo = 1858613, reject Ho 
(c) to = —601.64, reject Ho 
12-49. (a) 802.943 
(b) py = —26204.14 + 189.09x 
= 0.33147 
12-51. (a) » = —1.769 + 0.421x, + 
022215 — 0.128x3 — 
0.02x4x, + 0.009x,x, + 
0.003x,x, — 0.019x7 — 
0.007x3 + 0.001x3 
(b) fo = 19.628, reject Ho 
(d) fo = 1.612, do not reject Ho 
12-55. (a) Min. MS: x1, X3, X4, Xs, X7, 
Xg, X19, MS; = 6.58, 
C, = 5.8, Min. C,: xs, Xs, X10 
Ci 795.02, MSS = 197 
(b) » = 34.434 — 0.048x,, 
MSp = 5051.6, 555 
(c) Same as part (b) 
(d) py = 0.341 + 2.862x5 + 
0.246xa — 0.010x1o, 
MS; = 7.97, C, = 5.02 


12-57. (a) y = 4.656 + 0.511x4 — 
0.124x, 
(b) Same as part (a) 
(c) Same as part (a) 
(d) All models are the same 
12-59. (a) y = —0.304 + 0.083x, — 
0.03 1x4 + 0.00432, 
C, = 4.04, MSg = 0.004 
(b) » = —0.256 + 0.078x, + 
0.022x, — 0.042x3 + 
0.0008x3, 
C, = 4.66, MSg = 0.004 
12-61) (a) Min. C,: x), xo, C, = —1.67 
(b) Min. MS: x1, X7, Xo, 
M5, = 16/50. =0.7 
(c) Max. adjusted R? : x,, x7, Xo, 
Adj. R? — 0.98448 


Supplemental 


12-65. (a) f = 1323.62, reject Ho 
P-value « 0.00001 
(b) Only regressor x, 1s 
significant Ho 
12-67. (a) p = —0.908 + 5.482x, + 
1.126x5 — 3.920x; — 1.143x; 
(b) fo = 109.02, reject Ho, all 
regressors are significant 
12-69. (a) y = —3982.1 + 1.0964x, + 
0.1843x, + 3.7456x, + 
0.8343x; — 16.278 1x6, 
MS;(p) = 694.93, C, = 5.62 
(b) py = —4280.2 + 1.442x, + 
0.209x, + 0.6467x5 — 
17.5103x,, MS;(p) = 
714.20, C, = 5.57 
(c) Same as model b 
12-71. VIF(B3) = 51.86, 
VIF(B4) = 9.11 
VIF(Bs) = 28.99 
12-73. (a) fo = 18.28, reject Ho 
(b) fo = 2, do not reject Ho 


CHAPTER 13 


Section 13-2 
13-1. (a) Reject Ho 
(b) Model is satisfactory 
13-3. (a) Reject Hj (b) P-value = 0 
13-5. (a) Reject Họ (c) (140.71, 
149.29), (7.36, 24.14) 
13-7. (a) Do not reject Ho 
(b) P-value = 0.214 
13-9. (a) Reject Ho 
(b) P-value — 0.002 


(d) (69.17, 81.81) 
(e) (8.42, 26.33) 
13-19. 5 


Section 13-3 


13-21. (a) Reject Họ (b) 0.01412 
(c) 0.0148 

13-23. (a) Do not reject Họ (b) 0 
(c) 24 


Section 13-4 

13-25. (a) Reject Ho 
13-27. (a) Do not reject Ho 
13-29. (a) Do not reject Ho 


Supplemental 
13-31. (a) Reject Ho 

(c) (132.97, 147.83) 
13-35. (a) Reject Ho 

(b) P-value = 0.007 
13-37. (a) Reject Ho 
13-39. (a) 0.85 (b) 5 


CHAPTER 14 


Section 14-4 

14-1. (a) Reject H for both main 
effects and the interaction 

(a) Reject Ho for main effects 

(—3.40, 7.64) 

(a) Reject Ho for both main 
effects and the interaction 


14-3. 
14-7. 
14-9. 


Section 14-5 


14-11. (a) All these main effects are 
significant and the hardwood 
concentration-freeness inter- 
action 1s significant at 
a — 0.05. The P-value for 
the hardwood-cooking time 
interaction is 0.075, it is 
possibly an important effect 
as well. 


Section 14-7 


14-13. (a) Reject Hp for factors B, C, 
and AC 
(b) » = 413.125 + 9.125x, + 
45.12x, + 35.87x3 — 
59.62x 1x3 
14-15. p = 175.25 — 4.12x, + 
5.19x» + 0.19x4 + 4.62xqx4 
14-17. (b) Reject Ho for factor B 
14-19. (a) Factors A, B, C, and AB 


14-21. (b) Factors A, B, and AB 
(c) » = 400 + 40.124x, — 
32.75% 26.623X4X» 


Section 14-8 


14-23. Block 1: (1) ab ac bc 
Block 2: a b c abc 
There are no significant factors 
14-25. Block 1: (1) ac bd abcd 
Block 2: a c abd bcd 
Block 3: b abc d acd 
Block 4: ab bc ad cd 
Factor A is significant 
14-27. Block 1: (1) ab de acd bcd ace 
bce abde 
Block 2: a b cd ce ade bde abce 
abcd 
Block 3: d e bc bd abd abe acde 
bcde 
Block 4: c ad ae bd be abc cde 
abcde 
14-29. (a) Factors A, C, AB, and AC are 
significant 


Section 14-9 


14-31. (a) Factors A, B, and D are 
significant 
(c) Factors A, B, D, AB, and 
AD are significant 
14-33. (b) Design Generators: D — AB, 
E — AC; Defining Relation: / — 
ABD = ACE = BCDE; Aliases: 
A = BD = CD = ABCDE, B = 
AD = CDE = ABCE; C = 
AE = BDE = ABCD, D = 
AB = BCE = ACDE, E = 
AC = BCD = ABDE (c) A = 
—1.525, B = —5.175, C = 
2.275, D = —0.675, E = 2.275 
14-35. 2^ ! replicated twice 
14-37. Factors A, B, and D are significant 
14-39. Design Generators: D — AB, 
E = AC, F = BC; Defining 
Relations: / = ABD = ACE = 
BCF = BCDE = ACDF = 
ABEF = DEF; Aliases: A = 
BD = CE, B = AD = CF, C = 
AE = BF, D = AB = EF, E = 
AC = DF, F = BC = DE, 
AB = EF, AF = BE = CD 


Supplemental 


14-41. The main effect of pH and 
the interaction of pH and 
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Catalyst Concentration are sig- 
nificant 
14-43. The salts, application levels, and 
the interaction between salts and 
application levels are significant 
14-45. There are no significant factors 
14-47. (a) The factors V, P, G, and PG 
are significant. Effects 
P = —10.75, V = 15.75, 
G = —25.00, PG = 19.25 
(b) » = 102.75 + 7.88x, — 
2.925 DAE 
9.62x5x4 
14-49. The factors V and G are signifi- 
cant ata = 0.10. » = 100.63 — 
14.12%; — 15.13x4 
14-51. Design Generators: D = +AB, 
E = +AC; Defining Relations: / 
= ABD = ACE = BCDE; 
Aliases: A = BD = CE, B= 
AD = CDE, C = AE = BDE, 
D = AB = BCE, E=AC= 
BCD, BC = DE, BE = CD 
(a) E = ABCD (b) Factors A, 
B, C, E, and interaction BE are 
significant (c) Factor A is sig- 
nificant in affecting variability 


14-53. 


CHAPTER 15 


Section 15-2 


15-1. Do not reject Ho, P-value = 
0.109 
Reject Hp, P-value = 0.0002 
(a) Do not reject Ho 
(b) zy = 0.577, P-value = 0.281 
Zo = —1.34, do not reject Ho, 
P-value = 0.1802 
15-9. Do not reject Ho 
15-11. z) = 2.24, reject Ho 
15-13. Reject Ho 
15-15. z, = 2.84, reject Ho 
15-17. (a) 0.025 (b) 0.115 

(c) 0.011 (d) 0.1587 
15-19. P-value = 0.04 


15-3. 
15-5. 


15-7. 


Section 15-3 


15-21. w = 71 > 52, do not reject Ho 
15-23. w = 8 < 65, reject Ho 

15-25. w = 25 < 52, reject Ho 

15-27. w = 1 < 25, reject Ho 


Section 15-4 
15-29. w = 38 > 23, do not reject Ho 
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15-31. z, = —2.78, do not reject Ho, 
P-value — 0.9973 

15-33. w = 55 < 78, reject Ho 

15-35. z, = —3.77, reject Ho, 
P-value = 0.0001 


Section 15-5 


15-37. Reject Ho 
15-39. Do not reject Ho 
15-41. P-value = 0.018 


Supplemental 


15-43. Do not reject Ho, P-value =1 
15-45. Do not reject Ho 

15-47. Do not reject Ho 

15-49. Reject Ho 

15-51. Reject Ho 

15-53. Reject Ho 

15-55. Reject Ho, P-value = 0 
15-57. Reject Hp, P-value = 0.009 


CHAPTER 16 

Section 16-5 

16-1. (a) x chart: UCL = 37.5789, 
CL = 34.32, LCL = 31.0611 
R chart: UCL = 11.9461, 


CL = 5.65, LCL = 0 
(b) 1 point outside limits on 
x chart Revised limits: 
x chart: UCL — 37.4038, CL — 
34.0947, LCL = 30.7857, 
R chart: UCL = 12.1297, 
CL = 5.7368, LCL = 0 
16-3. (a) x chart: UCL = 17.40, 
CL = 15.09, LCL = 12.79 
Rchart: UCL = 5.792, 
CL = 2.25, LCL = 0 
(b) x chart: UCL = 17.96, 
CL = 15.78, LCL = 16.62 
R chart: UCL = 5.453, 
CL = 2.118, LCL = 0 
x chart: UCL = 17.42, 
CL = 15.09, LCL = 12.77 
s chart: UCL = 3.051, 
CL = 1.1188, LCL = 0 
revised limits: x chart: 
UCL = 17.95, CL = 15.78, 
LCL = 13.62, s chart: 
UCL = 2.848, CL = 1.109, 
LCL = 0 


— 
O 
wm 
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16-5. (a) x chart: UCL = 242.78, 
CL = 223, LCL = 203.22 
Rchart: UCL = 72.51, 
CL = 34.286, LCL = 0 
(b) p = 223, 6 = 14.74 
16-7. (a) x chart: UCL = 0.06347, 
CL = 0.06294, 
LCL = 0.0624 R chart: 
UCL = 0.001954, 
CL = 0.000924, LCL = 0 
x chart: UCL = 0.06346, 
CL = 0.06295, 
LCL = 0.06241 s chart: 
UCL = 0.000766, 
CL = 0.000367, LCL = 0 
The points are 1, 5, 14, 17, 
20, 21, and 22; or outside 
the control limits of the 
R chart: 6 and 15 


(b 


—S 


(c 


— 


Section 16-6 


16-9. (a) Z chart: UCL = 60.8887, 
CL = 53.05, LCL = 45.2113 
MR chart: UCL = 9.63382, 
CL = 2.94737, LCL = 0 
(b) p = 53.05, 6 = 2.613 
16-11. (a) / chart: UCL = 10.5358, 
CL — 10.0272, 
LCL = 9.51856 MR chart: 
UCL = 0.625123, 
CL = 0.19125, LCL = 0 
(b) Q = 10.027, 6 = 0.1696 


Section 16-7 
16-13. (a) PCR = PCR, = 1.5 


(b) 0 
16-15. (a) 0.00075 (b) PCR = 1.13, 
PCR, — 1.104 
16-17. (a) PCR = PCR, = 1.18 
(b) 0.00046 
16-19. (a) 0.0009 (b) PCR = 1.13, 
PCR, = 1.06 


16-21. PCR = 0.50, PCR, = 0.357 
16-23. PCR = 0.49, PCR, = 0.474 


Section 16-8 


16-25. (a) Uchart: UCL = 3.811, 
CL = 1.942, LCL = 0.0722 
(b) Revised limits: U chart: 
UCL = 3.463, 
CL = 1.709, LCL = 0 


16-27. (c) chart: UCL = 19.06, 
CL = 9.708, LCL = 0.3609 


Section 16-9 


16-29. (a) 0.2177 
16-31. (a) 0.1020 
16-33. (a) 0.0548 (b) 18.25 
16-35. (a) 0.1515 (b) 6.6 

16-37. (a) 0.16603 (b) 6.02 


(b) 4.6 
(b) 9.8 


Section 16-10 

16-39. (a) 6 — 0.1695 

16-41. (a) ARL — 38.0 
(b) ARL = 10.4 


Supplemental 


16-43. (a) x chart: UCL — 64.0181, 
CL — 64.0, LCL — 63.982 
R chart: UCL — 0.0453972, 
CL = 0.01764, LCL = 0 
(b) p = 64,6 = 0.0104 
(c) PCR = 0.641, 
(d) PCR, = 0.641 
(e) o^ = 0.025, 
(f) 0.1705, ARL = 5.87 
16-45. (a) p chart: UCL = 0.20387, 
CL = 0.11, LCL = 0.01613 
(b) p chart: UCL = 0.1717, 
CL = 0.106, 
LCL = 0.04092 
16-47. (a) c chart: UCL = 7.51442, 
CL = 2.64, LCL = 0 
(b) c chart: UCL = 6.50924, 
CL = 2.1304, LCL = 0 
16-49. (b) 6.30 (c) 2 
16-51. (a) x chart: UCL = 140.168, 


CL = 139.49, 
LCL = 138.812 R chart: 
UCL = 2.48437, 


CL = 1.175, LCL = 0 
(b) Revised control limits: x 
chart: UCL = 140.518, 


CL = 139.808, 
LCL = 139.098 R chart: 
UCL = 2.6023, 


CL = 1.231, LCL = 0 
(c) PCR = 1.26, PCR, = 1.14 
(d) o? = 0.091 
(e) 0.1877, ARL = 5.33 
16-53. (a) 0.96995 (b) 1 
16-57. 0.000135 


Glossary 


Acceptance region. In hypothesis testing, a region in 
the sample space of the test statistic such that if the 
test statistic falls within it, the null hypothesis 1s 
accepted (better terminology is that the null hypothe- 
sis cannot be rejected, since rejection is always a 
strong conclusion and acceptance is generally a weak 
conclusion). 


Addition rule. A formula used to determine the proba- 
bility of the union of two (or more) events from the 
probabilities of the events and their intersection(s). 


Additivity property of x”. If two independent random 
variables X, and X, are distributed as chi-square with v, 
and v, degrees of freedom respectively, Y = X, + X; 1s 
a chi-square random variable with u = v, + v, degrees 
of freedom. This generalizes to any number of inde- 
pendent chi-square random variables. 


Adjusted R?. A variation of the R? statistic that com- 
pensates for the number of parameters in a regression 
model. Essentially, the adjustment is a penalty for in- 
creasing the number of parameters in the model. 


Alias. In a fractional factorial experiment when certain 
factor effects cannot be estimated uniquely, they are said 
to be aliased. 


All possible (subsets) regressions. A method of vari- 
able selection in regression that examines all possible 
subsets of the candidate regressor variables. Efficient 
computer algorithms have been developed for imple- 
menting all possible regressions. 


Alternative hypothesis. In statistical hypothesis test- 
ing, this is a hypothesis other than the one that is being 
tested. The alternative hypothesis contains feasible con- 
ditions, whereas the null hypothesis specifies conditions 
that are under test. 


Analysis of variance. A method of decomposing the 
total variability in a set of observations, as measured by 


the sum of the squares of these observations from their 
average, into component sums of squares that are asso- 
ciated with specific defined sources of variation. 


Analytic study. A study in which a sample from a 
population is used to make inference to a future popu- 
lation. Stability needs to be assumed. See enumerative 
study. 


Arithmetic mean. The arithmetic mean of a set of 
numbers x, X2, ..., x, is their sum divided by the num- 
ber of observations, or (1/n) X;-;x, The arithmetic 
mean is usually denoted by x, and is often called the 
average. 


Assignable cause. The portion of the variability in a 
set of observations that can be traced to specific causes, 
such as operators, materials, or equipment. Also called a 
special cause. 


Attribute. A qualitative characteristic of an item or 
unit, usually arising in quality control. For example, 
classifying production units as defective or nondefec- 
tive results in attributes data. 


Attribute control chart. Any control chart for a dis- 
crete random variable. See variables control charts. 


Average. See Arithmetic Mean. 


Average run length, or ARL. The average number 
of samples taken in a process monitoring or inspec- 
tion scheme until the scheme signals that the process 
is operating at a level different from the level in which 
it began. 


Axioms of probability. A set of rules that probabilities 
defined on a sample space must follow. See probability. 


Backward elimination. A method of variable selec- 
tion in regression that begins with all of the candidate 
regressor variables in the model and eliminates the 
insignificant regressors one at a time until only signifi- 
cant regressors remain. 
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Bayes’ theorem. An equation for a conditional proba- 
bility such as P(A | B) in terms of the reverse conditional 
probability P(B | A). 

Bernoulli trials. Sequences of independent trials with 
only two outcomes, generally called “success” and “‘fail- 
ure," in which the probability of success remains constant. 


Bias. An effect that systematically distorts a statistical 
result or estimate, preventing it from representing the 
true quantity of interest. 


Biased estimator. See Unbiased estimator. 
Bimodal distribution. A distribution with two modes. 


Binomial random variable. A discrete random vari- 
able that equals the number of successes in a fixed 
number of Bernoulli trials. 


Bivariate normal distribution. The joint distribution 
of two normal random variables. 


Block. In experimental design, a group of experimental 
units or material that is relatively homogeneous. The 
purpose of dividing experimental units into blocks is to 
produce an experimental design wherein variability 
within blocks is smaller than variability between 
blocks. This allows the factors of interest to be com- 
pared in a environment that has less variability than in 
an unblocked experiment. 


Box plot (or box and whisker plot). A graphical dis- 
play of data in which the box contains the middle 50% 
of the data (the interquartile range) with the median 
dividing it, and the whiskers extend to the smallest and 
largest values (or some defined lower and upper limits). 


C chart. An attribute control chart that plots the total 
number of defects per unit in a subgroup. Similar to a 
defects-per-unit or U chart. 


Categorical data. Data consisting of counts or obser- 
vations that can be classified into categories. The 
categories may be descriptive. 


Causal variable. When y = f(x) and y is considered to 
be caused by x, x 1s sometimes called a causal variable. 


Cause-and-effect diagram. A chart used to organize 
the various potential causes of a problem. Also called a 
fishbone diagram. 


Center line. A horizontal line on a control chart at the 
value that estimates the mean of the statistic plotted on 
the chart. 


Center line. See Control chart. 


Central composite design (CCD). A second-order 
response surface design in k variables consisting of a 


two-level factorial, 2k axial runs, and one or more cen- 
ter points. The two-level factorial portion of a CCD can 
be a fractional factorial design when k is large. The 
CCD is the most widely used design for fitting a 
second-order model. 


Central limit theorem. The simplest form of the cen- 
tral limit theorem states that the sum of n independently 
distributed random variables will tend to be normally 
distributed as n becomes large. It is a necessary and 
sufficient condition that none of the variances of the 
individual random variables are large in comparison to 
their sum. There are more general forms of the central 
theorem that allow infinite variances and correlated 
random variables, and there is a multivariate version of 
the theorem 


Central tendency. The tendency of data to cluster 
around some value. Central tendency is usually ex- 
pressed by a measure of location such as the mean, me- 
dian, or mode. 


Chance cause of variation. The portion of the vari- 
ability in a set of observations that is due to only random 
forces and which cannot be traced to specific sources, 
such as operators, materials, or equipment. Also called a 
common cause. 


Chebyshev's inequality. A result that provides bounds 
for certain probabilities for arbitrary random variables. 


Chi-square (or chi-squared) random variable. A 
continuous random variable that results from the sum of 
squares of independent standard normal random vari- 
ables. It is a special case of a gamma random variable. 


Chi-squared test. Any test of significance based on 
the chi-square distribution. The most common chi- 
square tests are (1) testing hypotheses about the 
variance or standard deviation of a normal distribution 
and (2) testing goodness of fit of a theoretical distribu- 
tion to sample data. 


Coefficient of determination. See R?. 


Completely randomized design. A type of experi- 
mental design in which the treatments or design factors 
are assigned to the experimental units in a random 
manner. In designed experiments, a completely random- 
ized design results from running all of the treatment 
combinations in random order. 


Components of variance. The individual components 
of the total variance that are attributable to specific 
sources. This usually refers to the individual variance 
components arising from a random or mixed model 
analysis of variance. 


Conditional mean. The mean of the conditional prob- 
ability distribution of a random variable. 


Conditional probability. The probability of an event 
given that the random experiment produces an outcome 
in another event. 


Conditional probability density function. The proba- 
bility density function of the conditional probability 
distribution of a continuous random variable. 


Conditional probability distribution. The distribution 
of a random variable given that the random experiment 
produces an outcome in an event. The given event 
might specify values for one or more other random 
variables. 


Conditional probability mass function. The proba- 
bility mass function of the conditional probability 
distribution of a discrete random variable. 


Conditional variance. The variance of the conditional 
probability distribution of a random variable. 


Confidence coefficient. The probability 1 — œ associ- 
ated with a confidence interval expressing the probability 
that the stated interval will contain the true parameter 
value. 


Confidence interval. If it is possible to write a proba- 
bility statement of the form 


P(L£0s<sU)=1-a 


where L and U are functions of only the sample data and 
0 is a parameter, then the interval between L and U is 
called a confidence interval (or a 100(1 — a)% confi- 
dence interval). The interpretation is that a statement 
that the parameter 0 lies in this interval will be true 
100(1 — a)% of the times that such a statement is made. 


Confidence level. Another term for the confidence 
coefficient. 


Confounding. When a factorial experiment is run in 
blocks and the blocks are too small to contain a com- 
plete replicate of the experiment, one can run a fraction 
of the replicate in each block, but this results in losing 
information on some effects. These effects are linked 
with or confounded with the blocks. In general, when 
two factors are varied such that their individual effects 
cannot be determined separately, their effects are said to 
be confounded. 


Consistent estimator. An estimator that converges in 
probability to the true value of the estimated parameter 
as the sample size increases. 
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Contingency table. A tabular arrangement expressing 
the assignment of members of a data set according to 
two or more categories or classification criteria. 


Continuity correction. A correction factor used to im- 
prove the approximation to binomial probabilities from 
a normal distribution. 


Continuous distribution. A probability distribution 
for a continuous random variable. 


Continuous random variable. A random variable 
with an interval (either finite or infinite) of real numbers 
for its range. 


Continuous uniform random variable. A continuous 
random variable with range of a finite interval and a 
constant probability density function. 


Contour plot. A two-dimensional graphic used for a 
bivariate probability density function that displays 
curves for which the probability density function is 
constant. 


Control chart. A graphical display used to monitor a 
process. It usually consists of a horizontal center line 
corresponding to the in-control value of the parameter 
that is being monitored and lower and upper control 
limits. The control limits are determined by statistical 
criteria and are not arbitrary nor are they related to spec- 
ification limits. If sample points fall within the control 
limits, the process is said to be in-control, or free from 
assignable causes. Points beyond the control limits indi- 
cate an out-of-control process; that is, assignable causes 
are likely present. This signals the need to find and re- 
move the assignable causes. 


Control limits. See Control chart. 


Convolution. A method to derive the probability den- 
sity function of the sum of two independent random 
variables from an integral (or sum) of probability 
density (or mass) functions. 


Cook's distance. In regression, Cook's distance is a 
measure of the influence of each individual observation 
on the estimates of the regression model parameters. It 
expresses the distance that the vector of model parame- 
ter estimates with the ith observation removed lies from 
the vector of model parameter estimates based on all ob- 
servations. Large values of Cook's distance indicate that 
the observation is influential. 


Correction factor. A term used for the quantity 
(l/n( Et,xy that is subtracted from Xps 
to give the corrected sum of squares defined as 
(1/n) Ei4(x; — xy. The correction factor can also be 
written as nx?. 
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Correlation. In the most general usage, a measure of 
the interdependence among data. The concept may 
include more than two variables. The term is most com- 
monly used in a narrow sense to express the relationship 
between quantitative variables or ranks. 


Correlation coefficient. A dimensionless measure of 
the interdependence between two variables, usually 
lying in the interval from —1 to +1, with zero indi- 
cating the absence of correlation (but not necessarily 
the independence of the two variables). The most 
common form of the correlation coefficient used in 
practice is 


r= W [b -» - 3) 





which is also called the product moment correlation co- 
efficient. It is a measure of the linear association be- 
tween the two variables y and x. 


Correlation matrix. A square matrix that contains the 
correlations among a set of random variables, say Xj, 
X, ..., X,. The main diagonal elements of the matrix are 
unity and the off diagonal elements r; are the correla- 
tions between X; and X;. 


Counting techniques. Formulas used to determine the 
number of elements in sample spaces and events. 


Covariance. A measure of association between two 
random variables obtained as the expected value of the 
product of the two random variables around their 
means; that is, Cov(X, Y) = E[(X — wy)(Y — py)]. 
Covariance matrix. A square matrix that contains the 
variances and covariances among a set of random vari- 
ables, say X;, X5, ..., X,. The main diagonal elements of 
the matrix are the variances of the random variables and 
the off diagonal elements are the covariances between 
X; and X;. Also called the variance-covariance matrix. 
When the random variables are standardized to have 
unit variances, the covariance matrix becomes the cor- 
relation matrix. 


Critical region. In hypothesis testing, this is the por- 
tion of the sample space of a test statistic that will lead 
to rejection of the null hypothesis. 


Critical value(s). The value of a statistic corresponding 
to a stated significance level as determined from the 
sampling distribution. For example, if P(Z = Zo 95) = 
P(Z = 1.96) = 0.05, then Zo 95 = 1.96 is the critical 
value of z at the 0.05 level of significance. 


Crossed factors. Another name for factors that are 
arranged in a factorial experiment. 


Cumulative distribution function. For a random vari- 
able X, the function of X defined as P(X S x) that is 
used to specify the probability distribution. 


Cumulative normal distribution function. The cumu- 
lative distribution of the standard normal distribution, 
often denoted as ®(x) and tabulated in Appendix Table II. 


Cumulative sum control chart (CUSUM). A control 
chart in which the point plotted at time ¢ is the sum of 
the measured deviations from target for all statistics up 
to time f. 


Curvilinear regression. An expression sometimes 
used for nonlinear regression models or polynomial re- 
gression models. 


Decision interval. A parameter set in a Tabular CUSUM 
algorithm that is determined from a trade-off between 
false alarms and the detection of assignable causes. 


Defect. Used in statistical quality control, a defect 1s 
a particular type of nonconformance to specifications 
or requirements. Sometimes defects are classified into 
types, such as appearance defects and functional 
defects. 


Defects-per-unit control chart. See U chart. 


Degrees of freedom. The number of independent com- 
parisons that can be made among the elements of a sam- 
ple. The term is analogous to the number of degrees of 
freedom for an object in a dynamic system, which is the 
number of independent coordinates required to deter- 
mine the motion of the object. 


Deming. W. Edwards Deming (1900-1993) was a 
leader in the use of statistical quality control. 


Deming’s 14 points. A management philosophy 
promoted by W. Edwards Deming that emphasizes the 
importance of change and quality. 


Density function. Another name for a probability den- 
sity function. 


Dependent variable. The response variable in regres- 
sion or a designed experiment. 


Discrete distribution. A probability distribution for a 
discrete random variable. 


Discrete random variable. A random variable with a 
finite (or countably infinite) range. 


Discrete uniform random variable. A discrete 
random variable with a finite range and constant proba- 
bility mass function. 


Dispersion. The amount of variability exhibited by data. 


Distribution free methods(s). Any method of infer- 
ence (hypothesis testing or confidence interval con- 
struction) that does not depend on the form of the 
underlying distribution of the observations. Sometimes 
called nonparametric method(s). 


Distribution function. Another name for a cumulative 
distribution function. 


Efficiency. A concept in parameter estimation that uses 
the variances of different estimators; essentially, an esti- 
mator is more efficient than another estimator if it has 
smaller variance. When estimators are biased, the con- 
cept requires modification. 


Enumerative study. A study in which a sample from a 
population is used to make inference to the population. 
See Analytic study. 


Erlang random variable. A continuous random vari- 
able that is the sum of a fixed number of independent, 
exponential random variables. 


6-error (or Q-risk). In hypothesis testing, an error in- 
curred by failing to reject a null hypothesis when it 1s 
actually false (also called a type II error). 


a-error (or a-risk). In hypothesis testing, an error in- 
curred by rejecting a null hypothesis when it is actually 
true (also called a type I error). 


Error mean square. The error sum of squares divided 
by its number of degrees of freedom. 


Error of estimation. The difference between an esti- 
mated value and the true value. 


Error sum of squares. In analysis of variance, this 1s 
the portion of total variability that 1s due to the ran- 
dom component in the data. It is usually based on 
replication of observations at certain treatment com- 
binations in the experiment. It is sometimes called the 
residual sum of squares, although this is really a better 
term to use only when the sum of squares is based on 
the remnants of a model fitting process and not on 
replication. 


Error variance. The variance of an error term or com- 
ponent in a model. 


Estimate (or point estimate). The numerical value of 
a point estimator. 


Estimator (or point estimator). A procedure for pro- 
ducing an estimate of a parameter of interest. An esti- 
mator is usually a function of only sample data values, 
and when these data values are available, it results in an 
estimate of the parameter of interest. 
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Event. A subset of a sample space. 


Exhaustive. A property of a collection of events that 
indicates that their union equals the sample space. 


Expected value. The expected value of a random vari- 
able X is its long-term average or mean value. In the 
continuous case, the expected value of X is 
E(X) = f, xf(x)dx where f(x) is the density function 
of the random variable X. 

Exponential random variable. A continuous random 
variable that is the time between counts in a Poisson 
process. 


Factorial experiment. A type of experimental design 
in which every level of one factor is tested in combina- 
tion with every level of another factor. In general, in a 
factorial experiment, all possible combinations of factor 
levels are tested. 


F-distribution. The distribution of the random vari- 
able defined as the ratio of two independent chi-square 
random variables each divided by their number of 
degrees of freedom. 


Finite population correction factor. A term in the for- 
mula for the variance of a hypergeometric random 
variable. 


First-order model. A model that contains only first- 
order terms. For example, the first-order response 
surface model in two variables is y = Bo + B,x, + 
8.x, + e. A first-order model is also called a main 
effects model. 


Fixed factor (or fixed effect). In analysis of variance, 
a factor or effect is considered fixed if all the levels of 
interest for that factor are included in the experiment. 
Conclusions are then valid about this set of levels only, 
although when the factor is quantitative, it is customary 
to fit a model to the data for interpolating between these 
levels. 


Forward selection. A method of variable selection in 
regression, where variables are inserted one at a time 
into the model until no other variables that contribute 
significantly to the model can be found. 


Fraction defective control chart. See P chart. 


Fraction defective. In statistical quality control, that 
portion of a number of units or the output of a process 
that is defective. 


Fractional factorial. A type of factorial experiment 
in which not all possible treatment combinations are 
run. This is usually done to reduce the size of an ex- 
periment with several factors. 
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Frequency distribution. An arrangement of the fre- 
quencies of observations in a sample or population 
according to the values that the observations take on. 


F-test. Any test of significance involving the F-distri- 
bution. The most common F-tests are (1) testing 
hypotheses about the variances or standard deviations 
of two independent normal distributions, (2) testing 
hypotheses about treatment means or variance compo- 
nents in the analysis of variance, and (3) testing signifi- 
cance of regression or tests on subsets of parameters in 
a regression model. 


Gamma function. A function used in the probability 
density function of a gamma random variable that can 
be considered to extend factorials. 


Gamma random variable. A random variable that 
generalizes an Erlang random variable to noninteger 
values of the parameter r. 


Gaussian distribution. Another name for the normal 
distribution, based on the strong connection of Karl F. 
Gauss to the normal distribution; often used in physics 
and electrical engineering applications. 


Generating function. A function that is used to deter- 
mine properties of the probability distribution of a 
random variable. See Moment generating function. 


Geometric mean. The geometric mean of a set of n 
positive data values is the nth root of the product of the 
data values; that is g = (IIx). 


Geometric random variable. A discrete random vari- 
able that 1s the number of Bernoulli trials until a success 
occurs. 


Goodness of fit. In general, the agreement of a set 
of observed values and a set of theoretical values 
that depend on some hypothesis. The term is often 
used in fitting a theoretical distribution to a set of 
observations. 


Harmonic mean. The harmonic mean of a set of data 
values is the reciprocal of the arithmetic mean of the 
o> l i 
reciprocals of the data values; that is, A = G bum x) 
l 

Hat matrix. In multiple regression, the matrix 
H = X(X'X) !X'. This a projection matrix that maps 
the vector of observed response values into a vector of 
fitted values by $ = X(X'X) !X'y = Hy. 

Histogram. A univariate data display that uses rectan- 
gles proportional in area to class frequencies to visually 
exhibit features of data such as location, variability, and 
shape. 


Hypergeometric random variable. A discrete random 
variable that is the number of success obtained from a sam- 
ple drawn without replacement from a finite populations. 


Hypothesis (as in statistical hypothesis). A statement 
about the parameters of a probability distribution or a 
model, or a statement about the form of a probability 
distribution. 


Hypothesis testing. Any procedure used to test a sta- 
tistical hypothesis. 


Independence. A property of a probability model and 
two (or more) events that allows the probability of the 
intersection to be calculated as the product of the prob- 
abilities. 

Independent random variables. Random variables 
for which P(X€ A, Y€ B) = P(X€ A)P(Y € B) for any 
sets A and B in the range of X and Y, respectively. There 
are several equivalent descriptions of independent ran- 
dom variables. 


Independent variable. The predictor or regressor vari- 
ables in a regression model. 


Indicator variable(s). Variables that are assigned nu- 
merical values to identify the levels of a qualitative or 
categorical response. For example, a response with two 
categorical levels (yes and no) could be represented 
with an indicator variable taking on the values 0 and 1. 


Individuals control chart. A Shewhart control chart in 
which each plotted point is an individual measurement, 
rather than a summary statistic. See control chart, 
Shewhart control chart. 


Interaction. In factorial experiments, two factors are 
said to interact if the effect of one variable is different at 
different levels of the other variables. In general, when 
variables operate independently of each other, they do 
not exhibit interaction. 


Intercept. The constant term in a regression model. 


Interquartile range. The difference between the third 
and first quartiles 1f a sample of data. The interquartile 
range is less sensitive to extreme data values than the 
usual sample range. 


Interval estimation. The estimation of a parameter by 
a range of values between lower and upper limits, in 
contrast to point estimation, where the parameter is 
estimated by a single numerical value. A confidence 
interval 1s a typical interval estimation procedure. 


Jacobian. A matrix of partial derivatives that is used to 
determine the distribution of transformed random 
variables. 


Joint probability density function. A function used to 
calculate probabilities for two or more continuous 
random variables. 


Joint probability distribution. The probability distri- 
bution for two or more random variables in a random 
experiment. See Joint probability mass function and 
Joint probability density function. 


Joint probability mass function. A function used to 
calculate probabilities for two or more discrete random 
variables. 


Kurtosis. A measure ofthe degree to which a unimodal 
distribution 1s peaked. 


Lack of memory property. A property of a Poisson 
process. The probability of a count in an interval 
depends only on the length of the interval (and not on 
the starting point of the interval). A similar property 
holds for a series of Bernoulli trials. The probability of 
a success in a specified number of trials depends only 
on the number of trials (and not on the starting trial). 


Least significance difference test (or Fisher's LSD 
test). An application of the t-test to compare pairs of 
means following rejection of the null hypothesis in an 
analysis of variance. The error rate 1s difficult to calculate 
exactly because the comparisons are not all independent. 


Least squares (method of). A method of parameter es- 
timation in which the parameters of a system are esti- 
mated by minimizing the sum of the squares of the 
differences between the observed values and the fitted 
or predicted values from the system. 


Least squares estimator. Any estimator obtained by 
the method of least squares. 


Level of significance. If Z is the test statistic for a hy- 
pothesis, and the distribution of Z when the hypothe- 
sis is true are known, then we can find the probabili- 
ties P(Z=z,) and P(Z = zy). Rejection of the 
hypothesis is usually expressed in terms of the 
observed value of Z falling outside the interval from 
Zr to zy. The probabilities P(Z = z;) and P(Z = zy) 
are usually chosen to have small values, such as 0.01, 
0.025, 0.05, or 0.10, and are called levels of signifi- 
cance. The actual levels chosen are somewhat arbi- 
trary and are often expressed in percentages, such as a 
5% level of significance. 


Likelihood function. Suppose that the random vari- 
ables Xi, X», ... , X, have a joint distribution given by 
f, X2, «++ 5 XQ; 01, O2, ..., 0,) where the Os are un- 
known parameters. This joint distribution, considered 
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as a function of the 0s for fixed x's, is called the like- 
lihood function. 


Likelihood principle. This principle states that the 
information about a model given by a set of data is com- 
pletely contained in the likelihood. 


Likelihood ratio. Let x,, x;, ..., x, be a random sample 
from the population f(x; 0). The likelihood function for 
this sample is L = IIi- f(x; 9). We wish to test the 
hypothesis Hy: 0 € œw, where w is a subset of the possi- 
ble values Q for 0. Let the maximum value of L with 
respect to 0 over the entire set of values that the 
parameter can take on be denoted by L(Q), and let the 
maximum value of L with 0 restricted to the set of val- 
ues given by œw be L(@). The null hypothesis is tested 
by using the likelihood ratio A = L(@)/L(Q), or a sim- 
ple function of it. Large values of the likelihood ratio 
are consistent with the null hypothesis. 


Likelihood ratio test. A test of a null hypothesis 
versus an alternative hypothesis using a test statistic de- 
rived from a likelihood ratio. 


Linear combination. A random variable that is 
defined as a linear function of several random variables. 


Linear model. A model in which the observations 
are expressed as a linear function of the unknown 
parameters. For example, y = Bo + Bix + € and y = 
Bo + Bix + Box? + e are linear models. 


Location parameter. A parameter that defines a 
central value in a sample or a probability distribution. 
The mean and the median are location parameters. 


Lognormal random variable. A continuous random 
variable with probability distribution equal to that of 
exp(W) for a normal random variable W. 


Main effect. An estimate of the effect of a factor (or 
variable) that independently expresses the change in 
response due to a change in that factor, regardless of 
other factors that may be present in the system. 


Marginal probability density function. The probabil- 
ity density function of a continuous random variable 
obtained from the joint probability distribution of two or 
more random variables. 


Marginal probability distribution. The probability 
distribution of a random variable obtained from the joint 
probability distribution of two or more random variables. 


Marginal probability mass function. The probability 
mass function of a discrete random variable obtained 
from the joint probability distribution of two or more 
random variables. 
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Maximum likelihood estimation. A method of param- 
eter estimation that maximizes the likelihood function of 
a sample. 


Mean. The mean usually refers either to the expected 
value of a random variable or to the arithmetic average 
of a set of data. 


Mean square. In general, a mean square is deter- 
mined by dividing a sum of squares by the number of 
degrees of freedom associated with the sum of 
squares. 


Mean square(d) error. The expected squared deviation 
ofan estimator from the true value of the parameter it es- 
timates. The mean square error can be decomposed into 
the variance of the estimator plus the square of the bias; 
that is, MSE(@) = E(Ó — 0 = (Ô) + [E(O) — ep. 
Median. The median of a set of data is that value that 
divides the data into two equal halves. When the number 
of observations is even, say 2n, it is customary to define 
the median as the average of the nth and (n + 1)st rank- 
ordered values. The median can also be defined for a 
random variable. For example, in the case of a continu- 
ous random variable X, the median M can be defined 
as f^ f(x)dx = fj f(x)dx = 1/2. 

Method of steepest ascent. A technique that allows an 
experimenter to move efficiently towards a set of opti- 
mal operating conditions by following the gradient 
direction. The method of steepest ascent is usually em- 
ployed in conjunction with fitting a first-order response 
surface and deciding that the current region of operation 
is inappropriate. 


Mixed model. In an analysis of variance context, a 
mixed model contains both random and fixed factors. 


Mode. The mode of a sample is that observed value 
that occurs most frequently. In a probability distribu- 
tion f(x) with continuous first derivative, the mode is a 
value of x for which df(x)/dx = 0 and d?f(x)/dx? < 0. 
There may be more than one mode of either a sample 
or a distribution. 


Moment (or population moment). The expected value 
of a function of a random variable such as E(X — c) for 
constants c and r. When c = 0, it is said that the moment 
is about the origin. See Moment generating function. 


Moment estimator. A method of estimating parame- 
ters by equating sample moments to population mo- 
ments. Since the population moments will be functions 
ofthe unknown parameters, this results in equations that 
may be solved for estimates of the parameters. 


Moment generating function. A function that is used 
to determine properties (such as moments) of the 
probability distribution of a random variable. It is the 
expected value of exp(tX). See generating function 
and moment. 


Moving range. The absolute value of the difference 
between successive observations in time-ordered data. 
Used to estimate chance variation in an individuals con- 
trol chart. 


Multicollinearity. A condition occurring in multiple 
regression where some of the predictor or regressor 
variables are nearly linearly dependent. This condition 
can lead to instability in the estimates of the regression 
model parameters. 


Multinomial distribution. The joint probability distri- 
bution of the random variables that count the number of 
results in each of k classes in a random experiment with 
a series of independent trials with constant probability 
of each class on each trial. It generalizes a binomial 
distribution. 


Multiplication rule. For probability, A formula used 
to determine the probability of the intersection of two 
(or more) events. For counting techniques, a formula 
used to determine the numbers of ways to complete 
an operation from the number of ways to complete 
successive steps. 


Mutually exclusive events. A collection of events 
whose intersections are empty. 


Natural tolerance limits. A set of symmetric limits 
that are three times the process standard deviation from 
the process mean. 


Negative binomial random variable. A discrete ran- 
dom variable that is the number of trials until a specified 
number of successes occur in Bernoulli trials. 


Nonlinear regression model. A regression model that 
is nonlinear in the parameters. It is sometimes applied to 
regression models that are nonlinear in the regressors or 
predictors, but this is an incorrect usage. 


Nonparametric statistical method(s). See Distribution 
free method(s). 


Normal approximation. A method to approximate 
probabilities for binomial and Poisson random variables. 


Normal equations. The set of simultaneous linear 
equations arrived at in parameter estimation using the 
method of least squares. 


Normal probability plot. A specially constructed 
plot for a variable x (usually on the abscissa) in which 


y (usually on the ordinate) is scaled so that the 
graph of the normal cumulative distribution is a 
straight line. 


Normal random variable. A continuous random vari- 
able that is the most important one in statistics because 
it results from the central limit theorem. See Central 
limit theorem. 


NP chart. An attribute control chart that plots the total 
of defective units in a subgroup. Similar to a fraction- 
defective chart or P chart. 


Nuisance factor. A factor that probably influences the 
response variable, but which is of no interest in the cur- 
rent study. When the levels of the nuisance factor can be 
controlled, blocking is the design technique that is 
customarily used to remove its effect. 


Null hypothesis. This term generally relates to a par- 
ticular hypothesis that 1s under test, as distinct from the 
alternative hypothesis (which defines other conditions 
that are feasible but not being tested). The null hypothe- 
sis determines the probability of type I error for the test 
procedure. 


One-way model. In an analysis of variance context, 
this involves a single variable or factor with a different 
levels. 


Operating characteristic curves (OC curves). A plot 
of the probability of type II error versus some measure 
of the extent to which the null hypothesis is false. 
Typically, one OC curve is used to represent each sam- 
ple size of interest. 


Orthogonal. There are several related meanings, in- 
cluding the mathematical sense of perpendicular, two 
variables being said to be orthogonal if they are statisti- 
cally independent, or in experimental design where a 
design is orthogonal if it admits statistically independ- 
ent estimates of effects. 


Orthogonal design. See Orthogonal. 
Outcome. An element of a sample space. 


Outlier(s). One or more observations in a sample that are 
so far from the main body of data that they give rise to the 
question that they may be from another population. 


Overcontrol. Unnecessary adjustments made to 
processes that increase the deviations from target. 


Overfitting. Adding more parameters to a model than 
is necessary. 
P chart. An attribute control chart that plots the propor- 


tion of defective units in a subgroup. Also called a frac- 
tion-defective control chart. Similar to an NP chart. 
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Parameter estimation. The process of estimating 
the parameters of a population or probability distribu- 
tion. Parameter estimation, along with hypothesis 
testing, is one of the two major techniques of statisti- 
cal inference. 


Parameter. An unknown quantity that may vary over a 
set of values. Parameters occur in probability distri- 
butions and in statistical models, such as regression 
models. 


Pareto diagram. A bar chart used to rank the causes of 
a problem. 


PCR. A process capability ratio with numerator equal to 
the difference between the product specification limits 
and denominator equal to six times the process standard 
deviation. Said to measure the potential capability of the 
process because the process mean is not considered. See 
process capability, process capability ratio, process capa- 
bility study and PCR;. Sometimes denoted as C, in other 
references. 


PCR,. A process capability ratio with numerator equal 
to the difference between the product target and the 
nearest specification limit and denominator equal to 
three times the process standard deviation. Said to 
measure the actual capability of the process because the 
process mean is considered. See process capability, 
process capability ratio, process capability study, and 
PCR. Sometimes denoted as C, in other references. 


Percentage point. A particular value of a random vari- 
able determined from a probability (expressed as a 
percentage). For example, the upper 5 percentage point 
of the standard normal random variable is Z) 9; = 1.645. 


Percentile. The set of values that divide the sample into 
100 equal parts. 


Poisson process. A random experiment with counts 
that occur in an interval and satisfy the following 
assumptions. The interval can be partitioned into subin- 
tervals such that the probability of more than one count 
in a subinterval is zero, the probability of a count in a 
subinterval is proportional to the length of the subinter- 
val, and the count in each subinterval is independent of 
other subintervals. 


Poisson random variable. A discrete random variable 
that is the number of counts that occur in a Poisson 
process. 


Pooling. When several sets of data can be thought of as 
having been generated from the same model, it is possi- 
ble to combine them, usually for purposes of estimating 
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one or more parameters. Combining the samples for this 
purpose is usually called pooling. 


Population standard deviation. See standard deviation. 
Population variance. See variance. 


Population. Any finite or infinite collection of individ- 
ual units or objects. 


Power. The power of a statistical test 1s the probability 
that the test rejects the null hypothesis when the null hy- 
pothesis is indeed false. Thus the power is equal to one 
minus the probability of type II error. 


Prediction. The process of determining the value of 
one or more statistical quantities at some future point in 
time. In a regression model, predicting the response y 
for some specified set of regressors or predictor vari- 
ables also leads to a predicted value, although there may 
be no temporal element to the problem. 


Prediction interval. The interval between a set of 
upper and lower limits associated with a predicted value 
designed to show on a probability basis the range of 
error associated with the prediction. 


Predictor variable(s). The independent or regressor 
variables in a regression model. 


Probability density function. A function used to 
calculate probabilities and to specify the probability dis- 
tribution of a continuous random variable. 


Probability distribution. For a sample space, a 
description of the set of possible outcomes along with a 
method to determine probabilities. For a random 
variable, a probability distribution is a description of the 
range along with a method to determine probabilities. 


Probability mass function. A function that provides 
probabilities for the values in the range of a discrete ran- 
dom variable. 


Probability. A numerical measure between 0 and 1 as- 
signed to events in a sample space. Higher numbers in- 
dicate the event is more likely to occur. See axioms of 
probability. 


Process capability ratio. A ratio that relates the width of 
the product specification limits to measures of process 
performance. Used to quantify the capability of the 
process to produce product within specifications. See 
process capability, process capability study, PCR and 
PCR, 


Process capability study. A study that collects data to 
estimate process capability. See process capability, 
process capability ratio, PCR and PCR,. 


Process capability. The capability of a process to 
produce product within specification limits. See 
process capability ratio, process capability study, PCR, 
and PCR, 


P-Value. The exact significance level of a statistical 
test; that is, the probability of obtaining a value of the 
test statistic that is at least as extreme as that observed 
when the null hypothesis is true. 


Qualitative (data). Data derived from nonnumeric 
attributes, such as sex, ethnic origin or nationality, or 
other classification variable. 


Quality control. Systems and procedures used by an 
organization to assure that the outputs from processes 
satisfy customers. 


Quantiles. The set of n — 1 values of a variable that 
partition it into a number n of equal proportions. For 
example, n — 1 = 3 values partition data into four 
quantiles with the central value usually called the 
median and the lower and upper values usually called 
the lower and upper quartiles, respectively. 


Quantitative (data). Data in the form of numerical 
measurements or counts. 


Quartile(s). The three values of a variable that parti- 
tion it into four equal parts. The central value is usually 
called the median and the lower and upper values are 
usually called the lower and upper quartiles, respec- 
tively. Also see Quantiles. 


R^. A quantity used in regression models to measure the 
proportion of total variability in the response accounted 
for by the model. Computationally, R? — Dcus 
SS, and large values of R? (near unity) are consid- 
ered good. However, it is possible to have large values 
of R^ and find that the model is unsatisfactory. R? 
is also called the coefficient of determination (or the 
coefficient of multiple determination in multiple 
regression). 


Random. Nondeterministic, occurring purely by 
chance, or independent of the occurrence of other events. 


Random effects model. In an analysis of variance con- 
text, this refers to a model that involves only random 
factors. 


Random error. An error (usually a term in a statistical 
model) that behaves as if it were drawn at random from 
a particular probability distribution. 


Random experiment. An experiment that can result in 
different outcomes, even though it is repeated in the 
same manner each time. 


Random factor. In analysis of variance, a factor whose 
levels are chosen at random from some population of 
factor levels. 


Random order. A sequence or order for a set of objects 
that 1s carried out in such a way that every possible or- 
dering is equally likely. In experimental design the runs 
of the experiment are typically arranged and carried out 
in random order. 


Random sample. A sample is said to be random if it is 
selected in such a way so that every possible sample has 
the same probability of being selected. 


Random variable. A function that assigns a real num- 
ber to each outcome in the sample space of a random 
experiment. 


Randomization. A set of objects is said to be random- 
ized when they are arranged in random order. 


Randomized block design. A type of experimental de- 
sign in which treatment (or factor levels) are assigned to 
blocks in a random manner. 


Range. The largest minus the smallest of a set of data 
values. The range is a simple measure of variability and 
is widely used in quality control. 


Range (control) chart. A control chart used to monitor 
the variability (dispersion) in a process. See Control chart. 


Rank. In the context of data, the rank of a single ob- 
servation is its ordinal number when all data values are 
ordered according to some criterion, such as their mag- 
nitude. 


Rational subgroup. A sample of data selected in a man- 
ner to include chance sources of variation and to exclude 
assignable sources of variation, to the extent possible. 


Reference distribution. The distribution of a test 
statistic when the null hypothesis is true. Sometimes a 
reference distribution is called the null distribution of 
the test statistic. 


Reference value. A parameter set in a Tabular 
CUSUM algorithm that is determined from the magni- 
tude of the process shift that should be detected. 


Regression. The statistical methods used to investigate 
the relationship between a dependent or response 
variable y and one or more independent variables x. The 
independent variables are usually called regressor vari- 
ables or predictor variables. 


Regression coefficient(s). The parameter(s) in a re- 
gression model. 

Regression diagnostics. Techniques for examining a fit- 
ted regression model to investigate the adequacy of the fit 
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and to determine if any of the underlying assumptions 
have been violated. 


Regression line (or curve). A graphical display of a 
regression model, usually with the response y on the 
ordinate and the regressor x on the abcissa. 


Regression sum of squares. The portion of the total 
sum of squares attributable to the model that has been fit 
to the data. 


Regressor variable. The independent or predictor 
variable in a regression model. 


Rejection region. In hypothesis testing, this 1s the 
region in the sample space of the test statistic that leads 
to rejection of the null hypothesis when the test statistic 
falls in this region. 


Relative frequency. The relative frequency of an event 
is the proportion of times the event occurred in a series 
of trial of a random experiment. 


Reliability. The probability that a specified mission 
will be completed. It usually refers to the probability 
that a lifetime of a continuous random variable exceeds 
a specified time limit. 


Replicates. One of the independent repetitions of one 
or more treatment combinations in an experiment. 


Replication. The independent execution of an experi- 
ment more than once. 


Reproductive property of the normal distribution. 
A linear combination of independent, normal random 
variables is a normal random variable. 


Residual. Generally this is the difference between the 
observed and the predicted value of some variable. For 
example, in regression a residual is the difference 
between the observed value of the response and the 
corresponding predicted value obtained from the regres- 
sion model. 


Residual analysis. Any technique that uses the residu- 
als, usually to investigate the adequacy of the model that 
was used to generate the residuals. 


Residual sum of squares. See Error sum of squares. 


Response (variable). The dependent variable in a 
regression model or the observed output variable in a 
designed experiment. 


Response surface. When a response y depends on a 
function of k quantitative variables xj, x, ... , X} the 
values of the response may be viewed as a surface in 
k + 1 dimensions. This surface is called a response 
surface. Response surface methodology is a subset of 
experimental design concerned with approximating this 
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surface with a model and using the resulting model to 
optimize the system or process. 


Response surface designs. Experimental designs that 
have been developed to work well in fitting response 
surfaces. These are usually designs for fitting a first- or 
second-order model. The central composite design is a 
widely used second-order response surface design. 


Ridge regression. A method for fitting a regression 
model that is intended to overcome the problems associ- 
ated with using standard (or ordinary) least squares when 
there is a problem with multicollinearity in the data. 


Rotatable design. In a rotatable design, the variance of 
the predicted response is the same at all points that are 
the same distance from the center of the design. 


Run rules. A set of rules applied to the points plotted 
on a Shewhart control chart that are used to make the 
chart more sensitized to assignable causes. See control 
chart, Shewhart control chart. 


Sample. Any subset of the elements of a population. 


Sample mean. The arithmetic average or mean of the 
observations in a sample. If the observations are 
Xj, X>,..., X, then the sample mean is (1/1) X. xj. The 
sample mean is usually denoted by x. 


Sample moment. The quantity (1/1) >j_, x is called 


the kth sample moment. 
Sample range. See range. 
Sample size. The number of observations in a sample. 


Sample space. The set of all possible outcomes of a 
random experiment. 


Sample standard deviation. The positive square root 
of the sample variance. The sample standard deviation 
is the most widely used measure of variability of 
sample data. 


Sample variance. A measure of variability of sample 
data, defined as s* = [1/(n — 1)] iix; — x)’, where 
x is the sample mean. 


Sampling distribution. The probability distribution of 
a statistic. For example, the sampling distribution of the 
sample mean_X is the normal distribution. 


Scatter diagram. A diagram displaying observations on 
two variables, x and y. Each observation is represented by 
a point showing its x-y coordinates. The scatter diagram 
can be very effective in revealing the joint variability of x 
and y or the nature of the relationship between them. 


Screening experiment. An experiment designed and 
conducted for the purpose of screening out or isolating 


a promising set of factors for future experimentation. 
Many screening experiments are fractional factorials, 
such as two-level fractional factorial designs. 


Second-order model. A model that contains second- 
order terms. For example, the second-order response 
surface model in two variables is y = By + Bix; + 
Box, + Boxx + Bux? + ixi + e. The second order 
terms in this model are B ,x,x», B, 1x7, and Bo x5. 


Shewhart control chart. A specific type of control 
chart developed by Walter A. Shewhart. Typically, each 
plotted point is a summary statistic calculated from the 
data in a rational subgroup. See control chart. 


Sign test. A statistical test based on the signs of certain 
functions of the observations and not their magnitudes. 


Signed-rank test. A statistical test based on the differ- 
ences within a set of paired observations. Each differ- 
ence has a sign and a rank, and the test uses the sum of 
the differences with regard to sign. 


Significance. In hypothesis testing, an effect is said to 
be significant if the value of the test statistic lies in the 
critical region. 


Significance level. See Level of significance. 


Skewness. A term for asymmetry usually employed 
with respect to a histogram of data or a probability dis- 
tribution. 


Standard deviation. The positive square root of the 
variance. The standard deviation is the most widely 
used measure of variability. 


Standard error. The standard deviation of the estima- 
tor of a parameter. The standard error is also the stan- 
dard deviation of the sampling distribution of the esti- 
mator of a parameter. 


Standard normal random variable. A normal ran- 
dom variable with mean zero and variance one that has 
its cumulative distribution function tabulated in 
Appendix Table II. 


Standardize. The transformation of a normal random 
variable that subtracts its mean and divides by its standard 
deviation to generate a standard normal random variable. 


Standardized residual. In regression, the standardized 
residual is computed by dividing the ordinary residual by 
the square root of the residual mean square. This produces 
scaled residuals that have, approximately, a unit variance. 


Statistic. A summary value calculated from a sample 
of observations. Usually, a statistic 1s an estimator of 
some population parameter. 


Statistical Process Control. A set of problem-solving 
tools based on data that are used to improve a process. 


Statistical quality control. Statistical and engineering 
methods used to measure, monitor, control, and 
improve quality. 


Statistics. The science of collecting, analyzing, inter- 
preting, and drawing conclusions from data. 


Stem and leaf display. A method of displaying data in 
which the stem corresponds to a range of data values 
and the leaf represents the next digit. It is an alternative 
to the histogram but displays the individual observa- 
tions rather than sorting them into bins. 


Stepwise regression. A method of selecting variables 
for inclusion in a regression model. It operates by intro- 
ducing the candidate variables one at a time (as in for- 
ward selection) and then attempting to remove variables 
following each forward step. 


Studentized range. The range of a sample divided by 
the sample standard deviation. 


Studentized residual. In regression, the studentized 
residual 1s calculated by dividing the ordinary residual 
by its exact standard deviation, producing a set of scaled 
residuals that have, exactly, unit standard deviation. 


Sufficient statistic. An estimator is said to be a 
sufficient statistic for an unknown parameter if the 
distribution of the sample given the statistic does not 
depend on the unknown parameter. This means that the 
distribution of the estimator contains all of the useful in- 
formation about the unknown parameter. 


Tabular CUSUM. A numerical algorithm used to de- 
tect assignable causes on a cumulative sum control 
chart. See V mask. 


Tampering. Another name for overcontrol. 


(-distribution. The distribution of the random variable 
defined as the ratio of two independent random vari- 
ables. The numerator is a standard normal random 
variable and the denominator is the square root of a chi- 
square random variable divided by its number of 
degrees of freedom. 


Test statistic. A function of a sample of observations 
that provides the basis for testing a statistical hypothesis. 


Time series. A set of ordered observations taken at 
difference points in time. 


Tolerance interval. An interval that contains a speci- 
fied proportion of a population with a stated level of 
confidence. 
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Tolerance limits. A set of limits between which some 
stated proportion of the values of a population must fall 
with specified level of confidence. 


Total probability rule. Given a collection of mutually 
exclusive events whose union is the sample space, the 
probability of an event can be written as the sum of the 
probabilities of the intersections of the event with the 
members of this collection. 


Treatment. In experimental design, a treatment is a 
specific level of a factor of interest. Thus if the factor is 
temperature, the treatments are the specific temperature 
levels used in the experiment. 


Treatment sum of squares. In analysis of variance, 
this is the sum of squares that accounts for the variabil- 
ity in the response variable due to the different treat- 
ments that have been applied. 


t-test. Any test of significance based on the / distribu- 
tion. The most common t-tests are (1) testing hypothe- 
ses about the mean of a normal distribution with 
unknown variance, (2) testing hypotheses about the 
means of two normal distributions and (3) testing 
hypotheses about individual regression coefficients. 


Type I error. In hypothesis testing, an error incurred 
by rejecting a null hypothesis when it is actually true 
(also called an a-error). 


Type II error. In hypothesis testing, an error incurred 
by failing to reject a null hypothesis when it is actually 
false (also called a Q-error). 


U chart. An attribute control chart that plots the aver- 
age number of defects per unit in a subgroup. Also called 
a defects-per-unit control chart. Similar to a C chart. 


Unbiased estimator. An estimator that has its expected 
value equal to the parameter that is being estimated is 
said to be unbiased. 


Uniform random variable. Refers to either a discrete 
or continuous uniform random variable. 


Uniqueness property of moment generating function. 
Refers to the fact that random variables with the same 
moment generating function have the same distribution. 


Universe. Another name for population. 


V mask. A geometrical figure used to detect assignable 
causes on a cumulative sum control chart. With appro- 
priate values for parameters, identical conclusions can 
be made from a V mask and a tabular CUSUM. 


Variable selection. The problem of selecting a subset 
of variables for a model from a candidate list that 
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contains all or most of the useful information about the 
response in the data. 


Variables control chart. Any control chart for a con- 
tinuous random variable. See attributes control charts. 


Variance. A measure of variability defined as the 
expected value of the square of the random variable 
around its mean. 


Variance component. In analysis of variance models 
involving random effects, one of the objectives 1s to 
determine how much variability can be associated with 
each of the potential sources of variability defined by 
the experimenters. It is customary to define a variance 
associated with each of these sources. These variances 
in some sense sum to the total variance of the response, 
and are usually called variance components. 


Variance inflation factors. Quantities used in multiple 
regression to assess the extent of multicollinearity (or 
near linear dependence) in the regressors. The variance 
inflation factor for the ith regressor VIF; can be defined 
as VIF, = [1/(1 — R^], where R? is the coefficient of 
determination obtained when x; is regressed on the other 
regressor variables. Thus when x; is nearly linearly de- 
pendent on a subset of the other regressors R? will be 
close to unity and the value of the corresponding vari- 


ance inflation factor will be large. Values of the variance 
inflation factors that exceed 10 are usually taken as a 
signal that multicollinearity 1s present. 


Warning limits. Horizontal lines added to a control 
chart (in addition to the control limits) that are used to 
make the chart more sensitive to assignable causes. 


Weibull random variable. A continuous random vari- 
able that is often used to model the time until failure of 
a physical system. The parameters of the distribution 
are flexible enough that the probability density function 
can assume many different shapes. 


Western Electric rules. A specific set of run rules that 
were developed at Western Electric Corporation. See 
run rules. 


Wilcoxon signed rank test. A distribution-free test of 
the equality of the location parameters of two otherwise 
identical distributions. It is an alternative to the two- 
sample t-test for nonnormal populations. 


With replacement. A method to select samples in 
which items are replaced between successive selections. 


Without replacement. A method to select samples in 
which items are nof replaced between successive 
selections. 


A 
Acceptance region, 280, 290 
Actual process capability versus potential process 
capability, 622 
Addition of center points to 2* factorial, see Center 
points in 2^ factorial design 
Addition rules of probability, 33, 24 
Adjusted R?, 431 
All possible regressions, 453 
Alternate fraction, 551 
Alternative hypothesis, 279 
Analysis of variance, 387, 421, 429, 469, 473, 475, 
493, 494, 495, 13-5, 506, 511, 589 
Analysis of variance method for estimating 
variance components, see Estimating 
variance components 
Analytic study, 5, 7 
Anderson-Darling statistic, 6-1 
ANOVA partition of total sum of squares, 474 
Approximations to distributions, 119: 
Continuity corrections, 4-1 
Normal approximation to binomial, 120, 310 
Normal approximation to Poisson, 121 
Assignable cause of variation, 598 
Asymptotic relative efficiency, 584 
Attributes control charts, 601, 625 
Average run length, 630 
Axioms of probability, 30, 31 


B 

Backward elimination of variables in regression, 459 

Basic design, 552 

Bayes estimation of parameters, 7-3 

Bayes estimator, 7-4 

Bayes' theorem, 51, 52 

Bernoulli random variable, 11-7 

Bernoulli trial, 72 

Bias in estimation, 223 

Bias of estimator, 223, 226 

Binomial distribution, 74, 87, 119, 120, 155, 310, 
9-3, 9-4 

Binomial random variable, 74, 76, 84, 573 

Bins (cells) in histogram, 203, 204 

Bivariate distribution, 143, 177 

Bivariate normal distribution, 177, 178, 401 

Block effect, 493 

Blocking, 492, 496, 543, 545 

Blocking as restriction on randomization, 496 

Bootstrap, 7-2 

Bootstrap confidence interval, 8-1 

Bootstrap estimate of standard error, 7-2 

Bootstrap sample, 7-2, 8-1 

Box plot, 207, 208, 471 


Index 


C 

Categorical regressors, 450 

Cause-and-effect, 333 

Cause-and-effect diagram, 598, 640 

Censored data, 246, 275 

Center line on control chart, 10, 599, 600 

Center points in 2* factorial design, 541, 14-9, 14-10 

Central composite design, 14-16, 14-17 

Central limit theorem, 109, 240 

Chance cause of variation, 598 

Chebyshev's inequality, 5-13, 5-14 

Check sheet, 598 

Chi distribution, 7-1 

Chi-square distribution, 133, 7-1, 262, 308 

Chi-square distribution percentage points, 
table of, 655 

Chi-square tests, 307, 316, 321 

Choice of sample size and confidence intervals, 
252, 260, 266, 267, 335 

Choice of sample size for single-factor 
experiments, 482, 484, 490, 13-7, 13-11 

Choice of sample size in statistical tests, 293, 294, 
295, 297, 304, 305, 309, 312, 314, 331, 344, 
359, 364 

Coding data, 219 

Coefficient of determination, 397; also see R? 

Combinations, 2-3 

Common cause of variation, 598 

Comparative box plots, 208 

Comparative experiments, 278, 328 

Comparison of nonparametric tests to t-test, 
579, 584, 588 

Complement (set operation), 23 

Completely randomized design, 333, 472 

Complications in maximum likelihood 
estimation, 235 

Components of variance, 473, 487, 488, 489, 14-4 

Components of variance model, see Random 
effects model ANOVA 

Conceptual population, 190 

Conditional mean, 147, 163 

Conditional probability, 37, 38, 51, 125,126 

Conditional probability density function, 162 

Conditional probability distribution, 146, 153, 
162, 169, 5-2 

Conditional probability mass function, 
146, 153 

Conditional variance, 147, 163 

Confidence coefficient, 250 

Confidence interval, 247, 248, 250, 253, 254, 256, 
258, 259, 263, 266, 334, 345, 346, 365, 390, 
391, 403, 437, 477, 478 

Confidence interval on correlation coefficient, 403 


Confidence interval on difference in means of 
two normal distributions: 
Variances known, 334 
Variances unknown, 345, 346 
Confidence interval on difference in two 
proportions, 365 
Confidence interval on difference in two treatment 
means in ANOVA, 478 
Confidence interval on mean of normal 
distribution: 
Variance known, 249, 250, 253 
Variance unknown, 257, 259 
Confidence interval on mean response, 390, 438 
Confidence interval on population proportion, 
265, 266 
Confidence interval on ratio of variances of two 
normal distributions, 359 
Confidence interval on regression coefficients, 
389, 437 
Confidence interval on treatment mean in ANOVA, 
477, 478 
Confidence interval on variance of normal 
distribution, 261, 263 
Confidence level and precision of estimation, 251 
Confounding, 543, 544, 546, 548 
Connection between confidence intervals and 
hypothesis tests, 293 
Connection between gamma and chi-square 
distributions, 133 
Connection between Poisson and Erlang 
distributions, 129 
Connection between Poisson and exponential 
distributions, 123 
Connection between Weibull and exponential 
distributions, 134 
Consistent estimator, 245 
Contingency table, 320, 322 
Continuity corrections, 4-1 
Continuous probability distribution, 98: 
Chi-square, 133 
Erlang, 130 
Exponential, 123 
F, 355, 356, 357, 10-1 
Gamma, 131, 132 
Lognormal, 136 
Normal, 109, 110, 119, 120, 121, 204, 240 
t, 258, 8-2, 301 
Uniform, 107 
Weibull, 134 
Continuous random variable, 54, 97, 98, 100, 157, 
5-3, 5-4 
Continuous sample space, 19 
Continuous uniform distribution, 107 
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Continuous uniform random variable, 107 

Contour plot, 411 

Contrasts, 13-2, 525, 526, 532 

Control chart, 10, 598, 599, 600, 601, 602, 607, 
609, 610, 615, 625, 627, 632 

Control chart for defects per unit, see U-chart 

Control chart for fraction defective, see P-chart 

Control charts and hypothesis testing, 599 

Controllable variables, 469 

Convolution, 5-6 

Cook’s distance measure, 445 

Correlation, 171, 174, 179, 352, 400 

Correlation coefficient, 401 

Covariance, 171, 172, 174, 182 

Covariance matrix, 421 

C, Statistic in regression, 454 

Critical region, 280, 290 

Cumulative distribution function, 63, 64, 102 

Cumulative frequency plot, 205 

Cumulative standard normal distribution, 653, 654 

Cumulative sum control chart, 632, 634, 635 

Cyclic data, 210 


D 
Data collection, 5 
Data versus information, 6 
Decision interval for CUSUM, 635 
Defect concentration diagram, 598, 641 
Defects, 627 
Defining contrast for blocking in 2* factorial, 545 
Defining relation for 2%! fractional factorial 
design, 550 
Defining relation for 2? fractional factorial 
design, 556 
Degree of belief interpretation of probability, 28 
Degrees of freedom, 193 
Deming's 14 points, 642 
Dependent variable, see Response variable 
Design generator, 549, 556 
Design resolution, 555 
Designed experiment, 5, 6, 7, 11-1, 469, 506, 641 
Designed experiments in engineering design, 
470, 14-3 
Digidot plot, 210, 211 
Discrete probability distribution, 61: 
Binomial, 74, 87, 119, 120, 155, 310, 9-3, 9-4 
Geometric, 78 
Hypergeometric, 84, 87, 10-3 
Negative binomial, 78, 80 
Poisson, 89, 90, 119, 123, 129 
Uniform, 70 
Discrete random variable, 54, 59, 60, 62, 142, 
143, 5-1 
Discrete sample space, 19, 27 
Discrete uniform distribution, 70 
Discrete uniform random variable, 70 
Distribution-free statistical methods, see 
Nonparametric statistics 
Dot diagram, 3, 8, 197 


E 

Empirical model, 11, 12, 373, 374; also see 
Regression model 

Engineering method, 2 

Enumerative study, 5 

Equal variance assumption in pooled t-test, 10-1 

Erlang distribution, 130 

Erlang random variable, 129, 131 

a-Error, 281, 285 

b-Error, 282, 285, 293, 309, 312 

Error sum of squares, 379, 421, 429, 449, 474, 475, 
488, 493; also see Residual sum of squares 

Estimated standard error, 225, 384 

Estimating interaction in 2* factorial, 525, 530, 
531, 532 


Estimating main effects in 2* factorial, 525, 530, 531 

Estimating variance components, 489 

Estimation of parameters, 220; also see Point 
estimation of parameters, Parameter estimation 

Events, 22 

EWMA control chart, 647 

Exhaustive, 44, 52 

Expectation function, 12-6 

Expected mean squares, 474, 475, 488, 494, 13-6, 
13-16, 513, 14-4, 14-7 

Expected value of continuous random variable, 105 

Expected value of discrete random variable, 66 

Expected value of function of continuous random 
variable, 106 

Expected value of function of discrete random 
variable, 69 

Expected value (mean) of linear combination of 
random variables, 181 

Experimental design, see Designed experiments 

Experimental unit, 492 

Exponential distribution, 123 

Exponential probability plot, 6-2 

Exponential random variable, 123, 125 

Extra sum of squares method, 433, 12-1 

Extrapolation and regression, 11-1, 440 


F 

2? Factorial design, 524 

2? Factorial design, 530 

2* Factorial design, 1-3, 523, 529 

Factorial experiment or design, 1-1, 1-3, 7, 506, 
508, 510 

Factors for constructing tolerance intervals, 
674, 675 

Factors for constructing variables control charts, 673 

Failure mechanism, 213 

False alarm on control chart, 630 

F-distribution, 355, 356, 357, 10-1 

F-distribution percentage points, table of, 657, 
658, 659, 60, 661 

First (or lower) quartile, 200 

First-order model, 14-12, 14-13 

Fisher's LSD procedure, 479, 497, 516 

Fisher-Irwin test, 10-3, 10-4 

Fixed factor effects, 473, 511 

Fixed-effects model ANOVA, 473 

Forward selection of variables in regression, 458 

2*?! Fractional factorial design, 549, 552 

Fractional factorial design, 556 

2'*? Fractional factorial design, 555, 559 

Fractional factorial experiment, 1-3, 549 

Frequency distribution, 203 

Full model, 434, 12-1 

Functions of random variables, 5-1 


2/22 


G 

Gamma distribution, 131, 132 

Gamma function, 131, 7-1 

Gamma random variable, 131, 132 

Gauss-Newton method, 12-8 

Gaussian distribution, 109; a/so see Normal 
distribution 

General method for deriving confidence 
interval, 253 

General procedure for hypothesis testing, 287, 298 

General regression significance test, see Extra 
sum of squares method 

Generalized interaction, 547 

Generator of fractional factorial, see Design 
generator 

Geometric distribution, 78 

Geometric or 6 1 notation for factorials, 524 

Geometric random variable, 79 

Goodness-of-fit measure, 6-1 


Goodness-of-fit test, 315 
Graphical comparison of means following 
ANOVA, 13-1 


H 
Hat matrix, 443 
Histogram, 203, 204, 598, 619 
Hypergeometric distribution, 84, 87, 10-3 
Hypergeometric random variable, 86, 87 
Hypothesis, 7, 277, 278, 286 
Hypothesis testing, 7, 221, 278, 280, 287, 
280. 307, 310, 315, 320, 9-3, 329, 330, 
331, 337, 338, 339, 341, 342, 357, 358, 
359, 361, 362, 364, 10-3, 384, 387, 402, 
428, 433, 572, 581, 585; also see Analysis 
of variance 
Hypothesis testing on correlation coefficient, 402 
Hypothesis testing on difference in means of two 
normal distributions: 
Variance known, 329, 330, 331 
Variance unknown, 337, 338, 339, 341, 342 
Hypothesis testing on mean of normal distribution: 
Variance known, 289, 290, 291 
Variance unknown, 300, 301, 303 
Hypothesis testing on population proportion: 
Large-sample test, 310 
Small-sample test, 9-3 
Hypothesis testing on ratio of variances of two 
normal distributions, 357, 358 
Hypothesis testing on regression coefficients in 
multiple linear regression: 
Extra sum of squares method, 433 
Significance of regression, 428 
Tests on individual regression coefficients, 432 
Hypothesis testing on regression coefficients in 
simple linear regression, 384: 
Analysis of variance approach 387 
Significance of regression, 385 
t-Tests, 384 
Hypothesis testing on variance of normal 
distribution, 307 
Hypothesis testing on two population proportions: 
Large-sample test, 361, 362 
Small-sample test, 364, 10-3 


I 

Identity element, 533 

In-control process, 598 

Independence, 46, 47, 148, 170 

Independent events, 47, 48 

Independent random variables, 147, 148, 153, 164, 
170, 182, 5-5, 197 

Independent variable, see Regressor variable 

Indicator variables, 450 

Individuals control charts, 615 

Influential observations in regression, 444, 445 

Interaction, 1-2, 412, 507, 508 

Interpretation of confidence interval, 250, 251 

Interquartile range, 201 

Intersection (set operation), 23 

Intrinsically linear model, 400 

Invariance property of maximum likelihood 
estimators, 235 


J 

Jacobian of transformation, 5-3, 5-4, 8-3, 10-2 

Joint probability density function, 157, 167 

Joint probability distribution, 142, 145, 151, 167, 
169, 172, 7-3 

Joint probability mass function, 143, 151 


K 
Kruskal-Wallis test, 589, 590 


L 

Lack of memory property of exponential random 
variable, 125, 126, 140 

Lack of memory property of geometric random 
variable, 79, 80 

Lack-of-fit sum of squares, 11-2 

Large-sample confidence interval, 255 

Large-sample confidence interval for mean, 254, 255 

Large-sample hypothesis test on mean of normal 
distribution, 297 

Least significance difference method, 479, 
480, 497 

Least squares estimator, see Method of least squares 

Least squares normal equations, 376, 409, 414, 417 

Leverage in regression, 447 

Likelihood function, 230 

Likelihood ratio principle, 9-1 

Likelihood ratio test statistic, 9-1 

Linear combinations of random variables, 180, 181 

Linear model, 374, 413, 417, 447, 12-5, 472, 492 

Linear statistical model, see Linear model 

Location, 3 

Logistic regression model, 11-6 

Logit response function, 11-8 

Lognormal distribution, 136 

Lognormal random variable, 136 

Lower confidence limit, 250 

Lower control limit, 10, 599 


M 

Main effect, 507 

Marginal probability density function, 159, 168 

Marginal probability distribution, 144, 159, 168, 
179, 7-3 

Marginal probability mass function, 145, 152 

Matrix of scatter plots, 416, 456 

Maximum likelihood estimation, 230, 231, 233, 
7-5, 9-1, 401, 11-8 

Mean, 66, 105, 191 

Mean of continuous random variable, 105 

Mean of discrete random variable, 66 

Mean of population, 191 

Mean square error of estimator, 226 

Mean squares, 387, 474, 475 

Mechanistic model, 11 

Median, 200, 205, 206, 572 

Method of least squares, 376, 409, 414, 417 

Method of maximum likelihood, 6-1; a/so see 
Maximum likelihood estimation 

Method of moments, 229 

Method of steepest ascent, 14-13 

Minimum variance unbiased estimator, 224, 234 

Mixed model, 518, 14-6 

Mode, 200, 205, 206 

Model adequacy checking, 395, 441, 481, 498, 
517, 527, 536, 554 

Model building in regression, see Variable 
selection in regression models 

Moment estimator, 229 

Moment generating function, 5-8, 5-9, 5-11 

Moments about origin, 5-8 

Moments of random variable, 5-8; also see 
Population moment 

Moving range, 616 

Moving range control chart, 616 

Multicollinearity, 460 

Multinomial distribution, 75 

Multinomial probability distribution, 154, 155 

Multiple comparisons following ANOVA, 479, 
497, 13-1, 13-2, 13-4, 516 

Multiple linear regression model, 411 

Multiplication rule for probabilities, 42, 43, 2-1 

Multivariate data, 206 

Mutually exclusive events, 24, 34, 35, 52 


N 
Natural tolerance limits of process, 621 

Negative binomial distribution, 78, 80 

Negative binomial random variable, 81, 82 
Noncentral F distribution, 484 

Noncentral ¢ distribution, 304 

Nonlinear regression model, 12-5, 12-7 
Nonparametric statistics, 275, 572, 581, 585, 589 
Nonparametric tolerance interval, 275 

Normal (large sample) approximation in sign 

test, 576 

Normal (large sample) approximation in Wilcoxon 
rank-sum test, 587 

Normal (large sample) approximation in Wilcoxon 
signed-rank test, 583 

Normal approximation to binomial, 120, 310, 9-3 
Normal approximation to Poisson, 121 

Normal distribution, 109, 110, 119, 120, 121, 

204, 240 

Normal probability plot, 213, 214, 215, 6-1, 302, 
395, 442, 481, 537 

Normal probability plot of effects, 537, 539, 

540, 554 

Normal probability plot of residuals, 395, 442, 481 
Normal random variable, 110 

NP-chart, 626 

Nuisance factor, 492 

Null hypothesis, 279 





O 

Objective view of probabilities, 7-3 

Observational study, 5, 6 

Odds ratio, 11-8 

One observation per cell in two-factor 
factorial, 517 

One-factor-at-a-time experiment, 508 

One-half fraction, 1-3, 545 

One-quarter fraction, 556 

One-sample confidence intervals, 247, 248, 250, 
253, 254, 256, 258, 259, 263, 266 

One-sample hypothesis tests, 7, 278, 289, 297, 300, 
301, 305, 307, 310, 312, 315, 320 

One-sample t-test, 301 

One-sample z-test, 290 

One-sided alternative hypothesis, 279, 286, 291 

One-sided confidence bounds, 253, 259, 264, 267 

One-sided confidence bounds, 335 

Operating characteristic curves, 295, 304, 309, 
331, 344, 359, 482, 13-7 

Operating characteristic curves for chi-square test, 
666, 667, 668 

Operating characteristic curves for F-test, 669, 670 

Operating characteristic curves for t-test, 664, 665 

Operating characteristic curves for z-test, 662, 663 

Optimization experiment, 14-2, 14-11 

Order statistics, 245 

Ordered stem-and-leaf display, 200 

Orthogonal contrasts, 13-2 

Orthogonal design, 533 

Outlier, 207, 396 

Out-of-control process, 598 

Overcontrol, 8 

Overfitting in regression models, 432 


P 

Paired samples, 349, 350, 576 

Paired t-test, 349, 350, 491 

Paired versus unpaired comparison of means, 351 

Parameter estimation, 221, 222, 229, 230, 231, 
234, 7-3, 376 

Parameter estimation in nonlinear regression 
model, 12-7 

Parametric statistical methods, 572 

Pareto chart, 206, 207, 598, 639 
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Partial F-test, 434, 435 

Partial regression coefficients, 411 

Patterns on control charts, 599, 604, 605 

P-chart, 625, 626 

Percentile, 201 

Permutations, 2-1 

Pivotal quantity for constructing confidence 
interval, 253 

Point estimate, 221 

Point estimation of parameters, 220, 221, 229, 
230, 231, 234, 7-3 

Point estimator, 221 

Poisson distribution, 89, 90, 119, 123, 129 

Poisson process, 90, 123, 129, 130 

Poisson random variable, 93 

Polynomial regression models, 412, 447 

Pooled estimator of variance, 338 

Pooled t-test, 339, 10-1 

Population, 190, 195, 221 

Population moment, 229 

Population standard deviation, 193 

Posterior distribution, 7-3 

Power of statistical test, 285 

Precision of estimation, 225, 252 

Prediction interval, 247, 249, 268, 393, 439 

Prediction interval in regression, 393, 439 

Prediction interval on future observation from 
normal distribution, 269 

Prediction of new observations in regression, 
392, 439 

Predictor variable, 375; also see Regressor 
variable 

Principal block, 545 

Principal fraction, 551 

Prior distribution, 7-3 

Probabilistic linear model, 374; also see 
Linear model 

Probability, 14, 27, 28, 29, 37, 42, 99 

Probability density function, 98, 99 

Probability distribution, 59, 61, 66, 70, 74, 78, 80, 
84, 90, 97, 98, 142 

Probability mass function, 61, 62, 90, 143, 145 

Probability model, 14, 212 

Probability plots, 212, 213, 6-1, 6-2, 315 

Process capability, 619, 621, 622 

Process capability ratio, 621, 622, 623 

Process capability study, 601 

Process characterization experiment, 14-1 

Projection of fractional factorials, 554 

Properties of estimators, 221, 225, 226, 245 

Properties of least squares estimators, 383 

Properties of maximum likelihood estimator, 
234, 235 

Pure error sum of squares, 11-2 

P-value, 292, 299, 303 


Q 


Quality improvement, 596 
Quartiles, 200 


R 
R chart, 609 
R?, 397, 398, 431 
Random, 17 
Random effect or factor, 518, 522 
Random effects model ANOVA, 473, 487, 488, 
518, 14-3, 14-4, 14-5 
Random experiments, 17, 18 
Random order of trials in experiment, 511 
Random sample, 39, 40, 196, 225 
Random variable, 3, 53: 
Continuous, 54, 96 
Discrete, 54, 70 
Randomization, 469, 471 
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Randomized complete block design, 491, 13-16 

Range, 194, 607 

Rank transformation, 591 

Ranks, 572, 581, 586, 589, 591 

Rational subgroups and control charts, 602 

Reduced model, 434, 12-1 

Reference distribution, 301, 311 

Reference value for CUSUM, 635 

Regression analysis, 373 

Regression and causality, 11-1 

Regression coefficients, 374 

Regression model, 13, 374, 411, 527 

Regression model summary of 2* factorial, 527 

Regression sum of squares, 387, 429 

Regressor variable, 375, 412, 450 

Rejection region for statistical test see Critical region 

Relative efficiency in estimation, 227 

Relative frequency, 28 

Relative range, 607 

Replicates, 1-1, 471, 510 

Residual analysis, 395, 396, 11-1, 441, 442, 481, 
498, 527, 536, 554 

Residual plots, 481, 482, 498, 499, 517 

Residual sum of squares, 11-2, 421 

Residuals, 377, 395, 418, 441, 481, 517, 536 

Resolution III design, 555, 562 

Resolution IV design, 555, 556 

Resolution V design, 555 

Response surface, 14-12 

Response surface methodology, 14-11 

Response variable, 375, 412 

Restricted model, 14-7 

Retrospective study, 5 

Ridge regression, 461, 12-2 

Ridge trace, 12-2 

Rotatable design 14-19 

Runs rules, 607 


S 
S control chart, 610, 611 
Sample, 20, 39, 40, 190, 196 
Sample correlation coefficient, 402 
Sample mean, 190, 197 
Sample moment, 229 
Sample range, 194, 607 
Sample size formulas for confidence intervals, 
252, 335 

Sample size formulas for tests on proportion, 313 
Sample size formulas for tests on means, 294, 332 
Sample space, 18: 

Continuous, 19 

Discrete, 19, 27 
Sample standard deviation, 191, 197 
Sample variance, 191, 192, 197 
Sampling distribution, 221, 238, 289, 239 
Sampling distribution of the mean, 239 
Sampling with replacement, 20 
Sampling without replacement, 20 
Saturated fractional factorial design, 563 
Scatter diagram, 373, 598 
Scientific method, 2 
Screening experiments, 470, 545 
Second-order model, 14-10, 14-12 
Sensitivity of a statistical test, 285 
Sequential experimentation, 470, 551, 14-13 
Set operations, 23 
Shewhart control charts, 600, 632 
Sign test, 572, 575, 578, 579, 593 
Sign test for paired samples, 576 
Sign test, table of critical values, 671 


Significance level of statistical test, 281, 292 

Significance of regression, 385, 387, 388, 428 

Simple linear regression model, 374 

Single replicate of factorial design, 537 

Single-factor experiment, 469, 470 

Six-sigma process, 623, 624 

Six-sigma quality, 140 

Sources of variability, 3 

Sparsity of effects principle, 537 

SPC, 11: also see Statistical process control 

Special cause of variation, 598 

Stability, 4 

Standard deviation of random variable, 66, 105 

Standard error of point estimator, 225, 265 

Standard error of regression coefficients, 384 

Standard normal distribution, 113, 114, 240 

Standard normal random variable, 111, 113, 114 

Standardized control chart, 647 

Standardized residuals, 396, 442 

Standardizing normal random variable, 113, 114 

Statistic, 197, 221 

Statistical inference, 4, 221, 238, 327; also see 
Hypothesis testing, Parameter estimation 

Statistical process control, 11, 597; also see SPC 

Statistical quality control, 596, 597 

Statistical thinking, 3 

Statistical versus practical significance, 298, 299 

Statistics (the field), 2 

Stem-and-leaf diagram, 197, 198, 200, 204, 210 

Stepwise regression, 457 

Strong versus weak conclusions in hypothesis 
testing, 285 

Studentized residual, 443 

Subjective probability, 28 

Subjective view of probabilities, 7-3 


T 

t-Distribution percentage points, table of, 656 

t Distribution, 258, 8-2, 301 

Tabular CUSUM procedure, 634, 636 

Tampering with a process, 8 

Taylor series, 12-8 

Test for homogeneity in contingency table, 322 

Test for independence in contingency table, 320 

Test for lack of fit in regression, 11-1 

Test statistic, 279, 289, 301, 307, 311, 316, 321, 
322, 9-1 

Testing for curvature, see Center points in 2* 
factorial design 

Testing for trends, 593 

Third (or upper) quartile, 200 

Three-factor factorial experiment, 520 

Three-factor interaction, 532 

Three-sigma control limits, 603, 630 

Ties in Kruskal-Wallis test, 589 

Ties in sign test, 575 

Ties in Wilcoxon signed-rank test, 583 

Time series plot, 8, 209 

Time series, 8, 209 

Tolerance chart, 619 

Tolerance interval, 247, 248, 270, 275 

Tolerance intervals for normal distribution, 270 

Total probability rule, 43, 44, 52 

Total sum of squares, 380, 473 

Transformations, 217, 400, 11-4 

Treatment effect, 472 

Treatment sum of squares, 474, 488, 493 

Treatments, 472, 492, 493 

Tree diagram, 21, 22, 45 

Trend in data, 210 


Trimmed mean, 219 

Tukey's test, 13-4 

Two-factor factorial experiment, 510, 514, 517 

Two-factor interaction, 507 

Two-factor interaction plots, 508 

Two-factor mixed model, 14-6 

Two-factor random effects model, 14-4, 14-5 

Two-sample confidence intervals, 334, 335, 345, 
346, 352, 359, 365 

Two-sample hypothesis tests, 7, 327, 329, 337, 
341, 349, 357, 361, 364, 10-3 

Two-sided alternative hypothesis, 279, 286 

Type I error, 280 

Type II error for sign test, 578 

Type II error, 282, 293, 312 


H 
U-chart, 627, 628 
Unbalanced experiment, 479 
Unbiased estimator, 222, 223, 224, 379, 383, 421 
Uniform distribution: 
Discrete, 70 
Continuous, 107 
Union (set operation), 23 
Uniqueness property of moment generating 
functions, 5-11 
Unreplicated factorial design, 537 
Upper confidence limit, 250 
Upper control limit, 10, 599 


V 

Variability, 3, 191 

Variable selection in regression models, 432, 452, 
453, 457, 458, 459 

Variables control charts, 601 

Variance components, see components of variance 

Variance inflation factors, 460 

Variance of linear combination of random 
variables, 181 

Variance of point estimator, 223, 226 

Variance of population, 193 

Variance of random variable, 66, 105 

Venn diagram, 24 

Verifying assumptions, 213, 302; also see Model 
adequacy checking 

V-mask procedure for CUSUM, 633 


W 

Warning limits on control charts, 607 

Weibull distribution, 134 

Weibull probability plot, 6-2 

Weibull random variable, 134 

Weighted least squares, 409 

Western Electric rules for control charts, 606 

Wilcoxon rank-sum test, 585, 587, 588 

Wilcoxon rank-sum test, table of critical 
values, 672 

Wilcoxon signed-rank test, 579, 581, 583, 584 

Wilcoxon signed-rank test, table of critical 
values, 671 


X 
X control chart, 607, 608 


Z 
z-value, 114 


Problem Solutions 


The purpose of this Student Solutions Manual is to provide you with additional help in under- 
standing the problem-solving processes presented in Applied Statistics and Probability for 
Engineers. The Applied Statistics text includes a section entitled "Answers to Selected 
Exercises," which contains the nal answers to most odd-numbered exercises 1n the book. 
Within the text, problems with an answer available are indicated by the exercise number 
enclosed in a box. 

This Student Solutions Manual provides complete worked-out solutions to a subset of the 
problems included in the “Answers to Selected Exercises." If you are having dif culty reach- 
ing the nal answer provided in the text, the complete solution will help you determine the 
correct way to solve the problem. 

Those problems with a complete solution available are indicated in the “Answers to 
Selected Exercises," again by a box around the exercise number. The complete solutions to 
this subset of problems may also be accessed by going directly to this Student Solutions 


Manual. 


Chapter 2 Selected Problem Solutions 


Section 2-2 


2-43. 


3 digits between 0 and 9, so the probability of any three numbers is 1/(10*10*10); 
3 letters A to Z, so the probability of any three numbers is 1/(26*26*26); The probability your license plate 
is chosen is then (1/10°)*(1/26*) = 5.7 x 10° 


Section 2-3 


2-49. 


a) P(A = 1- P(A) = 0.7 
b)P(AUB)-P(A)* P(B) - P(CACB ) = 0.3+0.2 - 0.1 = 0.4 

c) P( A! AB )+P(AOB )= P(B). Therefore, P(A’ OB ) = 0.2 - 0.1 = 0.1 
d) P(A) = P(AQB )+P(AMB’ ). Therefore, P A OB’ ) = 0.3 - 0.1 = 0.2 
e) P(A UB )')=1-P(AUB )=1-0.4=0.6 

f) P(A’ UB ) = P(A‘) + P(B) - P(A’ OB ) = 0.7 + 0.2 - 0.1 = 0.8 


Section 2-4 


2-61. 


Need data from example 
a) P(A) = 0.05 + 0.10 =0.15 
P(ANB) _ 0.04+0.07 


b) P(AIB) = - = 0.153 
P(B) 0.72 

c) P(B) = 0.72 

à PB - (405 0044007 _ 9 493 


P(B) 0.15 
e) P(A ^ B) = 0.04 40.07 = 0.11 
f) P(A UB) 2 0.15 + 0.72 - 0.11 = 0.76 


2-67. a) P(gas leak) = (55 + 32)/107 = 0.813 
b) P(electric failure|gas leak) = (55/107)/(87/102) = 0.632 
c) P(gas leak| electric failure) = (55/107)/(72/107) = 0.764 
Section 2-5 
2-73. Let F denote the event that a roll contains a flaw. 
Let C denote the event that a roll is cotton. 
P(F) = P(F|C)P(C) + P(F|C^P(C) 
= (0.02)(0.70) + (0.03)(0.30) = 0.023 
2-79. Let A denote a event that the first part selected has excessive shrinkage. 


Let B denote the event that the second part selected has excessive shrinkage. 
a) P(B)- P(B|A )P(A) + P(B|A ')P(A') 


= (4/24)(5/25) + (5/24)(20/25) = 0.20 
b) Let C denote the event that the second part selected has excessive shrinkage. 


P(C) = P(C|A ^ BJP(A ^B) « P(C|A ^ B')P(A ^B") 
«P(C[A'^B)P(A'CB) + P(C|A'CB"P(A'^B") 


2218) 428) 42202) 


= 0.20 


Section 2-6 


2-87. It is useful to work one of these exercises with care to illustrate the laws of probability. Let H; denote the 
event that the ith sample contains high levels of contamination. 


a) P(H4 n H2 ^ Ha n Ha n Hs) = P(H1)P(H2 )P(H3 )P(Ha )P(Hs ) 
by independence. Also, P(H; ) = 09 . Therefore, the answer is 0.9? = 0.59 
b) Aq =(H, ^ H2 ^ Hs n H4 0 H5) 
A2 = (H1 HS NH3 NH4 Hs) 
A3 = (Hi^ H2 ^ H4 NH4 n Hs) 
A4 = (H40 H2 ^ Hs ^ H4 0 H5) 
As - (Huh H2 ^ H3 ^ Ha 0 Ha) 
The requested probability is the probability of the union Ay A5 UA3 A4 v As and these events 


are mutually exclusive. Also, by independence P(A;) = 0.9^ (0.1) 2 0.0656. Therefore, the answer is 


5(0.0656) = 0.328. 
c) Let B denote the event that no sample contains high levels of contamination. The requested 


probability is P(B') = 1 - P(B). From part (a), P(B') = 1 - 0.59 = 0.41. 


2-89. Let A denote the event that a sample is produced in cavity one of the mold. 
1.5 
a) By independence, P(A, A A» A A4 ANA, n Ag)-(—y = 0.00003 
) By p ( 1 2 3 4 5) (3! 
b) Let B; be the event that all five samples are produced in cavity 1. Because the B's are mutually 
exclusive, P(By 0 B2 Uu... Bg) = P(B4) + P(B2 )+...4+P(Be ) 
From part a., P(Bi) = =) . Therefore, the answer is 8) = 0.00024 


c) By independence, P(A, NMA, QA 3 N Ago As) E c $) . The number of sequences in 
T 


which four out of five samples are from cavity one is 5. Therefore, the answer is 5^ ( 8 ) = 0.00107 . 


Section 2-7 


2-97. Let G denote a product that received a good review. Let H, M, and P denote products that were high, 
moderate, and poor performers, respectively. 
a) 
P(G) = P(G|H)P(H) + P(G|M)P(M) + P(G|P)P(P) 
= 0.95(0.40) + 0.60(0.35) + 0.10(0. 25) 


= 0.615 
b) Using the result from part a., 
P(G|H)P(H)  0.95(0.40) 





P(H|G) = = 0.618 
P(G) 0.615 
P(G'|H)P(H 
c) P(H|G') = AO Drd = 0.05(0.40) = 0.052 
P(G') ] — 0.615 
Supplemental 


2-105. a) No, P(E; MN Ex MN E) z0 
b) No, E;' NE,’ is not Ø 
c) P(E)’ U Ey’ U Ej?) = P(E’) + PE) + PE) - P(E,/ Ej) - P(Ej/^ Ey’) - PEA Ej) 
+ P(E)’ ^ Ey’ A E3) 
= 40/240 


e) P(E, U E3) = P(Ej) + P(E3) - P(E, A E3) = 234/240 
f) P(E, U E U E) = 1 — P(E)’ A Ey’ A E;3’) = 1-0=1 


2-107. Let A; denote the event that the ith bolt selected is not torqued to the proper limit. 
a) Then, 


P(Ay A A2 ANA3 AQA4)= P(A4|A4 Ao n A3)P(A40 A9 n A3) 
= P(A4|A1 ^ A2 ^ A3 )P(A3|A1 ^ A2)P(A2|A4)P(A4) 


(STI IS] em 


b) Let B denote the event that at least one of the selected bolts are not properly torqued. Thus, B' is the 
event that all bolts are properly torqued. Then, 


-1-r- (25 4212) orn 


2-113. D = defective copy 


V REBRGSREIGEEE" 
(sd Iss Hes) nm 


2-117. Let Aj denote the event that the ith washer selected is thicker than target. 


>| shi s Jm 


b) 30/48 — 0.625 


c) The requested probability can be written in terms of whether or not the first and second washer selected 
are thicker than the target. That is, 


P(Aa) - P(A4A2A30rA4A2A 30TA4A 2A 30TA4A 2A a ) 
- P(Aa |A1A2)P(A1A2) + P(A3 |A1A2)P(A1A2) 
+P(A3|A1'A2)P(A1A2)  P(A ATA 2)P(A1A2) 
= P(A3|A1A2 P(A2|A, OPUS) + PA /A&A22PA2 A DPA () 
sP(Aa A A)P(A2 APA) + PA [AT A2)P(A2 ATDPQS1) 
_ 28 (30 29) , 29 (20 30) 29/20 30) , 30(20 19 

48 (50 49 48 \ 50 49 48 50 49 48 50 49 
= 0.60 
2-121. Let Aj denote the event that the ith row operates. Then, 
P(A,) = 0.98, P(A;) = (0.99)(0.99) = 0.9801, P(A3) = 0.9801, P(A4) = 0.98. 


The probability the circuit does not operate is 


P(A )P(A; P( A, )P( A4) = (0.02)(0.0199)(0.0199)(0.02) = 1.58 x 1077 


Chapter 3 Selected Problem Solutions 


Section 3-2 


3-13. 


fy (O) = P(X =0) =1/6+1/6=1/3 
f; 0.5) = P(X =1.5) z 1/3 


fx (2) 21/6 
fx (3) 21/6 
3-21. | P(X20)2002 28x10? 
P(X = 1) = 3[0.98(0.02)(0.02)]20.0012 
P(X = 2) = 3[0.98(0.98)(0.02)]=0.0576 
P(X = 3) = 0.98? = 0.9412 
3-25. X = number of components that meet specifications 
P(X=0) = (0.05)(0.02)(0.01) = 0.00001 
P(X=1) = (0.95)(0.02)(0.01) + (0.05)(0.98)(0.01)+(0.05)(0.02)(0.99) = 0.00167 
P(X=2) = (0.95)(0.98)(0.01) + (0.95)(0.02)(0.99) + (0.05)(0.98)(0.99) = 0.07663 
P(X=3) = (0.95)(0.98)(0.99) = 0.92169 
Section 3-3 
3-27. 
0, X < —2 
1/8 -2<x<-1 
F(%) = 3/8 -1<x<0 
5/8 0<x<1 
7/18 1<x<2 
1 2<X 
a) P(X < 1.25) = 7/8 
b) P(X <2.2)=1 
c) P(-1.1 < X € 1) 2 7/8 — 1/8 = 3/4 
d) P(X > 0)=1-P(K < 0) = 1 — 5/8 = 3/8 
3-31. 
0 x«0 ; 
0.008 O<x<l f(0) 2 0.2? 2 0.008, 
F(x) =40.104 1€ x«2, where f (I) = 3(0.2)(0.2)(0.8) = 0.096, 
0.4488 2<x<3 f (2) = 30.2)(0.8)(0.8) = 0.384, 
1 3< x f (3) = (0.8)° = 0.512, 
3-33. a) P(X <3)=1 


b) P(X <2)=0.5 
c) P(1 < X <2) 2 P(X-1) 20.5 
d) P(X>2) = 1 - P(X<2) = 0.5 


Section 3-4 


3-37 Mean and Variance 
LH —- E(X) 0f (0) - 1/() - Z£ 2) -3f() - 4f (4) 
= 0(0.2) + 1(0.2) + 2(0.2) + 3(0.2) + 4(0.2) = 2 
V(X) =0° FOFO? FDE FOE fA-—wW 
= 0(0.2) + 1(0.2) + 4(0.2) + 9(0.2) + 16(0.2) - 2? =2 
3-41. Mean and variance for exercise 3-19 
u= E(X) =10f00)+5f(5)+1f@) 
= 10(0.3) + 5(0.6) + (0. 1) 
— 6.1 million 
V(X) =10° (10) -5* f(5) +P f(0- u^ 
= 10° (0.3) + 5^ (0.6) - 17 (0.1) —6.1° 
= 7.89 million? 


3-45. Determine x where range is [0,1,2,3,x] and mean is 6. 
LU — E(X) Z6z0f(0) -1/(D c 2f 2) - 3f (3) + xf (x) 
6 = 0(0.2) + 1(0.2) + 2(0.2) + 3(0.2) + x(0.2) 
6 =1.2+0.2x 
4.8 2 0.2x 
x24 


Section 3-5 


3-47. E(X) = (341)2 = 2, V(X) = [(38-141Y -1]/12 = 0.667 


3-49. — X-«(1/100)Y, Y = 15, 16, 17, 18, 19. 





] (15419 
E(X) = (1/100) E(Y) = —— = 0.17 
(X) = ( ) E(Y) Al 5 | mm 
2 ?) 
Vy rnt iat Mich RN M i 
100 12 


Section 3-6 


IO) ., , 
3-537. a)P(X =5)= 5 .5 (0.5) = 0.2461 


10 10 10 
b) P(X <2)= 0 pss" 1 i psos 1 ; psos 


= 0.5" +10(0.5)"" +. 45(0.5)'" = 0.0547 


3-61. 


3-63. 


3-67. 


= 10 9 1 10 10 0 
o P(X 29)= 9 (0.5) + 10 .5 (0.5) = 0.0107 


10 3 7 10 4 6 
à PBSX <5)=| p.505 +) .5°0.5 


= 120(0.5)'° + 210(0.5)'° = 0.3223 


n=3 and p=0.25 


0 x<0 ; 
0.4219 O<x<l r= E 27 
FGel08438 12422 wie "e 64 


1 9 
0.9844 2<x<3 2) =3 — |> l= 
1 a £0 " 64 
<x S 

1 1 

ym e | =— 

£o E 1 

1000 i xs 
a) P(X =1)= i .001 (0.999) = 0.3681 


1000 1 999 
b)P(X 21) Z1- P(X =0)=1- ; .001'(0.999)"" = 0.6323 


1000 1000 
c)P(X <2) -( à poortosssy" 1 i Joor os" + (1° )o.00170.999" 


= 0.9198 
d)E(X ) =1000(0.001) =1 
V(X) =1000(0.001)(0.999) = 0.999 


Let X denote the passengers with tickets that do not show up for the flight. Then, X is binomial 


with n = 125 and p = 0.1. 
a)P(X >5)=1—P(X <4) 


125 " ps (125 1 p, (125 2 123 
-1-| H b (0.9) 1 : b (0.9) 1 j p (0.9) 
4 p p 1° (0.9)” + " p 6 0.9)" = 0.9961 


b)P(X >5)=1-— P(X <5) = 0.9886 


3-69. 


Let X denote the number of questions answered correctly. Then, X is binomial with n = 25 
and p = 0.25. 


25 25 25 
P(X >20) = 25? (0.75) 25?! (0.75% 25? (0.75y 
a) ) a) (0.75) Wu (0.75) e) (0.75) 
2 (9 


3 “| 95 
25 25 » 
b)P(X «5) 1 ; pos ?(0.75)^ " m ! (0.75 s , pos (0.75) 
25 
T poro 75) + ( pes (0.75) 20.2137 


Section 3-7 


3-71. 


3-75. 


3-77 


3-81. 


a. P(X 1) 2 (1-0.5)?0.52 0.5 

. P(X 24) =(1-0.5)°0.5 2 0.5* = 0.0625 

P(X 28) = (1—0.5)/0.5 = 0.5* = 0.0039 

d. P(X <2)= P(X 2D - P(X 22) 2 (0 20.5)?0.5 4 1 —0.5)'0.5 


-0.540.5? 20.75 
e. P(X »2) =1— P(X €2) 21-0.75 = 0.25 


ox 


p 


Let X denote the number of calls needed to obtain a connection. Then, X is a geometric random variable 
with p = 0.02 


a) P(X 210) = (1— 0.02) 0.02 = 0.98*0.02 = 0.0167 

b) P(X »5)21- P(X <4) =1-[P(X 2D + P(X 22) P(X 23) 4 P(X =4)] 
= 1—[0.02 + 0.98(0.02) + 0.987 (0.02) + 0.98° (0.02)] 
= 1 — 0.0776 = 0.9224 

c) E(X) = 1/0.02 = 50 

p=0.005 ,r=8 

a. P(X 28) =0.0005° = 3.91x10 * 


| 
b. u = E(X) = —— = 200 a 
He = 005 i 


c Mean number of days until all 8 computers fail. Now we use p=3.91x10 ? 


l 
EE) = — —ÀÁ— 2.56x10"° days or 7.01 x10" years 


3.91x10 ^! 
a) E(X) = 4/0.2 = 20 


19 16 4 

b) P(X=20) = 3 0.80) 0.2. = 0.0436 
18 15 4 

c) P(X=19) = : 0.80) 0.2" = 0.0459 


20 17 4 
d) P(X=21) = A 0.80) 0.2 = 0.0411 


e) The most likely value for X should be near Ux. By trying several cases, the most likely value is x = 19. 


3-83. Let X denote the number of fills needed to detect three underweight packages. Then X is a negative 
binomial random variable with p = 0.001 and r = 3. 
a) E(X) = 3/0.001 = 3000 
b) V(X) = [3(0.999)/0.0017] = 2997000. Therefore, 6x = 1731.18 


Section 3-8 


(i) __@xtoxisx14y/6_ 
(2) ~ (20x19x18x17)/24 


(2 | (QO0x19x18x1)/24 ` 
c) 


P(X <2)=P(X =0)+ P(X =1)+ P(X =2) 
AGN) EX), GKS) 
E Dn | i: R | i (2 | 
4 4 4 
16x15x14x13 n 4x16x15x14 £ 6x16x15 


= : ? 1 — (0.9866 


20x19x18x17 
24 


3-87. a P(X =l)= — 0.4623 


b P(X =4)= 








d) E(X) = 4(4/20) = 0.8 
V(X) = 4(0.2)(0.8)(16/19) = 0.539 


3-91. Let X denote the number of men who carry the marker on the male chromosome for an increased risk for high 
blood pressure. N=800, K=240 n=10 





a) n=10 
pope (m) 
P(X =1) =4 l = See = 0.1201 
10 101790! 
b) nz10 


P(X »1) 21- P(X <1) =1-[P(X 20) P(X =) 





po) Pe) 
P(X =0)= =X eet = 0.0276 
e" 109790! 


P(X >1) =1- P(X <1) =1-[0.0276 4 0.1201] = 0.8523 





Section 3-9 


e440 E 


3-97. a) P(X=0)= —-e ^ = 0.0183 





b) P(X €2) = P(X 20)  P(X 2 D +P(X — 2) 


_4 e 4! eA 
-e^4 4 
]! 


—4 44 
e 4 01954 
4! 


= 0.2381 








c) P(X24)- 





e 448 





d) P(X28)- 


3-99, P(X =0)= e^ -0905. Therefore, A = —In(0.05) = 2.996. 
Consequently, E(X) = V(X) = 2.996. 


3-101. a) Let X denote the number of flaws in one square meter of cloth. Then, X is a Poisson random variable 


gu (0. 1) 
| 


with A =0.1. P(X 22) = = 0.0045 


b) Let Y denote the number of flaws in 10 square meters of cloth. Then, Y is a Poisson random variable 
-141 


e | : 
with à — 1. Pec ee | = 0.3679 
c) Let W denote the number of flaws in 20 square meters of cloth. Then, W is a Poisson random variable 
witha =2. PW 20) 2 e” = 0.1353 
à P(Y > 2) 21- P(Y <1) 21- P(Y 20) - P(Y =1) 
sq 9 
= 0.2642 
3-105. a) Let X denote the number of flaws in 10 square feet of plastic panel. Then, X is a Poisson random 


variable with à 20.5. P(X 20) = e™ = 0.6065 


b) Let Y denote the number of cars with no flaws, 


10 10 0 -5 
P(Y =10) =|, 0-3935 )'° (0.6065 )° = 8.9x10 


c) Let W denote the number of cars with surface flaws. Because the number of flaws has a 
Poisson distribution, the occurrences of surface flaws in cars are independent events with 


constant probability. From part a., the probability a car contains surface flaws is 1—0.6065 = 
0.3935. Consequently, W is binomial with n = 10 and p = 0.3935. 


10 0 10 =D 
PW =0)=| , {0.6065)"(0.3935)" =8.9x10 


10 
PW =1) = j 0.6065)! (0.3935)* = 0.001372 


P(W <1) = 0.000089 + 0.001372 = 0.00146 


Supplemental Exercises 


3-107. Let X denote the number of totes in the sample that do not conform to purity requirements. Then, X has a 
hypergeometric distribution with N = 15, n = 3, and K = 2. 


ZY 
043 1312! 


P(X 2>1)=1- P(X =0)=1- ae = 0.3714 
15 10115! 





3-109. 


3-111. 


3-119. 


3-121. 


3-125. 


3-127. 


3-129. 


Let Y denote the number of calls needed to obtain an answer in less than 30 seconds. 
a) P(Y = 4) = (1 - 0.75)° 0.75 = 0.25°0.75 = 0.0117 
b) E(Y) = 1/p = 1/0.75 = 1.3333 


-5 £5 
a) Let X denote the number of messages sent in one hour. P(X =5) = EM =O0.1755 


b) Let Y denote the number of messages sent in 1.5 hours. Then, Y is a Poisson random variable with 


E pe TS” 
10! 


A-2175. P(Y =10) = = 0.0858 


c) Let W denote the number of messages sent in one-half hour. Then, W is a Poisson random variable with 


à =2.5. P(W < 2) = PW =0)+ P(W =1) = 0.2873 


Let X denote the number of products that fail during the warranty period. Assume the units are 
independent. Then, X is a binomial random variable with n = 500 and p = 0.02. 


500 0 500 
a) P(X = 0) = ó 0.02) (0.98) =4.1 x 10° 


b) E(X) = 500(0.02) = 10 
c) P(X >2) = 1 - P(X € 1) = 0.9995 


a) P(X <3)=0.2+0.4=0.6 

b) P(X > 2.5) 20.4 + 0.3 + 0.12 0.8 

c) P(2.7 «X < 5.1) = 0.4 + 0.3 2 0.7 

d) E(X) = 2(0.2) + 3(0.4) + 5(0.3) + 8(0.1) = 3.9 

e) V(X) = 27(0.2) + 37(0.4)  5(0.3) + 87(0.1) — (3.97 = 3.09 


Let X denote the number of orders placed in a week in a city of 800,000 people. Then X is a Poisson 

random variable with À = 0.25(8) = 2. 

a) P(X 2 3) = 1 - P(X < 2) = 1 - [e? + e°(2) + (e?2°)/2!] = 1 — 0.6767 = 0.3233. 

b) Let Y denote the number of orders in 2 weeks. Then, Y is a Poisson random variable with A = 4, and 
P(Y<2) = P(Y < 1) =e" + (e^4)/1! = 0.0916. 


Let X denote the number of totes in the sample that exceed the moisture content. Then X is a binomial 
random variable with n = 30. We are to determine p. 


30 
If P(X > 1) = 0.9, then P(X = 0) 2 0.1. Then A D) (0 — p)" = 0.1, giving 30In(1-p)-In(0.1), 


which results in p = 0.0738. 


a) Let X denote the number of flaws in 50 panels. Then, X is a Poisson random variable with 
à. = 50(0.02) = 1. P(X 20) 2e! = 0.3679. 

b) Let Y denote the number of flaws in one panel, then 
P(Y 2 1) = 1 — P(Y=0) = 1 — e?” = 0.0198. Let W denote the number of panels that need to be 
inspected before a flaw is found. Then W is a geometric random variable with p = 0.0198 and 
E(W) = 1/0.0198 = 50.51 panels. 


c) PŒ 21)=1- P(Y =0)=1-e™®” = 0.0198 


Let V denote the number of panels with 2 or more flaws. Then V is a binomial random 
variable with n=50 and p=0.0198 


50 0 50 50 1 49 
PV «2)-|  D.0198'C9802)" +| ` D.0198 (0.9802) 


50 2 48 
+|, D.0198'(0.9802)" = 0.9234 


Chapter 4 Selected Problem Solutions 


Section 4-2 


oo 


4l. — a) P(1< X) = [e?dx = (-e7) 


1 





i = e! = (0.3679 


2.3 


b) P(1 < X < 2.5) = Í e'"dx-(-e") =e'-e °° = 0.2858 
1 


2:9 
I 



































3 
c) P(X =3) = Je^dx -0 
3 
P 4 
d) P(X «4)- Je^dx =(-e*)| =1-e* 20.9817 
0 
e) PG < X) = Je^dx- (-e*)| = e™ = 0.0498 
3 
rx | deg 
4-3 a) P(X <4) = | de = — = = 0.4375, because f,(x)=0 for x <3. 
E: 16|, 16 
[X x 5^ —3.5^ 
b), P(X >3.5)= |zdx- — — 7. 77. 20.7969 because f(x) - 0 forx» 5. 
1.8 16]... 16 
[X m. 
c) P(4< X «5)- |-dx - — - = 0.5625 
18 16|, — 16 
1e x x» 4,5? —3? 
d P(X «45)- |-dx-2 —| 2 ———— = 0.7031 
18 16|. 16 
5 3.5 2p 2 2 2 2. us 
e) P(X » 45) P(X <3.5) = [dx f d= um = 2 422 : 2:05. 
48 — 318 — 16, 16, 16 16 








4-9 a) P(X < 2.25 or X > 2.75) = P(X < 2.25) + P(X > 2.75) because the two events are 
mutually exclusive. Then, P(X < 2.25) = 0 and 


2.8 
P(X >2.75)= | 2dx = 2(0.05) = 0.10. 
2.75 
b) If the probability density function is centered at 2.5 meters, then f, (x) — 2 for 
2.25 < x < 2.75 and all rods will meet specifications. 


Section 4-3 


4-11. a) P(X<2.8) = P(X < 2.8) because X is a continuous random variable. 
Then, P(X<2.8) =F(2.8)=0.2(2.8) = 0.56. 
b) P(X >1.5) ZI- P(X €1.5) 21—0.2(10.5) = 0.7 


o) P(X « 22) = F, (2) 20 
d) P(X > 6) -1- F,(6) 20 


X 


4.3. Now, fx (x) = e™ forü <x and F,(x)  Je"dxz-e"| =1-e 


0 


0,x <0 


for 0 € x. Then, F(x) = 
l-e*,x>0 


421. F(x)- [O.5xdx - 5|. —0.25x^ for0 <x <2. Then, 
0 
0, x«0 
F(x)24025x, O<x<2 


l, 2 Y 


Section 4-4 


NU 
= 2 = = 4.083 





5 
425. E(X)- | xzdx E 
3 





3 5 
V(X) = | (x - 4.083)? : x = |x? ždx — 4.083? 
3 3 


5 


— 4.083^ = 0.3291 


4 
X 





3 


t 600 120 
4.27. a) E(X) = |x —-dx = 6001n x|; = 109.39 
X 


100 
100 


120 
V(X)- | (x —109.39)? E 


100 X 100 








120 
2 
dx = 600 fi |. 2(109.39) ET 10959) dx 
X 


X 


= 600(x — 218.781n x — 109.39? x!) | ^ 233.19 


100 


b.) Average cost per part = $0.50*109.39 = $54.70 


Section 4-5 


4-33. a) f(x)= 2.0 for 49.75 < x < 50.25. 
E(X) = (50.25 + 49.75)/2 = 50.0, 


(50.25 — 49.75) 


V(X)= b 


= 0.0208, and o, = 0.144. 


b) F(x) = | 2.0dx for 49.75 < x < 50.25. Therefore, 


49.75 


0, x « 49.75 
F(x) 2 42x —99.5, 49.75 € x « 50.25 


l, 50.25<x 
c) P(X < 50.1) = F (50.1) = 2(50.1) - 99.5 = 0.7 


gab oee ea 
2 
Vou ate =) pads 
12 
2 1 2 2 
b) P(X «2)2 | ————— dx= [|0.7dx = 0.7x|, , = 0.7(.5) = 0.7143 
Das —1.5) J T 
c)F(X)- ceca = [0.7ax = 0.7]. for 1.5 «x « 22. Therefore, 
(2.21.5) -- | 
0, x«l.5 
F(x)240.7x—2.14, 1.5€ x «22 
ly, 229 
section 4-6 


4-4]  a)P(Z«1.28) -» 0.90 
b) P(Z < 0) — 0.5 
c) If P(Z > z) = 0.1, then P(Z < z) = 0.90 and z = 1.28 
d) If P(Z > z) = 0.9, then P(Z < z) = 0.10 and z =—1.28 
e) P(-1.224 < Z«z) - P(Z < z) - P(Z < -1.24) 
= P(Z < z) - 0.10749. 
Therefore, P(Z « z) = 0.8 + 0.10749 = 0.90749 and z = 1.33 


4-43. 


4-55. 


a) P(X < 13) = P(Z < 


(13-10)/2) 


- P(Z < 1.5) 


= 0.93319 


b) P(X >9)=1-P(X <9) 


=1-P(Z< 
=1-P(Z< 


(9-10)/2) 
-0.5) 


= | - [1- P(Z < 0.5)] 
- P(Z « 0.5) 


= 0.69146. 


c) P(6 « X < 14) = Y - 


0210 29-2 xd 


= P(-2 < Z <2) 
= P(Z < 2) -P(Z < — 2)] 
= 0.9545. 


d) PQ<X<4)= (220 «z« 170 


- P(-4 


«Z«-3) 


= P(Z < -3) - P(Z < -4) 


= 0.001 


35 


e) P(-2 < X < 8) = P(X < 8) - P(X < 2) 


sa Lae 
2 2 


=P(Z 








< —]) - P(Z < —6) 


= 0.15866. 


a) P(X <45)= PI Z < 


45 —65 
5 


= P(Z < -3) 


= 0.0013 


b) P(X > 65) = Az > 


> 





E 


= PZ >l) 
= ]-P(Z« 1) 


] - 0.841345 


= 0.158655 


c) P(X <x) = fz < 


x—60 
Therefore, 35 = 


S = 0.99. 
5 


2.33 and x = 71.6 


a) P(X > 90.3) + P(X < 89.7) 


_ fz " 90.3 —90.2 | fz Z 


=P(Z 


0.1 
>1)+ P(Z<-5) 


= ] — P(Z < 1) + P(Z < —5) 


89.7 — 90.2 


0.1 


| 


=] — 0.84134 +0 
— 0.15866. 
Therefore, the answer is 0.15866. 
b) The process mean should be set at the center of the specifications; that is, at u = 90.0. 


89.7 — 90 ud 
Ol Ol 





c) P(89.7 < X «90.3) - P 


= P(-3 < Z < 3) = 0.9973. 
The yield is 100*0.9973 = 99.73% 


4-59. a) PX > 0.0026) = P| Z > "A 


0.0004 
= P(Z> 1.5) 
= 1-P(Z < 1.5) 
= 0.06681. 
b) P(0.0014 < X < 0.0026) = T PM E  - 
0.0004 0.0004 
= P(-1.5< Z < 1.5) 
= 0.86638. 


c) P(0.0014 « X « 0.0026) = qu cZc« EM) 


O O 
_ f- 0.0006 pe 0.0006 } 
O 





O 





Therefore: (z z M |= 0.9975. Therefore, 9:96 = 9.81 and o = 0.000214. 
O 


Section 4-7 





4-67 Let X denote the number of errors on a web site. Then, X is a binomial random variable 
with p = 0.05 and n = 100. Also, E(X) = 100 (0.05) = 5 and 
V(X) = 100(0.05)(0.95) = 4.75 


l-5 i P(Z 2 —1.84) =1-— P(Z « -1.84) 21— 0.03288 = 0.96712 
5 





P(X21)& {z > 





4-69 Let X denote the number of hits to a web site. Then, X is a Poisson random variable with 
a of mean 10,000 per day. E(X) = à = 10,000 and V(X) = 10,000 


a) 
10,200 —10,000 


4/ 10,000 


= 1—0.9772 = 0.0228 


par zia) rz: Joiner 


Expected value of hits days with more than 10,200 hits per day 1s 
(0.0228)*365=8.32 days per year 





b.) Let Y denote the number of days per year with over 10,200 hits to a web site. 
Then, Y is a binomial random variable with n=365 and p=0.0228. 
E(Y) = 8.32 and V(Y) = 365(0.0228)(0.9772)-8.13 


15—8.32 


48.13 


= 1 — 0.9904 = 0.0096 


P(Y >15)= i > IR P(Z 2 2.34) 21— P(Z < 2.34) 


Section 4-9 


4-77. Let X denote the time until the first call. Then, X is exponential and 


A= 2 = x calls/minute. 





a) P(X > 30) 2 |e “dx =-e * 
30 
b) The probability of at least one call in a 10-minute interval equals one minus the 
probability of zero calls in 
a 10-minute interval and that is P(X > 10). 


P(X >10)=-e "| =e”? 20.5134. 
10 

Therefore, the answer 1s 1- 0.5134 = 0.4866. Alternatively, the requested probability is 
equal to P(X < 10) = 0.4866. 


x |10 
c) P(5< X <10)=-e 5| ze? -e° ? 202031 


5 


Ser 01353 


30 

















t| xX 


d) P(X < x)= 0.90 and P(X < x) 2—e "| 21—e "P? = 0.90 Therefore, x = 34.54 
0 





minutes. 





4-79. Let X denote the time to failure (in hours) of fans in a personal computer. Then, X is an 
exponential random variable and A =1/ E(X) = 0.0003. 


oo 
eo 


a) P(X > 10,000) = | 0.000327 dx =e") =e * = 0.0498 
10,000 10,000 
7,000 7,000 
b) P(X < 7,000) = | 0.0003e dx =e" | — 21-67?! = 0.8775 
0 0 





4-81. Let X denote the time until the arrival of a taxi. Then, X is an exponential random 
variable with A=1/E(X)=0.1 arrivals/ minute. 


4-83. 


4-87. 


| =e% = 0.0025 


60 





a) P(X > 60)= [0.1e ^^ gx =- e *^ 
60 


10 
-1-e!-20.6321 


0 





10 
b) P(X <10)= |0.1e ^ "dx =- e" 
0 








Let X denote the distance between major cracks. Then, X 1s an exponential random 
variable with A —1/ E(.X) 20.2 cracks/mile. 


oo 


-e^-0.1353 


10 





a) P(X > 10)= [02e ?*dx = e 
10 





b) Let Y denote the number of cracks in 10 miles of highway. Because the distance 
between cracks is exponential, Y is a Poisson random variable with A =10(0.2) = 2 


cracks per 10 miles. 
-2 A2 
< 5.20707 
2! 


c) 0, =1/A=5 miles. 





P(Y -2)- 








Let X denote the number of calls in 3 hours. Because the time between calls 1s an 
exponential random variable, the number of calls in 3 hours is a Poisson random variable. 
Now, the mean time between calls is 0.5 hours and X - 1/0.5 2 2 calls per hour = 6 calls in 
3 hours. 

















—6 40 —6 «1 —6 42 —6 43 
= E e6 ,e6,e6 ,e6|. 
PX 24) -1- PUK 83) -1-| T + T + 3! + 3 |= 0.8488 


Section 4-10 


4-97. 


4-101. 





Let Y denote the number of calls in one minute. Then, Y is a Poisson random variable 
with X 25 calls per minute. 
e? 54 
a) P(Y =4)= m = 0.1755 
—5 £0 —5 el —5 £2 
5 5 5 

b) P(Y >2)=1- P(Y <2)= MAT WEGE eS 

Let W denote the number of one minute intervals out of 10 that contain more than 2 


calls. Because the calls are a Poisson process, W is a binomial random variable with n = 
10 and p = 0.8754. 


Therefore, P(W = 10) = (£ 0.8754" (1— 0.8754)" = 0.2643. 





= 0.8754. 








Let X denote the number of bits until five errors occur. Then, X has an Erlang distribution 
with r — 5and X-10^ error per bit. 


a) EX)» — = 5x10? bits. 


Lin 
À 
b) V(X) = > = 5x10" and o, 2 45x10! = 223607 bits. 





c) Let Y denote the number of errors in 10° bits. Then, Y is a Poisson random variable 
with 

A — 1/10? 210^ error per bit = 1 error per 10? bits. 

P(Y 23) 21- PY x2) =1- £2? 4 e er |- 0.0803 


0! I! 2! 











4-105. a) I(6) = 5!=120 
bd P5) 25TG)733 
co) I3) 22T(2) 225 )= r" 211.6317 


Section 4-11 

4-109. B=0.2 and 0-100 hours 
E(X) -100T(1--. 4) 2100x 51212,000 
V(X)=100°'T(1+ 2-100 [T(14 f =3.61x10" 


0.2 





4-111. Let X denote lifetime of a bearing. B=2 and 6710000 hours 
En 
a) P(X »8000) ZI— F, (8000) = e oe 
b) 
E( X) -10000I (1 + 2) = 100001 (1.5) 


-10000(0.5)T(0.5) = 5000/7 = 8862.3 
= 8862.3 hours 
c) Let Y denote the number of bearings out of 10 that last at least 8000 hours. Then, Y is 
a 
binomial random variable with n = 10 and p = 0.5273. 


P(Y =10) = (£ 0.5273" (1— 0.5273)" = 0.00166. 


xu eu 5573 





Section 4-12 


4-117 X isa lognormal distribution with 0—5 and (0 =9 
a. ) 
In(13330)-5 
P(X <13300) = P(e” < 13300) = P(W < In(13300)) = spem 


= (1.50) = 0.9332 
b.) Find the value for which P(X<x)=0.95 





P(X € x)= P(e” € x)  P(W «In(x)) = of E ze i 0.95 


l = 
Tu =1.65 x=e'°* 2509522 
c) U= E(X) =e? ? 2 959? = 0% 2133597 


V(X) 2 e?** (e® 21) 2 e"? (e? -1) =e" (e° -1) =1.45x10" 


4-119 a.) X is a lognormal distribution with 022 and w=4 
In(500) - 2 
P(X «500) = P(e” < 500) = P(W < 1In(500)) = fm] 


= (2.11) = 0.9826 
b.) 


P(1000 « X «1500 
BO S5000)csd0005 5699-94. 1900) 


P(X »1000) 


| of In(1500) - 2 } of In(1000) —2 | 
2 2 
| " of In(1000) — 2 | 
2 


(2.66) - (2.45) .0.9961—0.2929 _ 0.0032 / 0.007 = 0.45 
(1— (2.45) (1—0.9929) 

c.) The product has degraded over the first 1000 hours, so the probability of it lasting 

another 500 hours is very low. 











4-121 Find the values of Band w given that E(X) = 100 and V(X) = 85,000 





di dv 85000 = e??** (e —1) 
let x 2 e^and y= e" then (1)100 2 x4 y and (2) 850002 x y(y-1) =x y^ -xy 


Square (1) 10000 = x^ y and substitute into (2) 
85000 = 10000 ( y — 1) 


yoo. 
l ; 100 
Substitute y into (1) and solve for x x 2 —— = 32.444 
49.5 


0 = 1n(32.444) = 3.45 and w* =In(9.5) = 2.25 


Supplemental Exercises 


4-127. Let X denote the time between calls. Then, A =1/ E(X) — 0.1 calls per minute. 








5 5 
a) P(X < 5) = Í 0.le desg h  21—e7? 20.3935 
0 


0 





15 
b)P(5«X«15)2-e?"| 2g? —g? = 0.3834 


5 





c) P(X « x) 2 0.9. Then, P(X « x)= i 0.le° "dt 2]—e ^" 20.9. Now, x = 23.03 
0 


minutes. 





4-129. a) Let Y denote the number of calls in 30 minutes. Then, Y is a Poisson random variable 
-3 20 31 BAD 
e3 e3 e3 
+ + 
0! l! 2l 
b) Let W denote the time until the fifth call. Then, W has an Erlang distribution with 
à 2 0.1 andr=5. 


E(W) = 5/0.1 = 50 minutes 


= 0.423. 











with x=e°. PY €2)- 








4-137. Let X denote the thickness. 


a)P(X»55)- P| Z > 





= P(Z > 2.5) = 0. 0062 


4.5-5 5.5—5 
———— < Z < —— |=P(-2.5<Z<2.5) = 0.9876 

0.2 0.2 

Therefore, the proportion that do not meet specifications is 1 — P(4.5 < X < 5.5) = 
0.012. 


b) P(4.5 < X < 5.5) = 1 





4-139. 


4-141. 


4-143 


c) If P(X < x) = 0.90, then fz > — 





— 0.9. Therefore, = = 1.65 and x = 5.33. 


If P(0.002-x « X < 0.002+x), then P(-x/0.0004 « Z « x/0.0004) = 0.9973. Therefore, 
x/0.0004 = 3 and x = 0.0012. The specifications are from 0.0008 to 0.0032. 


10,000-, 10,000—p 
If P(X > 10,000) = 0.99, then P(Z > "^ $99 )- 0.99. Therefore, ^ ego = -2.33 and 


u =11,398. 


X 1s an exponential distribution with E(X) = 7000 hours 
5800 1 "n3 1 5800 ) 
a.) P(X < 5800) = Í —— e 70% dy =1-e \) = 0.5633 
, 7000 


X 


b)P(X»x)- [ze ms —0.9 Therefore, e 7° — 0.9 


and x = —7000 In(0.9) = 737.5 hours 





Chapter 5 Selected Problem Solutions 


Section 5-1 


5-7. 
E(X) S£ 20:0 + fy 2) + foy (1,3) 2LF (2:1) 83:02): 3 (2:3)] 
+ 3l G.D + fy G2) + fry G.3)] 
= (x3) Qx2)« (x1)213/6 = 2.167 
V(X)-ü-2Y e+ (2-2) kt- $-0.639 
E(Y) = 2.167 
V(Y) = 0.639 
5-11. 
E(X) 2 -10) -0.5(2) + 0.54) - 1) 2 1 
E(Y) 22) - 1) 418) + 20 =F 


5-15 a) The range of (X,Y) is X 20, Y 20 and X +Y <4. X is the number of pages with moderate 
graphic content and Y is the number of pages with high graphic output out of 4. 


x-0 X=1 x-2 x-3 x24 
y=4 5.35x10°° 0 0 0 0 
=3 0.00183 0.00092 0 0 0 
y-2 0.02033 0.02066 0.00499 0 0 
y-1 0.08727 0.13542 0.06656 0.01035 0 
y=0 0.12436 0.26181 0.19635 0.06212 0.00699 
b.) 
x=0 X=1 X=2 X=3 x-4 
f(x) 0.2338 0.4188 0.2679 0.0725 0.0070 
C.) 
E(X)- 
4 
X xf) = 0(0.2338) + 1(0.4188) + 2(0.2679) + 3(0.7248) = 4(0.0070) = 1.2 
0 
Tar (3, y) 
d) f, (y) = — ——— , R3) = 0.0725 
1 f. O) 
y fy) 
0 0.857 
1 0.143 
2 0 
3 0 
4 0 


e) E(Y[X-3) = 0(0.857)+1(0.143) = 0.143 


Section 5-2 


s xpPOUeO)SYT QUE QS RE 44 2097)-05 
b) P(X =1,¥ =2)= f,,(L2,.) + fa (L2,2) =0.35 
o P(Z <1.5) = fy (LLD + f, (1.2.2) + fy (LD) + yyy (2.2.1) = 0.5 
d) 
P(X =1 or Z=1)=P(X =1)+ P(Z =1)- P(X =1,Z =1) =0.54+0.5-0.2 = 0.8 
e) E(X) = 1(0.5) + 2(0.5) = 1.5 

5-25. P(X=x, Y=y, Z=z) is the number of subsets of size 4 that contain x printers with graphics enhancements, y 
printers with extra memory, and z printers with both features divided by the number of subsets of size 4. 


From the results on the CD material on counting techniques, it can be shown that 
4Y5 V6 
= = zn dne: Ne E 
PRS SV. EE T for xtyt+z = 4. 
4 


ESO 
DEMNM M 
4 


b) P(X =1,Y =1)= P(X =1,Y =1,Z =2) = ALAS) 9 a198 
4 


c) The marginal distribution of X is hypergeometric with N = 15,n=4, K =4. 
Therefore, E(X) = nK/N = 16/15 and V(X) = 4(4/15)(11/15)[11/14] = 0.6146. 


P(X =2,Y =2) 0.1944 
P(Y =2) 0.2646 
P(X =2,Y 22) - 0.1922 





5:90. ma PUN ELITY EE = 0.7347 


4 
P(Y =2)= : pror = 0.2646 . from the binomial marginal distribution of Y 


b) Not possible, x+y+z=4, the probability is zero. 


;) P(X |Y 22) 2 P(X 20| Y 22, P(X -1|Y =2), (X =2|Y =2) 








— — | 
P(X =0|¥ =2) = 44 791 72 | ^ 0560370.1? | /0.2646 = 0.0204 
P(Y =2) 01212! 
= — ! 
P(X 21 y22 2 £6 7b! 72 .[ * 560301 |/02646 = 0.2449 
P(Y =2) 1121! 
= — ! 
PEX =21Y =2 = P 7 1 72 _( _* 9) 620,370.19 |/0.2646 = 0.7347 
P(Y =2) 21210! 


d.) E(X/Y=2)=0(0.0204)+1(0.2449)+2(0.7347) = 1.7142 


5-31 a.), X has a binomial distribution with n = 3 and p = 0.01. Then, E(X) = 3(0.01) = 0.03 
and V(X) = 3(0.01)(0.99) = 0.0297. 


b.) first find P(X | Y = 2) 
PY =2)=P(X =LY =2,2 =0)+ PX =0,Y =2,Z =1) 


- 3 0.01(0.04) 0.95" x 01° (0.04)? 0.95' = 0.0046 


11210! 

P(X =0| Y =2) = P 761 =2 || 3 9.91°0.0470.95' |/0.004608 = 0.98958 
P(Y =2) 072"! 
— = ! 

PO Cea rez ETE 3 


— — 0.01'0.0470.95" }/0.00460 = 0.01042 


P(Y =2) 1n! 


E(X | Y = 2) = 0(0.98958) + 1(0.01042) = 0.01042 


V(X |Y 22) = E(X*)-(E(X)) = 0.01042 — (0.01042)? = 0.01031 


Section 5-3 





3 2 3 
535. a) P(X < 2,Y <3)=4| | xydxdy = 4 Q) | ydy =4 (2)G) = 0.4444 
0 0 0 


b) P(X « 2.5) = P(X « 2.5, Y < 3) because the range of Y is n 0 to 3. 
3 2.5 


P(X «2.5, Y <3)= ar] | aydedy = 43. aspe - 4 (3.125)2 = 0.6944 


2.53 


2.5 
o) P(A «Y < 2.5) = al J xydxdy = 4 (4.5) | ydy =H] —0.5833 





2.53 


à P(X >1.8,1< Y < 2.5) = «| | xydxdy = 4 (2.88) j ydy = 4 (2.88) £29 =0.3733 


1 1.8 


2 


3 3 . 
e) E(X) = 4 | fx?ydedy = [oydy <4 =? 
0 0 0 





4 0 4 
f P(X «0, Y <4) = & | |xvdxay = 0 | ydy = () 
0 0 0 


5-37. 
3 x42 3 


c| [c+ y)dydx = |» pz 
0 x 


0 


x+2 





3 

M (x+2)? 2 li 

aa z = 
0 


«eli 4x + 2x = x! + x? + 2x] = 24c 
0 


Therefore, c = 1/24. 


5-39. a) f x (x) is the integral of f XY (x, " over the interval from x to x+2. That is, 











x+2 » |x+2 x 1 
= — y = 
fy (X= va *Xd-. xv [elt fees 
X 
: (+y) J+ y 
_ fey(Ly) _ 24 = 
b) Fy, )= FO LEE NEUE forl<y<3. 
ae 6 
6 12 
See the following graph, 





Ly j (y) defined over this line segment 





3 2 3 s 
c) E(Y|X= «p Jo» )dy =— [575 zc 
61 1 


Tiy iC ay 
4) P(Y >2|X =1)= [Ce jaje 5 =0.4167 
z l 1 





e.) f x|2 (x )= E ur . Here I Y (y) is determined by integrating over x. There are three regions of 


integration. For O < y € 2 the integration is from 0 to y. For 2 < y € 3 the integration is from y-2 to y. 
For 3 < yY < 5 the integration is from y to 3. Because the condition is x=2, only the first integration is 


IE: for O< y<2. 





ljo y 
needed. f,(y)-— a jet» = aoe 


f xp (x) defined over this line segment 


l 
340 *2) x+2 


Therefore, f, (2) 41/4 and frp) = 25 = 6 for0<x<2 





5-43. Solve for c 


eo 


cf fe dyan = je” ( E lax = de cg gy 


0 
E EI "a c — 10 
342 5J 10 


5-49. The graph of the range of (X, Y) is 
2 











e N W A Ui 








1 2 3 4 
j p t j Tea =] 
0 0 1 x-1 


= ef (x+1)dx+ | dx 
0 1 


=$c+6c=7.5c=1 
Therefore, c = 1/7.5=2/15 


5-5]. a.) 
x+1 
l x+1 
x)= | — dy =| — | for O0<x<l, 
Ie) ET ? ta 
x41 
l x+l-(x—1) 2 
x)= | — dy = —— —— —— |= — for l1«x«4 
Jœ) bare ee í 7.5 | 7.5 


b.) 
LHD) dS 
fixa) ———2—- 
f.0 2/75 
ty yl 
c )E(Y| X =1) E y= 


0 
0.5 


0.5 
d) P(Y «0.5|X =1)= Í 0.5dy =0.5y| =0.25 
0 


0 


5-53 a.) u=3.2 À=1/3.2 


poo aa p 23. 
PX -5.Y  5)52 10.24 | [e te e dydx s 3.2 at of fs 
35 5 5 


:[: "e Jj 0.0439 
fje 


co xX x 10 
P(X »10,Y >10) 210.24 fe MO Dnm safe” aC aa 

10 

1 


0 
Gi ee Ie i: 0.0019 


b.) Let X denote the number of orders in a 5-minute interval. Then X is a Poisson 
random variable with A=5/3.2 = 1.5625. 





e 77? (1.5625)* 
2] 


P(X =2)= = 0.256 


For both systems, P(X = 2)P(X = 2) = 0.256^ = 0.0655 


c.) The joint probability distribution is not necessary because the two processes are 
independent and we can just multiply the probabilities. 


Section 5-4 


0.5 0.51 


5-55. a) P(X <0.5)= | | [ (bondad - | J (Axy)dydx = | (2x)dx = x? 


b) 





, | 
= 0.25 
0 


0.50.5 


1 
P(X <0.5,Y <0.5)= Í Í Í (8xyz)dzdydx 
0 00 


0.50.5 


- = J (4xy)dydx = | (0.5x)dx rear] i 


c) P(Z < 2) = 1, because the range of Z is from to 1. 
d) P(X < 0.5 or Z < 2) = P(X < 0.5) + P(Z < 2) - P(X < 0.5, Z < 2). Now, P(Z < 2) =1 and 
P(X <0.5,Z<2) =P(X<0.5). LO the answer is 1. 
[Ld 


e) E(X) = | | | (8x? yz)dzdydx = (2x? )dx = 2 2 2/3 


000 


= 0.0625 





I 
a) fyz (Yz) = | (Sxyz)dx = 4yz forü«y«1and0«z«1. 
0 


(4 = fnz sz) _ 8x(0.5)0.8) 
f Osz) — 4(0.5)(0.8) 


b) Therefore, P(X < 0.5|Y = 0.5, Z 20.8) = f 2xdx = 0.25 
0 


Then, Liz = 2x for0<x<1. 


Determine c such that f (xyz) = cis a joint density probability over the region x>0, y>0 
and z>0 with x+y+z<1 
] 1-x 1-x- 


f (xyz) = d J | diae E fea- x — y)dydx -| c(y — xy ESI 


= [ncn 9-2 p -jd ene 


=g = Therefore, c = 6. 


l-x 





a.) | 
f (x)= 6 | jeo- j 6(1—-x— y)dy = b -—] 
= 6x44 = 300-1) forü« x«l 
b.) 


[ey 


f(xy)=6 [dz- 60—-x- y) 


forx»0,y»0andx-y«l 


C.) 


dioses uS esos UEM EN. Bog quie ses) 
f(y20.5,220.5) 6 


d.) The marginal f,(y) is similar to f(x) and f,(y) -3(1— y)" for0«y« I. 
_ f(%,0.5) | 6(0.5— x) 
Jay (103) 75 OS 3025) 


5-65. 5-65. a)Let X denote the weight of a brick. Then, 
PIX 2.75)2 (Z5 LA) = P(Z»-1)20.84134. 


Let Y denote the number of bricks in the sample of 20 that exceed 2.75 pounds. Then, by 
independence, Y has a binomial distribution with n = 20 and p = 0.84134. Therefore, 
the answer is P(Y = 20) = (2 0.84134” —0.032. 

b) Let A denote the event that the heaviest brick in the sample exceeds 3.75 pounds. 
Then, P(A) = 1 - P(A’) and A' is the event that all bricks weigh less than 3.75 pounds. As 
in part a., P(X < 3.75) = P(Z < 3) and 

P(A) =1-[P(Z «3)|? 21-0.99865^" 20.0267. 


= 4(1— 2x) for x « 0.5 




















Section 5-5 


5.671. E(X) = 1(3/8)+2(1/2)+4(1/8)=15/8 = 1.875 
E(Y) = 3(1/8)+4(1/4)+5(1/2)+6(1/8)=37/8 = 4.625 


E(XY)=[1x3 x (1/8)] - [1x4x(1/4)] - [2x 5x(1/2)] - [4x 6x (1/8)] 
= 75/8 =9.375 
O y =E(XY)-E(X)E(Y) = 9.375 — (1.875)(4.625) = 0.703125 


V(X) = 1°(3/8)+ 27(1/2) +47(1/8)-(15/8)? = 0.8594 
V(Y) = 3°(1/8)+ 4°(1/4)+ 5*(1/2) +6°(1/8)-(15/8) = 0.7344 


Oy 0.703125 


des 0,0,  |(0.8594)(0.7344) 





5-69. 


333 22 075 C=1/ 36 


x-l y-l 
2 
13 13 14 14 (13 -1 
Hoe FS. Fone gas l l= 
Q0-— E= BAN” X 
16 16 23 
E(X? = — E(Y? Ex V(X)=V(Y) = — 
(X^) 3 (Y^) ! (X)=V(Y) a 
= 
= —0.0435 





—__ 36 
á |23 |23 
36 \ 36 


5 1 x41 5 5 x«l 
5-73. E(X)=—| Ixdydx -— | | xdydx = 2.614 
me pes pe 
1 x41 5 x«l 


E(Y) =<] Jodie — | | ydydx = 2.649 
0 0 1 x-1 


1 x41 5 x+l 
Now, E(XY) = A | dye + ssl qoaa = 8.7763 
O „ =8.7763 —(2.614)(2.649) = 1.85181 
E(X^)2 8.7632 E(Y^) «9.07895 
V(x) =1.930, V(Y) = 2.062 


-. . 1852 _ 9 9979 


41.9304/2.062 


Section 5-6 


5-81. Because p = O and X and Y are normally distributed, X and Y are independent. Therefore, 


Ux = 0. 1mm 6x=0.00031mm uy = 0.23mm oy=0.00017mm 
Probability X is within specification limits is 


P(0.099535 < X < 0.100465) = - m Z E m 


«C e 
0.00031 0.00031 
= P(-1.5«Z «1.5)2 P(Z «1.5) - P(Z < —-1.5) 
= 0.8664 


Probability that Y 1s within specification limits 1s 


P(0.22966 < X < 0.23034) = a a EU "E 


0.00017 0.00017 
= P(-2«Z2«2)-2P(Z«2)- P(Z «-2) 
= 0.9545 


Probability that a randomly selected lamp is within specification limits is (0.8664)(.9594)=0.8270 


Section 5-7 


5-87. a) E(2X + 3Y) = 2(0) + 3(10) = 30 
b) V2X + 3Y) =4V(X) + 9V(Y) = 97 
c) 2X + 3Y is normally distributed with mean 30 and variance 97. Therefore, 


P(2X +3Y «30) - P(Z < “a? = P(Z«0)20.5 
d) P(2X +3Y < 40) = P(Z < 83%) = P(Z < 1.02) = 0.8461 


5-89 a) Let T denote the total thickness. Then, T= X + Y and E(T) = 4 mm, 
V(T) = 0.1% +0.1% = 0.02mm? , and of = 0.1414 mm. 


b) 
P(T > 4.3)=P| Z > e 
0.1414 


-]- P(Z < 2.12) 21- 0.983 = 0.0170 





= P(Z » 2.12 
| l ) 2.12) = 1 - 0.983 = 0.017 


5-93. a) Let X denote the average fill-volume of 100 cans. O ra EP — 0.05. 


b) E(X)=12.1 and P(X <12) = fz < ms P(Z < —2) = 0.023 





c) P(X < 12) = 0.005 implies that fz < L ns i 0.005. 
Then oH =-2.58 and U = 12.129. 
d.) P(X < 12) = 0.005 implies that P| Z < 2e = 0.005. 
Oo / 4100 


Then EL = -2.58 and O = 0.388. 





c / 100 
= 12-12.1 
e.) P(X <12)=0.01 implies that P| Z < ———— |= 0.01. 
0.5/ In 
Then tah = -2.33 and n = 135.72 = 136. 


Supplemental Exercises 


5-97. a) P(X <0.5,Y <1.5)= fy, (0,1) + fy, (0,0) =1/8+1/4 23/8. 

b) PX S1) = f yy (0,0) + f xy OD + f xy (L0) + f xy UD 23/4 
c) PY <1.5) = fy (0,0) + fay COD + fay (0) + fay (1) 2 3/4 
d) P(X > 0.5, ¥Y <1.5) = fy, (1,0) + fa (Ll) 23/8 
e) E(X) = 0(3/8) + 1(3/8) + 2(1/4) = 7/8. 

V(X) = 0°(3/8) + 1*(3/8) + 27(1/4) - 7/8 =39/64 

E(Y) = 1(3/8) + 0(3/8) + 2(1/4) = 7/8. 
.. V(Y) = 1°(3/8) + 0°(3/8) + 2°(1/4) - 7/8° =39/64 


1 1 1 A 
5-105. a) P(X «lY <1)= | | ix^ ydydx = iix = 
0 0 0 


0 














2:24 2.5 12 
b) P(X <2.5)= | Fx ydydx -|Lx m dx2il£| =0.5787 
0 0 0 0 0 
oe 3 12 3 
c) P( «Y «2.5) = | | x^ ydydx = ix L dx =x x 
0 1 0 1 0 








d) 
3 LS 3 











P [1.5 , [2 
P(X »2J«Y «1.5)- | | ix ydydx = ix L dx = 3 
2- | 2 1 2 
= 2 =0.2199 
3 2 : 3 3 
e) EQ) - [| x ydydx2 | Lx 2dx2i£&| =2 
0 0 0 0 
3 3 
4 


18 
0 





32 
f EQ)» | | x^ y^ dydx = La s gps 
0 0 


5-107. The region x? + y^ <1 and 0 «z «4 is a cylinder of radius 1 ( and base area TU) and height 4. Therefore, 


the volume of the cylinder is 470 and fxyz (x,y,z) = i for X^ + y^ € 1 and 0 «z« 4. 
T 


a) The region X? +Y? «05 isa cylinder of radius V0.5 and height 4. Therefore, 
P(X? +y <05)=" =1/2. 

b) The region X*+Y2<05 and0<z<2isa cylinder of radius V0.5 and height 2. Therefore, 
P(X? «Y! 05,2 < 2) 2 422 =1/4 





og 
d fi (5) = AE ma f(z) = f[edyds = 1/4 
f; (D) x’ +y*<l 
1/4 
for 0 <z<4. Then, fony) = =} for x^ +y es 
4 1-x? 4 
d) fx G7 ] | i dydz =|} l-x'dz2241-x^ fo-1«x«1 
0 vix 0 


5-111. Let X, Y, and Z denote the number of problems that result in functional, minor, and no defects, 
respectively. 


a) P(X 2, Y 25) P(X =2,Y 45,2 43) =3%0.2°0.5°0.3° = 0.085 


2158! 
b) Z is binomial with n = 10 and p = 0.3. 
c) E(Z) = 10(0.3) = 3. 


5-115. Let X denote the average time to locate 10 parts. Then, E( X ) -45 and OF = MIT 
a) P(X > 60) = P(Z» Son) = P(Z > 1.58) = 0.057 


b) Let Y denote the total time to locate 10 parts. Then, Y > 600 if and only if X > 60. Therefore, the 
answer is the same as part a. 





5-119 . Let T denote the total thickness. Then, T = X, + X; and 
a) E(T)=0.5+1=1.5 mm 
V(T)=V(X,) +V(X>) + 2Cov(XX7)=0.01+0.04+2(0.14)=0.078mm* 
where Cov(XY )=p0x6y=0.7(0.1)(0.2)=0.014 


5-121 


b) P(T<1)=P|Z< ra 
0.078 





= P(Z «-641) = 0 
c.) Let P denote the total thickness. Then, P = 2X, +3 X, and 
E(P) =2(0.5)+3(1)=4 mm 
V(P)-4V(X) *9V(X5) + 22)3)Cov(X4X5)74(0.01)2-9(0.04)-2(2)(3)(0.014)-0.568mm 
where Cov(XY)-pox0oy-0.7(0.1)(0.2)-0.014 








Let X and Y denote the percentage returns for security one and two respectively. 
If ^ of the total dollars is invested in each then ⁄2X+ ‘AY is the percentage return. 
E(2X+ '^Y )-5 million 
V(V5X-- V5Y)-1/4 V(X)*-1/AV(Y)-2(1/2)((1/2)Cov(X, Y) 
where Cov(XY )=p0x0y=-0.5(2)(4)=-4 
V(AX4 AY )=1/4(4)+1/4(6)-2=3 
Also, E(X)=5 and V(X) =4. Therefore, the strategy that splits between the securities has 
a lower standard deviation of percentage return. 





Chapter 6 Selected Problem Solutions 


Sections 6-land 6-2 


6-1. Sample average: 
n 


X. 
c 592.035 


n 


— 74.0044 mm 





Sample variance: 


8 
> x, = 592.035 
i=l 


8 
Y x; = 43813.18031 
i=l 








" 2 
Yx I ] 43813.18031— 092.035) 
n 


o. uec 


n-1 8-1 


— 9.000156? _ 9000022414 (mm) 


Sample standard deviation: 


s = 40.000022414 = 0.00473 mm 


The sample standard deviation could also be found using 


8 
where >'(x;-x) =0.0001569 
i=l 





------- pee-ceemc-epocecel———peeesc-eccsepeecscessep-os-e-ee--diameter 
7323920 74.0000 74.0080 74.0160 74.0240 74.0320 


There appears to be a possible outlier in the data set. 


2^ 2574] 


611. a)x-z-EL— Tm TAA 


6-13. 


"s 

2 (2x) 

Su e 
n-1 8-1 7 


s = 0.000427 = 0.02066 


= 0.000427 


c) Examples: repeatability of the test equipment, time lag between samples, during which the pH of the 


solution could change, and operator skill in drawing the sample or using the instrument. 


a) X = 65.85 
s= 12.16 


b) Dot Diagram 


c) Removing the smallest observation (31), the sample mean and standard deviation become 
x = 66.86 
s — 10.74 


Section 6-3 


6-15 


6-19. 


6-25 


a.) Stem-and-leaf display for Problem 6-15 cycles: unit - 100 1/2 represents 1200 


1 0T|3 

1 OF | 

5 08|7777 
10 00|88899 
22 1*|000000011111 
33 IT|22222223333 
(ES) 1F | 444445555555555 
22 1S|66667777777 
11 1o| 888899 

5 2*|011 

2 27 2 


b) No, only 5 out of 70 coupons survived beyond 2000 cycles. 


Descriptive Statistics 


Variable N Median Q1 Q3 
cycles 70 1436.5 1097.8 1735.0 


Stem-and-leaf display for Problem 6-25. Yard: unit = 1.0 
Note: Minitab has dropped the value to the right of the decimal to make this display. 


4 23*|2334 

Te “2301677 
15 24*|00112444 
19 240|5578 
32  25*|0111122334444 
45 250/5555556677899 
(15) 26*|000011123334444 
40 260|566677888 
31 27*|0000112222233333444 
12 270|66788999 

4 28*|003 


Èi .28e015 


n 


2.5, 


i-l 
n 


Sample Mean x — 





100 
Y 

| 4v 26070 
100 100 





sample Standard Deviation 


100 





100 


— 260.7 yards 





yx, =26070 and X$ x? 26813256 
i=l i=l 
; 2 
2x (26070) 
2 j=] 
Y-Y 6813256- 
Fa n 16807 
S = = m 
n-li 100-1 99 
=169.7677 yards’ 
and 
s =4169.7677 =13.03 yards 
Sample Median 
Variable N Median 
yards 100 261.15 
Section 6-5 
6-43. Descriptive Statistics 
Variable N Mean Median Tr Mean 
PMC 20 4.000 4.100 4.044 
Variable Min Max Q1 pu 
PMC 2.000 5.200 3.150 4.800 
a) Sample Mean: 4 
b) Sample Variance: 0.867 
Sample Standard Deviation: 0.931 
c) 
5 
4 
O 
= 
A 
3 
2 
6-47. 
Descriptive Statistics 
Variable N Mean Median Tr Mean 
temperat 24 48.125 49.000 48.182 
Variable Min Max Q1 Q3 


StDev 
0.931 


StDev 
2.692 


SE Mean 
0.208 


SE Mean 
0.549 


temperat 43.000 52.000 46.000 50.000 


a) Sample Mean: 48.125 
sample Median: 49 
b) Sample Variance: 7.246 
sample Standard Deviation: 2.692 


c) 


52 
51 
50 
49 
48 
47 
46 
45 
44 
43 


temperatur 


The data appear to be slightly skewed. 


Supplemental 


6-75 a) Sample 1 Range = 4 

sample 2 Range = 4 
Yes, the two appear to exhibit the same variability 

b) Sample 1 s = 1.604 
sample 2 s = 1.852 
No, sample 2 has a larger standard deviation. 

c) The sample range is a relatively crude measure of the sample variability as compared to the sample 
standard deviation since the standard deviation uses the information from every data point in the sample 
whereas the range uses the information contained in only two data points - the minimum and maximum. 


6-79 a)Stem-and-leaf display for Problem 6-79: unit = 1 1|2 represents 12 
1 Or |:3 
8 OF |4444555 
18 OS |6666777777 
UD 00|8888999 
15 1*|111 
12 TT 22239 
7 1F|45 
5 Te 
3 1o|899 


b) Sample Average = 9.325 
Sample Standard Deviation = 4.4858 


c) 











springs 


A 





























Index 10 20 30 40 


The time series plot indicates there was an increase in the average number of nonconforming springs 
made during the 40 days. In particular, the increase occurs during the last 10 days. 


7-13 


Chapter 7 Selected Problem Solutions 


Section 7-2 
7-T. E(0,) — 0 No bias V(O,) eI MSE(0,) 
E(0,)-0 No bias V(O,) =10 = MSE(O,) 
E (Ô 3) +0 Bias MSE (Ô 3) =6 [not that this includes (bias?)] 


To compare the three estimators, calculate the relative efficiencies: 
MSE(©,) 12 
MSE(Ó,) 10 
MSE(O,) 12 


— |.2, since rel. eff. » 1 use O , as the estimator for 0 


= n ' since rel. eff. » 1 use O 3 as the estimator for 0 
MSE(Q,) 6 
MSE(O,) 10 ^ 
PES) EIS , since rel. eff. > 1 use ©, as the estimator for 0 
MSE(Q,) 6 
Conclusion: 


^ ^ 


©, is the most efficient estimator with bias, but it is biased. OQ, is the best *unbiased" estimator. 


a.) The average of the 26 observations provided can be used as an estimator of the mean pull 
force since we know it is unbiased. This value is 75.427 pounds. 

b.) The median of the sample can be used as an estimate of the point that divides the population 
into a “weak” and “strong” half. This estimate is 75.1 pounds. 

c.) Our estimate of the population variance is the sample variance or 2.214 square pounds. 
similarly, our estimate of the population standard deviation 1s the sample standard deviation 
or 1.488 pounds. 

d.) The standard error of the mean pull force, estimated from the data provided is 0.292 pounds. 
This value is the standard deviation, not of the pull force, but of the mean pull force of the 
population. 

e.) Only one connector in the sample has a pull force measurement under 73 pounds. Our point 
estimate for the proportion requested is then 1/26 — 0.0385 


a.) To see if the estimator is unbiased, find: 
l l 
E[CX min T X max )/ 2] x 5 VP min) + E(X max )] = 2 +u)=u 


since the expected value of any observation arising from a normally distributed process is equal 
to the mean. So this is an unbiased estimator of the mean. 
b.) The standard error of this estimator is: 


V[CX min + X max )/ 2] => [V CX min ) + V (X max ) + COV (X gin | X max )] = zc? to?) = EE 


C.) This estimator is not better than the sample mean because it has larger standard error for n > 2. 
This is due to the fact that this estimator uses only two observations from the available sample. 


The sample mean uses all the information available to compute the estimate. 


) —— Eft) = E(aX, *(1-0)X5) = aEQG) + (1 20) E(X2) = Ot + (1 o)u =u 
b) 
s.e(fi) = JV(aX, +(1-a)X,) 2 Jo? V(X )+ (1-0) VCX,) 


2 2 2 
Oo Oo Oo (0J 
a? —+ (1-0) — = Ja —+ (1-a) a—— 
ni n, ni n, 
2 2 
a°n,+(1—-aQ@) an 
=o, "si ) l 
nn, 


c) The value of alpha that minimizes the standard error is: 












anj 
Q = — 
Ny + an; 


d) With a = 4 and n,72n;, the value of alpha to choose is 8/9. The arbitrary value of a=0.5 is too 
small and will result in a larger standard error. With o=8/9 the standard error is 


(8/9)°n, +(1/9)° 8n, 0.6676, 


2n, Jna 


se(û)=0, 


If &=0.05 the standard error is 


(0.5)°n, +(0.5)°8n, 1.06076, 


2n, ny 


s.e.(Il) 2 o, 


Section 7-5 


w P0 SX < 1.012) = PEE S Ths uc) 


= P(-1 < Z < 2) = P(Z €2)- P(Z <-1) 





= 0.9772 — 0.1587 = 0.8385 


U 


5 


735. Us =75.5psi, O- = = 1.429 








P(X 275.75) = Pen > 27555) 
= P(Z 2 0.175) 21- P(Z < 1.75) 


= ] — 0.56945 = 0.43055 


7-39 | c? -25 











n=12 
7-41 n= 36 
py PS, 
ne b-a*)-l- Gua f og 
12 12 12 3 
u, 22,0, 222/3 2/3 
X DE Me 36 6 
| X-u 
olin 


Using the central limit theorem: 


PO < X < 2.5) ou c«Z« LJ 








. 2/3 
6 6 


= P(0.7348 « Z « 3.6742) 
= P(Z < 3.6742) - P(Z < 0.7348) 
= 1—0.7688 = 0.2312 


7-43. 
n=16 n,=9 X,- X; - N(us -Uz O- +0” ) 





M,=75 4, =70 2 6? 
0,28  0,-12 SUM Hs e) 
j. 4 
2 2 
~ N(75 — 70, — + —) 
l6 9 
~ N (5,20) 


a) P(X, =X, > 4) 
P(Z > 48) = P(Z > -0.2236) = 1- P(Z € -0.2236) 


=] -0.4115 = 0.5885 


b) P(3.5€ X, X, € 5.5) 
PGA < Z <2) = P(Z <0.1118)- P(Z € —0.3354) 


= 0.5445 — 0.3686 = 0.1759 


Supplemental Exercises 


eet (5^ 2? 
749. X X, ~ N00-105 2 4+ 2) ~ N(-5,0.2233) 
Bue 25 25 


Chapter 8 Selected Problem Solutions 


Section 8-2 


8-1 a.) The confidence level for x — 2.140 / Jn <u<x+2.140/ Jn is determined by the 
by the value of z; which is 2.14. From Table II, we find (2.14) = P(Z«2.14) = 0.9793 and the 
confidence level is 97.93%. 
b.) The confidence level for x — 2.490 / Jn < u <x +2.490 / Jn is determined by the 
by the value of z; which is 2.14. From Table II, we find (2.49) = P(Z<2.49) = 0.9936 and the 
confidence level is 99.36%. 
c.) The confidence level for x —1.850 / Jn <u<x+1.850/ Jn is determined by the 


by the value of z; which is 2.14. From Table II, we find (1.85) = P(Z«1.85) = 0.9678 and the 
confidence level is 96.78%. 


8-7 a.) The 99% CI on the mean calcium concentration would be longer. 


b). No, that is not the correct interpretation of a confidence interval. The probability that u is between 
0.49 and 0.82 is either 0 or 1. 


C). Yes, this is the correct interpretation of a confidence interval. The upper and lower limits of the 
confidence limits are random variables. 


8-13 a) 95% two sided CI on the mean compressive strength 


Zap = Zoos = 1.96, and. x ^ 3250, o? = 1000, n=12 


= o = o 
x — rome | SUSX+ zos- 
31.62 31.62 


< u < 3250 +1.96 —— 
412 : 412 
3232.11 < u € 3267.89 


3250 —1.96 





b.) 99% Two-sided CI on the true mean compressive strength 
Za/2 = Zo.905 = 2.58 


= o - (e 
X — 20.005 EJ < U < X + Zo 005 EJ 


3250— (2 < u < 3250+ (2 


V12 V12 


3226.5 € u € 3273.5 





8-15 Set the width to 6 hours with 6 = 25, Zo 975 = 1.96 solve for n. 
1/2 width = (1.96)(25) / Vn 23 


49 2 34/n 
2 
"m = — 266.78 


Therefore, n=267. 


Section 8-3 


8-25 a.) The data appear to be normally distributed based on examination of the normal probability plot below. 
Therefore, there is evidence to support that the level of polyunsaturated fatty acid is normally distributed. 


Normal Probability Plot for 8-25 
ML Esti 95% CI 


timates - 





Percent 
Cc 
eo 
[spes pere fp vp] 














D ata 


b.) 99% CI on the mean level of polyunsaturated fatty acid. 
For a= 0.01, ty/2,n-1 E to.005,5 = 4.032 


£. S = S 
xc tans Susxt tans 


1698- 4.032| 251^ | < u <1698+4.032| “3 
V6 


V6 


16.455 < u < 17.505 


8-29 95% lower bound confidence for the mean wall thickness 
given x - 4.05 s=0.08 n=25 


to.n-1 = 100554 = 1.711 


X= == s 
0.05,24 Jn 
0.08 

4.05-1.71 — |< u 
425 

4.023 <u 


It may be assumed that the mean wall thickness will most likely be greater than 4.023 mm. 


8-31 x=1.10 s=0.015 n=25 


95% CI on the mean volume of syrup dispensed 


For & = 0.05 and n = 25; tw2.n-1 = 10.025,24 — 2.064 





= S = S 
x —t — ISusx-t —— 
0.025,24 Em u 0.025,24 n | 
1.10 — 2.064 uP < u <1.10+2.064 MU) 
X25 
1.093 € u € 1.106 
Section 8-4 
8-35 99% lower confidence bound for o? 


For a= 001 andn=15, 4,1 = Xoorsa 729.14 


14 : 

(0.008) Jee 
29.14 

0.00003075 < © 


8-37 95% lower confidence bound for o^ given n = 16, s? = (3645.94)? 
For & = 0.05 and n = 16, iud = 15.05.15 = 25 


15(3645.94) 
5(3645.94)! —.—., 


25 
7.975,721.09 < o? 


8-39 95% confidence interval for o: given n = 51, s = 0.37 
First find the confidence interval for o? : 


For o = 0.05 and n = 51, X&/2,n-1 = X0.025,50 = 71.42 and X1. 9/21 = X0.975,50 = 32.36 
2 2 
ue) dte A 
(71.42) (32.36) 
0.096 < o° < 0.2115 


Taking the square root of the endpoints of this interval we obtain, 


0.31 < © < 0.46 


8-41 90% lower confidence bound on o (the standard deviation of the sugar content) 
given n= 10, s? = 23.04 


For a=0.1 andn = 10, 45, , = Xo19 = 19.02 
9(23.04 
( lag 


14.68 
14.13 € o? 


Take the square root of the endpoints of this interval to find the confidence interval for o: 
3.8 S6 


Section 8-7 


8-63 99% tolerance interval on the polyunsaturated fatty acid in this type of margarine that has a confidence level 
of 9576 
x = 16.98 s = 0.319 n=6 and k = 5.775 
x —ks,x+ks 


16.98 —5.775(0.319), 16.98 + 5.775(0.319) 
(15.14, 18.82) 


The 99% tolerance interval is much wider than the 99% confidence interval on the population mean 
(16.46 € u < 17.51). 


8-67 90% lower tolerance bound on bottle wall thickness that has confidence level 90%. 
given X =4.05 s=0.08 n=25 and k= 1.702 
x — ks 
4.05 —1.702(0.08) 
3.91 
The 90% tolarance bound is (3.91, e») 


The lower tolerance bound is of interest if we want to make sure the wall thickness is at least a certain value 
so that the bottle will not break. 


8-69 95% tolerance interval on the syrup volume that has 90% confidence level 
x=1.10 s=0.015 n=25 and k=2.474 


x —ks,x+ks 
1.10 —2.474(0.015), 1.10 + 2.474(0.015) 
(1.06, 1.14) 


Supplemental Exercises 


8-75 With o = 8, the 95% confidence interval on the mean has length of at most 5; the error is then E = 2.5. 








2 2 
ayn =| L25 | g? = i 64 = 39.34 =40 
2.5 25 
i 1.96 Y 
bn =| 2% |g =|- | 36 = 22.13 -23 
2.5 25 


As the standard deviation decreases, with all other values held constant, the sample size necessary to 
maintain the acceptable level of confidence and the length of the interval, decreases. 


8-79 | Normal probability plot for the coefficient of restitution 


Normal Probability Plot for 8-79 


ML Estimates - 9596 CI 





Percent 
8 
| | | | | 














0.59 0.60 0.61 0.62 0.63 0.64 0.65 0.66 
Data 


b.) 99% CI on the true mean coefficient of restitution 
X = 0.624, S= 0.013, n = 40 tay2, n-1 ~ to.005, 39 = 2.7079 


= S = S 
X — lo 095,39 ic S U S X +h y 095.39 
n 


vn 


ae < u € 0.624 + 2.7079 9-015 


V40 — v40 
0.618 < u < 0.630 


b.) 99% prediction interval on the coefficient of restitution for the next baseball that will be tested. 


_ | ] » l 
X — £095,398 i S Xaa SX t Í9995395 l+ 5 


0.624 — 2.7079(0.013),/1 + — € x,,, € 0.624 + 2.7079(0.013), {1+ = 


0.624 — 2.7079 





0.588 < x,,, < 0.660 


c.) 99% tolerance interval on the coefficient of restitution with a 95% level of confidence 


(x — ks, x 4 ks) 
(0.624 — 3.213(0.013), 0.624 + 3.213(0.013)) 
(0.583, 0.665) 


e.)The confidence interval in part (b) describes the confidence interval on the population mean and we may 
interpret this to mean that 99% of such intervals will cover the population mean. The prediction interval 
tells us that within that within a 99% probability that the next baseball will have a coefficient of restitution 
between 0.588 and 0.660. The tolerance interval captures 99% of the values of the normal distribution 
with a 95% level of confidence. 


8-83 a.) 95% Confidence Interval on the population proportion 
n-1200x-8 | p = 0.0067 Za2=Zo.02s=1.96 


r p(l- £ : 5(1— £f 
P— Zan A T pd - p) 
i n 
QUOC ed Ged = 2 zo T 5990/4 — 40090) 
1200 Dm 


0.0021 € p € 0.0088 


b.) Yes, there is evidence to support the claim that the fraction of defective units produced is one percent 
orless. This is true because the confidence interval does not include 0.01 and the upper limit of the 
control interval is lower than 0.01. 


Chapter 9 Selected Problems Solutions 


Section 9-1 

9-] a) H 9 H= 25, H QA 25 Yes, because the hypothesis is stated in terms of the parameter of 
interest, inequality is in the alternative hypothesis, and the value in the null and alternative 
hypotheses matches. 


b) H 0:0 > 10, H ,:0 = 10 No, because the inequality is in the null hypothesis. 


c) H 0: x =50, H iX 50 No, because the hypothesis is stated in terms of the statistic rather 

than the parameter. 

d Hy): p= 0.1, H, : p — 0.3 No, the values in the hull and alternative hypotheses do not match and 
both of the hypotheses are equality statements. 

e) H 0-$— 30, H 1S > 30 No, because the hypothesis is stated in terms of the statistic rather than the 





parameter. 
= X-u _ 115-12 
9-3 a)a=P(X<11.5|/u=12)=P < — — |= P(Z < —4)= 1 - P(Z <4) 
o/4n 05/416 
=1-1=0. 


The probability of rejecting the null, when the null is true, is approximately 0 with a sample size of 16. 


Kath: diy5s51125 


> ———— 
o/Vv¥n 05/16 





b) B= P(X > 115 |W=1125) «d J-ra@>a=1-ra@s9 


= ]- 0.97725 = 0.02275. 
The probability of accepting the null hypothesis when it is false is 0.02275. 


| 190-175 


a) Z ——— —— ——- 
20/4/10 


mean foam height is greater than 175 mm. 
b) P(X > 190 when u = 175) 
E X-175 _ 190- | 


9-9 = 2.37, Note that z is large, therefore reject the null hypothesis and conclude that the 


201410 — 20/410 
= P(Z > 2.37) = 1 — P(Z < 2.37) 
=] — 0.99111 
= 0.00889. 
The probability that a value of at least 190 mm would be observed (if the true mean height is 175 mm) is 
only 0.00889. Thus, the sample value of x = 190 mm would be an unusual result. 


9-17. | The problem statement implies Ho: p = 0.6, Hı: p > 0.6 and defines an acceptance region as 


P < a = 0.63 and rejection region as P » 0.63 


0.63 — 0.6 


[0.6(0.4) 
500 


= P(Z 21.37) 1- P(Z « 1.37) = 0.08535 
b) B = P( P < 0.63 when p = 0.75) = P(Z € -6.196) = 0. 


a) æ = P(P > 0.63 | p = 0.6)= P| Z > 





Section 9-2 


9-2]. a) 1) The parameter of interest is the true mean yield, u. 
2) Ho: w= 90 
3) H,: u #90 
4) o. — 0.05 
x-H 
o/ 4n 
6) Reject Ho ifz; < —zy5 where —Z 925 =—1.96 or Z > Zy2 where Zo 025 = 1.96 
7) X = 90.48, o=3 


5) Zo = 





_ 90.48 —90 


zu MON 
rS 


8) Since —1.96 < 0.36 < 1.96 do not reject Hg and conclude the yield is not significantly different from 
90% at a = 0.05. 
b) P-value = 2[1 —®(0.36)] = 2[1— 0.64058] = 0.71884 
c)n= (zaz T Zg Fo? - m + Zoos )3 = (1.96 + 1.65 )9 
ô’ (85 - 90 Y (-5y 


= 0.36 


= 4.67 


5 
I 
UA 


90 — 92 90 — 92 

d)p [ns * NE of Z0025 + SUE ) 

= (1.96 + 1.491) — P(—1.96 + 1.491) 

= $(0.47) — $(-3.45) 

= $(0.47) — (1 — (3.45) 

= 0.68082 — ( 1 — 0.99972) 

— 0.68054. 
e) For & = 0.05, zy» = Zoo»5 = 1.96 


_ o _ o 
X — rome | SUSE rome 


3 3 
90.48 — 1.96} — | < u € 9048 + 1.96) — 
[s | [s | 
87.85 < u < 93.11 
With 95% confidence, we believe the true mean yield of the chemical process is between 87.85% and 
93.11%. 


9-25. a) 1) The parameter of interest is the true mean tensile strength, u. 

2) Ho : u = 3500 

3) Hı: u #3500 

4)a=0.01 

x-u 

o/ 4n 
6) Reject Ho if Zọ <—Z y2 where —z9995 = —2.58 or Z > Zy2 where Z99o5 = 2.58 
7) x = 3250, o = 60 





5) Zo = 


_ 3250-3500 


P 29 
^ — 60/412 


8) Since —14.43 < —2.58, reject the null hypothesis and conclude the true mean compressive strength is 
significantly different from 3500 at & = 0.01. 


= —14.43 


b) Smallest level of significance = P-value = 2[1 — (14.43) |= 2[1 2 1] 2 0 


The smallest level of significance at which we are willing to reject the null hypothesis is 0. 
C) Zgj2 = Zo.025 = 1.96 


B o = o 
k= rome | SUSE rome | 


3250 — iq : =| < u < 3250+ Ld 


V12 V12 


3232.11 < u € 3267.89 





With 95% confidence, we believe the true mean tensile strength is between 3232.11 psi and 3267.89 
psi. We can test the hypotheses that the true mean strength is not equal to 3500 by noting that the value is 
not within the confidence interval. 


9-27 a) 1) The parameter of interest is the true mean speed, u. 
2) Ho : u= 100 
3) H,: u< 100 
4) a. — 0.05 
x-p 
o/ 4n 
6) Reject Ho if Zọ < —, where —z595 ^ —1.65 


7) x 102.2, o=4 


5) Zo = 


~ 102.2 -100 


PEE SL itu 
° 4/48 


8) Since 1.55» —1.65, do not reject the null hypothesis and conclude the there is insufficient evidence to 
conclude that the true speed strength is less than 100 at ox = 0.05. 


=1.55 


| (95-100)V8 
4 
Power = 1-B = 1-0.97062 = 0.02938 


(zatz a” (Coos +2015)? (0.657103 (Y 


b D = Ø| — Zi o5 = @(-1.65 - —3.54) = ®(1.89) = 1 


y s = 0.927, 
n 8? (95—100)? (5) 
nzl 
O 
d) X — Zoos m Su 


102.2-1.65| - |< u 


«E 
99.866 < u 


Since the lower limit of the CI is just slightly below 100, we are confident that the mean speed is 
not less than 100 m/s. 


9-29 a) 1) The parameter of interest is the true average battery life, u. 
2)Ho: u=4 
3)H,;: u>4 
4) a=0.05 
x-H 
ix o/ 4n 
6) Reject Ho if Zo > z, where Zoos = 1.65 
7) x 2 4.05, o=0.2 





405-4 _| 


Z, ——————-— 
^ 02/4/50 


8) Since 1.77>1.65, reject the null hypothesis and conclude that the there is sufficient evidence to 
conclude that the true average battery life exceeds 4 hours at o = 0.05. 


| (4.5- 4450 
0.2 


Power = 1-B = 1-07 1 


b) B =D] Zoos = $(1.65 — 17.68) = ®(-16.03) = 0 


= 34.7 
ô’ (4.5 — Ay (0.5y 


dero s Fear 


vd 


4.008€ u 
Since the lower limit of the CI is just slightly above 4, we conclude that average life is greater than 4 
hours at o=0.05. 


Section 9-3 
9-3] a. 1) The parameter of interest is the true mean female body temperature, u. 
2) Ho : u = 98.6 
3)H, : w# 98.6 
4) o = 0.05 
y= 
TAE 
s/n 


6) Reject Ho if lto] > to2.n-1 where tov2,n-1 = 2.064 
7) x =98.264, s=0.4821 n=25 
98.264 -98.6 - 


f, = ——_____—- = 
^ 0.4821 / 4/25 


8) Since 3.48 > 2.064, reject the null hypothesis and conclude that the there is sufficient evidence to 
conclude that the true mean female body temperature is not equal to 98.6 °F at a= 0.05. 
P-value = 2* 0.001 = 0.002 


jas 9 Hus] 198-986] 15, 


O O 0.4821 


—3.48 


Using the OC curve, Chart VI e) for «= 0.05, d= 1.24, and n 7» 25, we get B = 0 and 
power of 1—0 = 1. 


|u- H| |982-98.6| - 








c) d= — = ———— 0.83 
O O 0.4821 
Using the OC curve, Chart VI g) for œ = 0.05, d = 0.83, and B = 0.1 (Power=0.9), 
. n +1 20+1 
n =20. Therefore, Nn = ) = ) = 10.5 and n=11. 


d) 95% two sided confidence interval 


e) 


9-37. 


x —t RE Suszx-t pu. 
X — SuUSx 
0.025,24 0.025,24 
Vn Vn 


4821 
Us J= u 98.264 +2.064{ 


X25 
98.065 € u € 98.463 


We can conclude that the mean female body temperature is not equal to 98.6 since the value is not included 
inside the confidence interval. 


J25 


98.264 — 2.064 0.4821 | 


Normal Probability Plot for 9-31 


ML Estimates - 95% Cl 





Percent 
Cc 
eo 
| 














Data appear to be normally distributed. 


a.) In order to use t statistics in hypothesis testing, we need to assume that the underlying distribution 
is normal. 

1) The parameter of interest is the true mean coefficient of restitution, LL. 

2) Ho : u = 0.635 

3) Hi : u > 0.635 

4) a. — 0.05 


X-u 
s/n 


6) Reject Ho if tg > Ío.n-1 where 10.05.39 = 1.685 
7) X=0.624 s=0.013 n=40 


| 0.624-0.635 _ 
0.013/4/40 


8) Since —5.25 « 1.685, do not reject the null hypothesis and conclude that there is not sufficient evidence to 
indicate that the true mean coefficient of restitution is greater than 0.635 at a = 0.05. 





5) to = 


—5.35 


b.)The P-value > 0.4, based on Table IV. Minitab gives P-value = 1. 


T Ô |uU-L,| |0.64—0.635| 
C i RARE D P ——— 
O O 0.013 


= 0.38 


Using the OC curve, Chart VI g) for œ = 0.05, d = 0.38, and n= 40, we get B = 0.25 and 
power of 1—0.25 = 0.75. 


ó |u-L,|_ 10.638- 0.635 | 


d)d= — 
O 0.013 
Using the OC curve, Chart VI g) for œ = 0.05, d= 0.23, and B = 0.25 (Power=0.75), 


= 0.23 


n +l 7541 

















n = 75. Therefore, n = ) = 38 and n=38. 
9-41 a) In order to use t statistics in hypothesis testing, we need to assume that the underlying distribution 
is normal. 
1) The parameter of interest is the true mean concentration of suspended solids, u. 
2)Hg:p 7 55 
4) a. — 0.05 
bam 
5) to = H 
s/ An 
6) Reject Ho if Ito | > to2.n-1 where 10.025,59 —2.000 
7) x 59.87 s=12.50 n=60 
59.87 — 55 
to ———— ———— = 3.018 
12.50/60 
8) Since 3.018 > 2.000, reject the null hypothesis and conclude that there is sufficient evidence to indicate that 
the true mean concentration of suspended solids is not equal to 55 at a = 0.05. 
b) From table IV the t0 value is found between the values of 0.001 and 0.0025 with 59 degrees of 
freedom, so 2*0.001<P-value = 2* 0.0025 Therefore, 0.002« P-value«0.005. 
Minitab gives a p-value of 0.0038 
50-55 
c) d = ——— — —— = 0.4 , n=60 so, from the OC Chart VI e) for a= 0.05, d= 0.4 and n=60 we find that 
12.50 
B=0.2. Therefore, the power = 1-0.2 = 0.8. 
d) From the same OC chart, and for the specified power, we would need approximately 38 observations. 
50-55 
d = —————— — 0.4 | Using the OC Chart VI e) for « = 0.05, d= 0.4, and B = 0.10 (Power=0.90), 
12.50 
; n +1 7541 
n -—'[5. Therefore, n = = — 38 and n=38. 
2 ? 
Section 9-4 
9-43 a) In order to use the Y? statistic in hypothesis testing and confidence interval construction, we need to 


assume that the underlying distribution is normal. 
1) The parameter of interest 1s the true standard deviation of the diameter, o. However, the answer can 
be found by performing a hypothesis test on o°. 
2) Ho: o = 0.0001 
3) H; : o? > 0.0001 
4) à. — 0.01 


n- Ds 
5xi- 99 


6) Reject Ho if x5 > X5 4. where X59114 = 29.14 
7) n= 15, s? = 0.008 
> (n-Ds^ 14(0008) 
^ 6 000 
8) Since 8.96 < 29.14 do not reject Hp and conclude there is insufficient evidence to indicate the true 


standard deviation of the diameter exceeds 0.01 at œ = 0.01. 
b) P-value = P(y” > 8.96) for 14 degrees of freedom: 0.5 « P-value « 0.9 


= 8.96 


o 0.0125 


c) A=—= =12) power = 0.8, B=0.2 
O, 0.01 
using chart VIk, the required sample size is 50 
9-47. a) In order to use x? statistic in hypothesis testing and confidence interval construction, we need to assume 


that the underlying distribution is normal. 

1) The parameter of interest 1s the true standard deviation of titanium percentage, o. However, the 
answer can be found by performing a hypothesis test on o°. 

2) Hyp: 6^ = (0.25 

3) H; : 0^ z (0.25) 

4) o. — 0.01 


5 g= ODE 
6) Reject Ho if X5 < Xí-a/2,n-1 Where X5995,59 = 27.99 or X% > Xo,2,n-1 Where X590559 = 79.49 
7)n=51,s=0.37 
2 (n—1)s* 50037 
^ c . (s 
8) Since 109.52 > 79.49 we would reject Hp and conclude there is sufficient evidence to indicate the true 
standard deviation of titanium percentage is significantly different from 0.25 at a= 0.01. 


= 109.52 


b) 95% confidence interval for o: 
First find the confidence interval for o? : 


For œ = 0.05 and n = 51, X&/2,n-1 = X0.025,50 = 71.42 and X}_o/2,n-1 = X0.975,50 = 32.36 
2 2 
UAM mo ed 
(71.42) (32.36) 
0.096 € o^ € 0.2115 


Taking the square root of the endpoints of this interval we obtain, 


0.31 < o < 0.46 


Since 0.25 falls below the lower confidence bound we would conclude that the population standard 
deviation is not equal to 0.25. 


| | "e 40 
9-49 Using the chart in the Appendix, with A = T = 1.49 and B= 0.10, we find 
n= 30. 
Section 9-5 


9-5] p- 0.15, po=0.10, n=85, and z5,71.96 


B= Po —-ptZ,34 Pol- po)/ n Po = 92 53) Pod- po) n 


4p. — p)/n 4p. —- p)/n 
0.10—0.15--1.96/0.10(1—0.10)/85 0.10—0.15—1.96/0.10(1—0.10)/85 


0.151—0.15)/85 0.151-0.15/85 
dX0.36) -P2.94 20.6406-0.0016- 0.639 


2 
Zai Po- Do) —Z 8 X PUl—p) 
P~Po 
2 
1.960.100 —0.10) —1.28/0.151—0.15) 
0.15—0.10 
-(1085)* 2117632118 








9-53. a) Using the information from Exercise 8-51, test 
2) Ho : p = 0.05 
3) H; :p<0.05 
4) a=0.05 
X—n 
5) Zo e —- LIAE UI. Either approach will yield the same conclusion 
"po (1 2 Po) BN p - p.) 
6) Reject Ho if Zọ < —z, where —z, = —z995 =—1.65 
7) x - 13 n 300 j= <= = 0043 
300 
x— np, 13 — 300(0.05) 
Zo SS). =e —————— — — 
Jnp,(0— p,)  4300(0.05)(0.95) 
8) Since —0.53 > —1.65, do not null hypothesis and conclude the true fraction of defective integrated 
circuits is not significantly less than 0.05, at œ = 0.05. 
b) P-value = 1 — ®(0.53) = 0.29806 
9-57. The problem statement implies that Ho: p = 0.6, H;: p > 0.6 and defines an acceptance region as 
p< = = 0.63 and rejection region as pl» 0.63 
a) The probability of a type 1 error is 
i 0.63 — 0.6 
a = P(p 2 0.63| p = 0.6)= P| Z > ————— [= P(Z 21.37) =1- P(Z «1.37) = 0.08535 
0.6(0.4) 
500 
b) B = P( P< 0.63 | p = 0.75) = P(Z € -6.196) = 0. 
Section 9-7 
9-59. 


Value 0 l 2 3 4 
Observed Frequency 24 30 31 11 4 
Expected Frequency 30.12 36.14 21.69 8.67 2.60 


Since value 4 has an expected frequency less than 3, combine this category with the previous category: 
Value 0 l 2 3-4 
Observed Frequency 24 30 31 15 
Expected Frequency 30.12 36.14 21.69 11.67 


The degrees of freedom are k —-p—1=4-0-1=3 


a) 1) The variable of interest 1s the form of the distribution for X. 
2) Ho: The form of the distribution is Poisson 
3) H,: The form of the distribution is not Poisson 
4) a. — 0.05 
5) The test statistic 1s 
2 
2 . (0, NH E, 
Xo — 
i-l E, 
6) Reject Ho if X5 > x6053 = 7.81 
2 2 2 2 
, (24-3012) (30-3614) (31-2169) (15-1167) _ 


7) Xo = tt 9 at £7.23 
3012 3614 21.69 11.67 


8) Since 7.23 < 7.81 do not reject Hy. We are unable to reject the null hypothesis that the distribution of 
X is Poisson. 


b) The P-value is between 0.05 and 0.1 using Table III. P-value = 0.0649 (found using Minitab) 


9-63 The value of p must be estimated. Let the estimate be denoted by Panels 
0(39) + 1(23) - 2(12)  X(1) 


sample mean — — 0.6667 
75 
s sample mean 0.6667 
sample = 2 0.02778 
n 
Value 0 l 2 3 
Observed 39 23 12 l 


Expected 38.1426 26.1571 8.5952 1.8010 
Since value 3 has an expected frequency less than 3, combine this category with that of value 2: 


Value 0 ] 2-3 
Observed 39 23 13 
Expected 38.1426 26.1571 10.3962 


The degrees of freedom are k- p- 1=3 -1-1-71 
a) 1) The variable of interest is the form of the distribution for the number of under-filled cartons, X. 
2) Ho: The form of the distribution is binomial 
3) Hı: The form of the distribution is not binomial 
4) a. — 0.05 
5) The test statistic 1s 


6) Reject Ho if X5 > X695 = 3.84 


2 2 2 
39 — 38.1426 23 — 26.1571 13—10.3962 
7) ccc Ce MEN GIGS NN 
38.1426 26.1571 10.39 
8) Since 1.053 < 3.84 do not reject Hy. We are unable to reject the null hypothesis that the distribution 


of the number of under-filled cartons is binomial at o = 0.05. 


b) The P-value is between 0.5 and 0.1 using Table III P-value = 0.3048 (found using Minitab) 


Section 9-8 


9-65. 1. The variable of interest is breakdowns among shift. 
2. Ho: Breakdowns are independent of shift. 


3. Hı: Breakdowns are not independent of shift. 
4. Q, — 0.05 
5. The test statistic 1s: 
2 
Foa CX (0, -E ij 
Xo = 
i=l j=l E 


6. The critical value 1s pa = 12.592 
7. The calculated test statistic is 4 i =11.65 


8. xó ? xó 056 ^ do not reject Ho and conclude that the data provide insufficient evidence to claim that 


machine breakdown and shift are dependent at o = 0.05. 
P-value = 0.070 (using Minitab) 


9-69. . The variable of interest 1s failures of an electronic component. 
. Ho: Type of failure is independent of mounting position. 

. H,: Type of failure is not independent of mounting position. 

. 0 — 0.01 


. The test statistic 1s: 


Un d» WN Re 


6. The critical value 1s ym —]1.344 
7. The calculated test statistic is Y à = 1-057] 


2 2 ; : 
8. Xo FX 0.01,3 » do not reject Ho and conclude that the evidence is not sufficient to claim that the type of 


failure is not independent of the mounting position at a = 0.01. 
P-value = 0.013 


Supplemental 


9-75. 6-8, 5=204-200=-4, 5 = 0.025, zo «s = 1.96. 


a)n=20: B= ois z 2d = @(-0.28) = 1 — (0.28) = 1 — 0.61026 = 0.38974 


Therefore, power = 1 — B = 0.61026 


b)n-50: B= ofiss = +f) = @(-2.58) = 1 — &(2.58) = 1 — 0.99506 = 0.00494 


Therefore, power = 1 — B = 0.995 


44100 
8 





c)n=100: B= ofiss- | = $(—3.04) = 1 — (3.04) = 1 — 0.99882 = 0.00118 

Therefore, power = 1 — B = 0.9988 

d) As sample size increases, and all other values are held constant, the power increases because the 
variance of the sample mean decreases. Consequently, the probability of a Type II error decreases, which implies the 
power increases. 


9-77. a) Rejecting a null hypothesis provides a stronger conclusion than failing to reject a null hypothesis. 
Therefore, place what we are trying to demonstrate in the alternative hypothesis. 


Assume that the data follow a normal distribution. 


b) 1) the parameter of interest is the mean weld strength, u. 
2) Ho: uw = 150 
3) H,: u > 150 
4) Not given 
5) The test statistic is: 


2X7 Ho 
s/n 


6) Since no critical value is given, we will calculate the P-value 
7) x 2153.7,s- 11.3, n=20 


[o 


(15337-1580 - 


jo ee e 
" — 1134/20 


P-value = P(t > 1.46) = 0.05 « p - value < 0.10 


1.46 


8) There is some modest evidence to support the claim that the weld strength exceeds 150 psi. 
If we used a = 0.01 or 0.05, we would not reject the null hypothesis, thus the claim would not be 
supported. If we used a = 0.10, we would reject the null in favor of the alternative and conclude the 
weld strength exceeds 150 psi. 


pele a) 1) the parameter of interest is the standard deviation, o 
2) Ho : o° = 400 
3) Hj: o^ < 400 
4) Not given 
a — Ds? 
5) The test statistic is: X6 = eS 
o 


6) Since no critical value is given, we will calculate the p-value 
7)n=10,s=15.7 


P-value = P(x” < 5.546); 0.1 < P — value < 0.5 


8) The P-value is greater than any acceptable significance level, a, therefore we do not reject the null 
hypothesis. There is insufficient evidence to support the claim that the standard deviation is less than 
20 microamps. 


b)7)n-751,s- 20 


2 5005." 
LOU i08] 
Ko ~~ 100 
P-value = P(x” < 30.81]; 0.01 < P — value < 0.025 


8) The P-value is less than 0.05, therefore we reject the null hypothesis and conclude that the standard 
deviation is significantly less than 20 microamps. 


c) Increasing the sample size increases the test statistic X6 and therefore decreases the P-value, providing 


more evidence against the null hypothesis. 


9-85 We can divide the real line under a standard normal distribution into eight intervals with equal 
probability. These intervals are [0,.32), [0.32, 0.675), [0.675, 1.15), [1.15, œ) and their negative 
counterparts. The probability for each interval is p = 1/8 = .125 so the expected cell frequencies 
are E = np = (100) (0.125) = 12.5. The table of ranges and their corresponding frequencies is 
completed as follows. 


Interval Obs. Frequency. Exp. Frequency. 


x € 5332.5 l 12.5 
3092.5 9008 5397.5 4 12:5 
5357.5« x < 5382.5 7 12.5 
5382.5< x < 5407.5 24 125 
5407.5< x < 5432.5 30 12.5 
5432.5< x < 5457.5 20 12.5 
5457.5< x < 5482.5 15 12.5 

x 2 5482.5 5 12.5 


The test statistic 1s: 
2 E 2. E 2 B > 
s (1-125) | (4-125) A ,45-125) | (5-125) 


: = 63.36 
12.5 12.5 12.5 12.5 


and we would reject if this value exceeds. Y 40.05,5 =11.07.Since Y; > Y moss , reject the 
hypothesis that the data are normally distributed 


9-87 a) In order to use t statistics in hypothesis testing, we need to assume that the underlying distribution 
is normal. 
1) The parameter of interest is the true mean overall distance for this brand of golf ball, u. 
2) Ho : u 7 270 
3) H,;: u «270 
4) a=0.05 
5) Since n>>30 we can use the normal distribution 


XU 
Zo = 
sin 
6) Reject Ho if Zọ <- z, where Zp 095 =1.65 
7) X=1.25 s=0.25 n=20 
— 260.30— 270.0 — 


— -13.41//100 — 


8) Since —7.23<-1.65, reject the null hypothesis and conclude there is sufficient evidence to indicate that the 
true mean distance is less than 270 yds at a = 0.05. 
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b) The P-value = 0. 


C) We can divide the real line under a standard normal distribution into eight intervals with equal 
probability. These intervals are [0,.32), [0.32, 0.675), [0.675, 1.15), [1.15, œ) and their 
negative counterparts. The probability for each interval is p = 1/8 = .125 so the expected cell 
frequencies are E = np = (100) (0.125) = 12.5. The table of ranges and their corresponding 
frequencies is completed as follows. 


Interval Obs. Frequency. Exp. Frequency. 

x € 244.88 16 12:5 
244.88« x € 251.25 6 12:5 
251.25< x € 256.01 17 12:3 
256.01« x € 260.30 9 12.5 
260.30« x € 264.59 13 12.5 
264.59< x < 269.35 8 12.5 
269.35< x < 275.72 19 12:5 


Xc 73.12 12 12.5 


The test statistic 1s: 


2 2 2 2 
y: -062125 (6-123 y p 019-1237 02-125 15 
12.5 12.5 12.5 12.5 


and we would reject if this value exceeds y 40.05,5 = |1.07 . Since it does, we can 


reject the hypothesis that the data are normally distributed. 


Chapter 10 Selected Problem Solutions 


Section 10-2 


10-1. a) 1) The parameter of interest is the difference in fill volume, [t4 — u» ( note that A970) 
2) Ho: My — Uy =O or ut, =H, 


3) Hy: My — UM, #0 or ui, f Uy 
4) a=0.05 
5) The test statistic 1s 


Zo 
? p 
Ta 
nj; Wy 


6) Reject Ho if Zg So —1.96 or Zo ^ Zgo = 1.96 
7)x;—16.015 Xx, = 16.005 
6, =0.02 © = 0.025 








n; ^ 10 n — 10 
„ = (6015-1600) _ 099 
(0.02)? | (0.025)? 
10 10 


8) since -1.96 < 0.99 < 1.96, do not reject the null hypothesis and conclude there is no evidence that the 
two machine fill volumes differ at œ = 0.05. 
b) P-value — 2(1 — ®(0.99)) = 2(1— 0.8389) = 0.3222 
c) Power = 1— B , where 











B =D} Zy/2 - Ao - | -zg/? — dE. 
01 o5 IE: e 
nj M? 
rr a 
Se 02)? , €. 221 (0.02)? E (0.025)? 
COET 10 10 
-®(1.96 — 3.95)— 9(—-1.96 —3.95) = 6(-1.99)— o(- 5.91) 
= 0.0233 — 0 
— 0.0233 


Power = 1 —0.0233 = 0.9967 


2 2 2 2 

= = O1 O^ ©] O^ 

d) Gu -32)- Zan ot S- u < (xi CE)tfeng t 
nj M n Mm 


(0.02)? . (0.025)? 


i9 SMH § (16.015 — 16.005) + 1.96 


(0.02)? a (0.025)? 
0 


(16.015 — 16.005) — 1.96 
10 


—0.0098 € u, — pt» < 0.0298 
With 9596 confidence, we believe the true difference in the mean fill volumes is between —0.0098 and 
0.0298. Since 0 is contained in this interval, we can conclude there is no significant difference between 


the means. 
e) Assume the sample sizes are to be equal, use a = 0.05, B = 0.05, and A = 0.04 


Can tza) lo? +03) _ (1.96+1.645)' (002 +0.025)") «i. no 


n= 
ô’ (0.04)? 
use n =, =9 


10-5. xX, =30.87 x, =30.68 
0,=0.10 o, =0.15 
nj = 12 n5 = 10 


a) 90% two-sided confidence interval: 


-— "IN oi o 
(Xy — X2) - Za/2 Lor Spy =p $ (Xi = X2) Zur ee oa 
1 2 1 2 


010)? (015) 10 (015^ 
(30.87 — 30.68) — 1.645 C Se «ui - uo < (3087-3068) + L645 iN OE 


0.0987 € u4 — u> € 02813 


We are 90% confident that the mean fill volume for machine | exceeds that of machine 2 by between 
0.0987 and 0.2813 fl. oz. 


b) 95% two-sided confidence interval: 


-- "DN of o 
(xi 7 X5)- Za/2 pum € My - My € (xi - X5) zy = s 
1 2 1 2 





2 2 : : 
(3087 — 30.68) - 196 I + o> < p — Hy S (30.87 - 30.68) + 1.96 — : "Ue 


0.0812 < u, — LL, € 0.299 


We are 95% confident that the mean fill volume for machine | exceeds that of machine 2 by between 
0.0812 and 0.299 fl. oz. 


Comparison of parts a and b: 
As the level of confidence increases, the interval width also increases (with all other values held constant). 


c) 95% upper-sided confidence interval: 
oi 9) 


My -H2 € (X 7 X2) Za | — + — 
nj mn» 


2 2 
e 0) | (015) 


U1 — H2 < (3087 — 30.68) + 1.645 T 


Ui — U2 < 0.2813 


With 95% confidence, we believe the fill volume for machine 1 exceeds the fill volume of machine 2 by 
no more than 0.2813 fl. oz. 


10-7. | xX(,2896 X» =92.5 


01-215 of =12 
nj-15 m =20 


a) 95% confidence interval: 


-— "DEM "ue 
(x1 7 X2) Zgj2 oto S My Ha < (X1 7X») Zar PSU 
1 2 1 2 


10-11. 


15 12 15 12 
89.6— 92.5) — 1.96 "PEST 89.6— 92.5) + 1.96,| — + — 
( ) lasts uU — My <( ) as" 25 


-3.684 € u, - u> € 2.116 


With 95% confidence, we believe the mean road octane number for formulation 2 exceeds that of 
formulation 1 by between 2.116 and 3.684. 


b) 1) The parameter of interest is the difference in mean road octane number, u4 — [i5 and Ag = 0 
2) Ho: Hy = H2 70 or u =o 
3) Hi : Wy 7 H5 <0 or Wy < p2 


4) o, — 0.05 
5) The test statistic 1s 





Z0 
O1 o5 
nj Ny 
6) Reject Ho if Zo < —z4 = —1.645 
7) X] = 89.6 X5 = 92.5 
op =15 05 =12 
n =15 m =20 
B (89.6 — 92.5) - 0 217254 
(L5)? 7 (1.2)? 
15 20 


8) Since —7.25 < -1.645 reject the null hypothesis and conclude the mean road octane number for formulation 
2 exceeds that of formulation 1 using a = 0.05. 


c) P-value = P(z < —7.25) =1— P(z < 7.25) 21-1z0 


95% level of confidence, E = 1, and Zp 925 =1.96 


zur 1.96 | 
nz 3 (o? +07 )= Ts (1.54-1.2) 210.37, n2 11, usen =m=11 


Catalyst 1 Catalyst 2 
55.22 X5 = 68.42 

0] = 07 = 

nı = 10 n» = 10 


a) 95% confidence interval on UW; — >, the difference in mean active concentration 


2 2 
Oo Oo 
(xy - x5)- cud Uu € (x1 - X5) zyj3 | — 4 — 
nj Ng 
ORRON 
(65.22 — 68.42) — 1.96 “<u < (65.22 — 68.42) + 1.96 T GE 


—5.83 < Uu — U> S —0.57 


We are 95% confident that the mean active concentration of catalyst 2 exceeds that of catalyst 1 by 
between 0.57 and 5.83 g/l. 


b) Yes, since the 95% confidence interval did not contain the value 0, we would conclude that the mean 
active concentration depends on the choice of catalyst. 


10-13. 


1) The parameter of interest is the difference in mean active concentration, lt — U2 
2) Ho: ui 7H =9 or Wy =W 
3) Hy: Wy 7g» #0 or Wy £u 


4) a — 0.05 
5) The test statistic 1s 


Zo = 
6? 63 
D 22: 
nj M 


6) Reject Ho if Zg SAT —1.96 or Zo ^ Zgo = 1.96 
7)x( = 750.2 X =756.88 8-0 
6, = 20 07 = 20 
nj = 15 n5 — 8 
| (750.2 — 756.88) — 0 
(0) Q0) 
15 8 


Zo = 2385 





8) Since —2.385 « —1.96 reject the null hypothesis and conclude the mean active concentrations do differ 
significantly at o = 0.05. 


P-value = 2 (1— $(2.385)) = 2(1— 0.99146) = 0.0171 


The conclusions reached by the confidence interval of the previous problem and the test of hypothesis 
conducted here are the same. A two-sided confidence interval can be thought of as representing the 
"acceptance region" of a hypothesis test, given that the level of significance is the same for both 
procedures. Thus if the value ofthe parameter under test that 1s specified in the null hypothesis falls outside 
the confidence interval, this is equivalent to rejecting the null hypothesis. 


Section 10-3 


10-17 


a) 1) The parameter of interest is the difference in mean rod diameter, u; — U> , with Ao = 0 
2) Ho: yı -H2 =0 or Wy =o 
3) Hy: Wy 7 uo £0 or u £u» 


4) o, — 0.05 
5) The test statistic 1s 


nj n» 
6) Reject the null hypothesis if to < —ty/2 n,4n,-2 Where —t905539 = —2.042 or tj? ty/2n,4n,-2 Where 
to.025,30 = 2.042 


2 2 
(n; - Dsi + (n5 - D$ 


n; +n -2 


32-035  s-040 7 039) OOM) = 0614 


n, =15 n =17 


7) ) x] = 8.73 X5 = 8.68 


N O88) _ gars 


^ | | 
0.614. | + 
15 17 


8) Since —2.042 < 0.230 < 2.042, do not reject the null hypothesis and conclude the two machines do not 
produce rods with significantly different mean diameters at œ = 0.05. 





b) P-value = 2P (t > 0.230) » 2(0.40), P-value > 0.80 


c) 95% confidence interval: to 925,39 = 2.042 


T 1 1 =a 1 1 
(xi - X2) fa/2, en; -2 (8p) + — < My -M S(xi- X2)  ta/2,a en; 289) + — 
is idi 1 no 


1 |] 1 |] 
(8.73 868) — 2.042(0.614),|— + «ui -H2 $ (8.73 — 8.68) + 2.042(0.643) TET 


— 0.394 < u, — u, € 0.494 


Since zero is contained in this interval, we are 9596 confident that machine 1 and machine 2 do not 
produce rods whose diameters are significantly different. 


10-21. a) 1) The parameter of interest is the difference in mean etch rate, [i4 — ui» , with Ap = 0 
2) Ho: Uy -H2 =O or Wy =o 
3) Hi: 41—42 #0 or p # Uy 


4) a=0.05 
5) The test statistic 1s 


Hy. Ho 
6) Reject the null hypothesis if to < —ty/2 n,4n,-2 Where — t995518 = —2.101 or to > to/2.n,4n,-2 where 
to.025,18 = 2.101 


2 2 
1)X,=9.97 %=104 s - a e 
Dj t n» — 
2 2 
s =0.422  s,-0231 = M = 0.340 


n = 10 n, = 10 
(9.97 —10.4) 


e coe: cl 
0.340 + 
10 10 


8) Since —2.83 < —2.101 reject the null hypothesis and conclude the two machines mean etch rates do 
significantly differ at a = 0.05. 


—2.83 





b) P-value = 2P (f < —2.83) 2(0.005) < P-value < 2(0.010) = 0.010 < P-value < 0.020 


c) 95% confidence interval: t9055;s = 2.101 


T l l TS l l 
(x - X2) ~ ta/2,n, +n, -2 p) |— + — S My -M S(xi- X2) + ta/2n +n, -2 Gp) |— t — 
1 Ny 1 Ny 


l l l l 
9.97 — 10.4) — 2.101(.340),/—+— < u, — u, < (9.97 — 10.4 )+ 2.101(.340), | — + — 
' } ( ) 10 10 ! : ) 10 10 


— 0.749 < u, — u, € —0.111 


We are 95% confident that the mean etch rate for solution 2 exceeds the mean etch rate for solution 1 
by between 0.1105 and 0.749. 


d) According to the normal probability plots, the assumption of normality appears to be met since the data 
from both samples fall approximately along straight lines. The equality of variances does not appear to be 
severely violated either since the slopes are approximately the same for both samples. 


Normal Probability Plot 


.80 
.20 


10.0 10.1 102 103 104 105 106 107 
solution 


Normal Probability Plot 


Probability 
Probability 
3 





9.5 10.0 10.5 
solution 


Anderson-Darling Normality Test Anderson-Darling Normality Test 


erage: 9.97 Average: 10.4 
StDev: 0.421769 A-Squared: 0.269 StDev: 0.230940 A-Squared: 0. 
N: 10 P-Value: 0.595 N: 10 alue: 0.80. 


10-27 a) 1) The parameter of interest is the difference in mean wear amount, u4 —U>. 
2) Ho : Hy —H2 20 or u; =p? 
3) Hy: 4 7 uo #0 or p z Hy 


4) o, — 0.05 
5) The test statistic 1s 


to 
2 2 
S S 
nj Dn» 


6) Reject the null hypothesis if to < —t905557 where —t 975,97 =—2.052 or to > t905557 where 1995557 = 
2.052 since 











y = 26 
(truncated) 
7) x, — 20 X, =15 Ag = 0 


(20—15)-0 
a 


= 3.03 





8) Since 3.03 > 2.056 reject the null hypothesis and conclude that the data support the claim that the two 
companies produce material with significantly different wear at the 0.05 level of significance. 


b) P-value = 2P(t > 3.03), 2(0.0025) < P-value < 2(0.005) 
0.005 < P-value < 0.010 


c) 1) The parameter of interest is the difference in mean wear amount, u; — U2 
2) Ho: Hy - U2 = 9 
3) H,: Ui — U2 >0 
4) a= 0.05 


5) The test statistic 1s 


to 
2 2 
S S 
] Q2 
nj n» 


6) Reject the null hypothesis if to > t99557 where fo 95 96 = 1.706 since 


1)X,=20 % =15 





nj = 23 n5 = 25 
ty = LIS = 903 
2 @) 
25 25 


8) Since 3.03 > 1.706 reject the null hypothesis and conclude that the data support the claim that the 
material from company 1 has a higher mean wear than the material from company 2 using a 0.05 level 
of significance. 

10-29. If œ= 0.01, construct a 99% lower one-sided confidence interval on the difference to answer question 10-28. 
10.005,19 = 2.878 


(x -—X; )- tio, 








2 2 
002) GOD < u,- pi, € (103.5—99.7) - 2.878 


(103.5 — 99.7) — 2.878 
12 13 





— 14.34 € u, — u, € 21.94. 


Since the interval contains 0, we are 99% confident there is no difference in the mean coating thickness 
between the two temperatures; that 1s, raising the process temperature does not significantly reduce the 
mean coating thickness. 


10-31 a) 
Normal Probability Plot for Brand 1...Brand 2 


ML Estimates 


a Brand 1 
Brand 2 


Percent 
Cc 
eo 





b . 1) The parameter of interest is the difference in mean overall distance, U; — [t5 , with Ao = 0 


2) Ho: ui 7-95 =0 oru =p? 
3)H;: uiuo #0 or pui zu 
4) a. — 0.05 


5) The test statistic 1s 


6) Reject the null hypothesis if to < —ta/2,n, +n, -2 Where — £5 055 1g 7 2.101 or to > ty jan, 45, 5» Where 


£0 025.18 = 2.101 


2 2 
7)x,-2757 X, =265.3 s, = (ni - Dsi + = Ds? 
nj T n5 —2 
2 2 
s =8.03 s, = 10.04 = E — 9.09 


n; =10 n, = 10 


- CTS 122059). Les 


l 
9.09 + 
10 10 
8) Since 2.558>2.101 reject the null hypothesis and conclude that the data do not support the claim that 
both brands have the same mean overall distance at œ = 0.05. It appears that brand 1 has the higher mean 
differnce. 





c.)P-value = 2P (f < 2.558) P-value = 2(0.01)=0.02 


d) g = —> 9.975 B=0.95 Power =1-0.95=0.05 
2(9.09 ) 
3 100 +1 
) 1-B=0..75 =0.27 d =———— = 0.165 n*=100 „= ——  — = 50,5 
ae zi 2(9.09) í "=; 


Therefore, n=51 


a l l NA l l 
0 -x)-ts I oe m m (S - etus, [o 
nn N, n, n, 
1 


1 1 ] 
275.7 — 265.3) — 2.101(9.09), | — 4 — < u, — u, € (275.7 — 265.3) - 2.101(9.09), | — + — 
( ) = 2.101(9.09),|— e — S1 - He &( ) 2.10,(9.09) -+ 


1.86 € ui, — u, €18.94 


Section 10-4 


10-37 d = 868.375 sa = 1290, n = 8 where d; = brand 1 - brand 2 
99% confidence interval: 


S — S 
d= vas] Sug Sd- ta 3 | 


868.375 — Ld < ug < 868.375 + ses [m 


J8 


—727.46 € Ug € 2464.21 


Since this confidence interval contains zero, we are 99% confident there is no significant difference between 
the two brands of tire. 


10-39. 1) The parameter of interest is the difference in blood cholesterol level, Ug 
where d; = Before — After. 


2) Ho: Ug =0 
3) Hy: Ug 0 
4) a = 0.05 


5) The test statistic 1s 


T d 
> sq / Vn 


6) Reject the null hypothesis if to > too514 where t99544 = 1.761 





7) d = 26.867 
sq = 19.04 
n=15 


|| 26867 
^ 1904/415 ` 
8) Since 5.465 > 1.761 reject the null and conclude the data support the claim that the mean difference 
in cholesterol levels is significantly less after fat diet and aerobic exercise program at the 0.05 level of 
significance. 


465 


Section 10-5 


10-47. . 1) The parameters of interest are the variances of concentration, o? G5 


2 2 
2) Ho : OF = 0) 


3) H, : o; z o3 
4) a. — 0.05 
5) The test statistic 1s 
pe 
perm. 
S2 
6) Reject the null hypothesis if fo < f0.975,9.15 where f0.975,9.15 = 0.265 or fo > f0.025,9.15 where f0.025,9.15 —3.12 
T) nı = 10 n» = 16 
sı =4.7 S2 — 5.8 
A 
(58) 


8) Since 0.265 « 0.657 « 3.12 do not reject the null hypothesis and conclude there is insufficient evidence to 
indicate the two population variances differ significantly at the 0.05 level of significance. 





0 = 0.657 


10-51 — a) 90% confidence interval for the ratio of variances: 


2 2 2 
S O S 
Er 24e em en 

P 1-a / 2n -Ln, -] — 2 irs 2 a / 2n, -Ln, -] 
So 0» So 








(0.8) 


2 


0.08775 < <b < 3594 
0$ 


b) 95% confidence interval: 


2 2 
c S 
E ij l-a/2,n,-1,n,—-1 = NE = [3 ens 
2 
ME 6 0.104 < - E 9.60 
(0.8)? (0.8) 


o? 
0.0585 < OL — «54 
65 


The 95% confidence interval is wider than the 90% confidence interval. 








c) 90% lower-sided confidence interval: 


2 2 
51 |f 291 
“9 | l-a,n,—-1,n,-1 - 7» 

Oo 





S2 2 
2 2 
= 0243 < Sb 
(0.8) O^ 
2 
0137 < ÊL 
O2 


10-55 1) The parameters of interest are the thickness variances, o? © 
2) Ho: : o; = = Gs 


3) Hy: o? z o3 


4) o. — 0.01 
5) The test statistic 1s 


6) Reject the null hypothesis if fọ < f 0.995104 where f 09951042 =0.1766 or fọ > p 10.12 Where 


Fo005,10,12 = 2.91 
7) nı = 11 n» = 13 


sı = 10.2 s) = 20.1 
10.2)° 

Fics ( y 

(20.1) 


8) Since 0.1766 >0.2575 > 5.0855 do not reject the null hypothesis and conclude the thickness variances are 
not equal at the 0.01 level of significance. 


= 0.2575 


10-59  ]) The parameters of interest are the overall distance standard deviations, 04,0» 


2 
2) Ho: o? = 05 


3) Hy: o; z o3 


4) a. — 0.05 
5) The test statistic 1s 


S] 
fo 7 7 
S2 


6) Reject the null hypothesis if fọ < Toses =0.248 or fy > So.005.9.9 = 4.03 
7) ni = 10 n5 = 10 S, = 8.03 s2 = 10.04 


_ (8.03)? — 
fo = (10.04)? 


8) Since 0.248 « 0.640 « 4.04 do not reject the null hypothesis and conclude there is no evidence to support 


the claim that there is a difference in the standard deviation of the overall distance of the two brands at the 
0.05 level of significance. 


640 


95% confidence interval: 


si Loi [si 
=F Hsc a ne po [i2 
S2 O02 S2 


2 
O 
(0.640)0.248 < — < (0.640)4.03 
2 
o? 
0.159 < — < 2.579 
2 


Since the value 1 is contained within this interval, we are 95% confident there is no significant difference in 
the standard deviation of the overall distance of the two brands at the 0.05 level of significance. 


Section 10-6 


10-61.  1)the parameters of interest are the proportion of defective parts, p; and p> 


2) Ho: Pi = Pr 
3) Hy : Pi z D» 
4) a. — 0.05 


5) Test statistic 1s 


where 





n, tn, 


6) Reject the null hypothesis if z; < —z9955 where —z9955 = —1.96 or Zo > Zo 995 
where Z0.025 — 1.96 





7) n, = 300 ny = 300 
X] = 15 X3 8 
^ i x 15-8 
pı = 0.05 p» = 0.0267 p= BOO — 0.0383 
0.05 — 0.0267 
Zy = -g 
0.0383(1 — 0.0383) + 








300 300 


8) Since —1.96 < 1.49 < 1.96 do not reject the null hypothesis and conclude that yes the evidence indicates 
that there is not a significant difference in the fraction of defective parts produced by the two machines 


at the 0.05 level of significance. 
P-value = 2(1—P(z « 1.49)) = 0.13622 


10-68. a)Power-1-p 





-..300(0.05) + 300(0.01) 
300 + 300 


. [0.05(1 — 0.05) : 0.01(1 — 0.01) "e 
On E = - — =). 
ee 300 300 
1 1 1 1 
1.96 10.0 0.9 —(0.05—0.01 —1.96 0.0 0.9 —(0.05—0.01 
B= Sd 300 300 | ) $ a 300 300 | ) 


0.014 0.014 


= 0.03 q =0.97 








= $(-0.91)- 6(- 4.81) = 0.18141 -0 = 0.18141 
Power = 1 — 0.18141 = 0.81859 


t t 
LE i tp M PP ron 





b)n- 





2 
(p, - p:) 
(0.05 + 0.01(0.95 + 0.99) 


1.96 





i +1.29,/0.05(0.95) + 0.01(0.99) 





= 382.11 


(0.05 — 0.01" 


n = 383 


10-67 95% confidence interval on the difference: 


Pd -PD , 821-2) - Pid pi) , 920 - B2) 
P1 ee < p1 — p? € ($1 - Po) + zo P1 Pi) , P2 P2 
2 


(Pı — P2) ^ Za/2 
070-077) , 06650-06675) . . . (077 05675) +196 [07/0 07D. , 066751 - 06675) 
500 400 500 400 


0.0434 € p, — p, € 0.1616 


Since this interval does not contain the value zero, we are 95% confident there is a significant difference in 
the proportions of support for increasing the speed limit between residents of the two counties and that the 
difference in proportions is between 0.0434 and 0.1616. 


(0.77 — 0.6675) — 196 


Supplemental Exercises 


10-69 a) Assumptions that must be met are normality, equality of variance, independence of the observations and of 
the populations. Normality and equality of variances appears to be reasonable, see normal probability plot. The data 
appear to fall along a straight line and the slopes appear to be the same. Independence of the observations for each 
sample is assumed. It is also reasonable to assume that the two populations are independent. 


Normal Probability Plot Normal Probability Plot 


.999 
.99 
.95 
.80 
.50 
.20 


.05 
.01 
.001 


Probability 
Probability 





14 15 16 17 18 19 20 8 9 10 11 12 13 14 15 
ódiour 1-hour 
Average: 16.3556 Anderson-Darling Normality Test Average: 11.4833 Anderson-Darling Normality Test 
StDev: 2.06949 A-Squared: 0.171 StDev: 2.37016 A-Squared: 0.158 
N:9 P-Value: 0.899 N:6 P-Value: 0.903 
b) x1 =1636 X; =11486 
S] = 2.07 S5 = 2.37 
nj = 9 n = 6 


99% confidence interval:  ta/2,n,+n,-2 = to.005,13 Where to.005,13 = 3.012 


2 2 
E [8(2.07) CEU — 


= [1 1 "IN 1 1 
(Xi ^32) — ta/2,n,+n,-2(Sp) uu <u =u < (Xi = X2) + ta/2,n,+n,-2(Sp) i s 


(16.36 — 11.486) — 3012(219) $ + < uu - Hy < (1636 — 11.486) + 3012(219) $ + 


c) Yes, we are 99% confident the results from the first test condition exceed the results of the second test 
condition by between 1.40 and 8.36 (x10° PA). 


10-73 a) 1) The parameters of interest are the proportions of children who contract polio, p; , p2 
2) Ho : pi = po 
3) Hi : pi Z po 
4) a. — 0.05 
5) The test statistic 1s 





6) Reject Ho if Zo < —zg;5? OF Z > Zs. Where zg5- 1.96 
. 11 . 
7) Pi = a MN 0.00055 (Placebo) p-2——— 
n, 201299 Ty 
^ X» 33 
= = 
n, 200745 


= 0.000356 








— 0.00016 (Vaccine) 


0.00055 — 0.00016 26.55 





Zo — 











l l 
0.000356 (1 — 0.000356) + 
201299 200745 
8) Since 6.55 > 1.96 reject Hp and conclude the proportion of children who contracted polio is 
significantly different at o = 0.05. 

b)a=0.01 Reject Ho if Zo < —Zzg/5 OF Zo > Zg;5 Where zg,5 =2.33 

Zo = 6.55 

Since 6.55 > 2.33, reject Hy and conclude the proportion of children who contracted polio is different at 

a= 0.01. 
c) The conclusions are the same since Zg is so large it exceeds Zan in both cases. 


10-79. 








2 
2,575, 42 * NS *04) (128. [0.9(0.1) +0.6(0.4) 
n = 2 
(0.9 — 0.6) 
- 5.346 59 4 
0.09 
n = 60 
10-81. Ho : u =) 
Hı : u; # u2 
Nn; = Nn nN 
B=0.10 
a= 0.05 
Assume normal distribution and 01 = o = 0° 
U =H, tO 
J= Hhh | o l 
20 20 2 


From Chart VI (e), n° = 50 
n +1 50+1 
n = = 
2 


Nn; — 1» —26 





= 25.5 





10-83 . a) No. 


Normal Probability Plot Normal Probability Plot 





© D 

a s 

[ts] © 

a Q 

O O 

[us [um 

A A 

23.9 24.4 24.9 
mercedes volkswag 

Average: 24.67 Anderson-Darling Normality Test Average: 40.25 Anderson-Darling Normality Test 
StDev: 0.302030 A-Squared: 0.934 StDev: 3.89280 A-Squared: 1.582 
N: 10 P-Value: 0.011 N: 10 P-Value: 0.000 


b) The normal probability plots indicate that the data follow normal distributions since the data appear to fall 
along a straight line. The plots also indicate that the variances could be equal since the slopes appear to be 


the same. 


Normal Probability Plot 


Normal Probability Plot 


.999 


.99 
.95 





i | 
£^ 80 
a | 2 
S .50 a 
S. rr g 
il 
a E | [e] 
.05 | a 
.01 | 
.001 
24.5 24.6 24.7 24.8 24.9 39.5 40.5 415 425 
mercedes volkswag 
Average: 24.74 Anderson-Darling Normality Test Average: 41.25 Anderson-Darling Normality Test 
StDev: 0.142984 A-Squared: 0.381 StDev: 1.21952 A-Squared: 0.440 
:10 P-Value: 0.329 N: 10 P-Value: 0.230 


c) By correcting the data points, it is more apparent the data follow normal distributions. Note that one 
unusual observation can cause an analyst to reject the normality assumption. 


d) 95% confidence interval on the ratio of the variances, Ov i ot 





sy = 149 fo 9,0.025 = 4.03 
5, = 0.0204 f : | 20248 
M -— V 9.9.0[075 — 7 —'4A4. Ww 
foo 9025 4.03 
Sy ov [sy 
=~ 110309759 >~ < | =~ [19.9.0025 
SM OM SM 
1.4 : 14 
Cons oss E ov < l jos 
0.0204 o2, 0.0204 
c? 
18124 « —- « 294.35 
OM 


Since the does not include the value of unity, we are 95% confident that there is evidence to reject the claim 
that the variability in mileage performance is different for the two types of vehicles. There is evidence that the 
variability is greater for a Volkswagen than for a Mercedes. 


10-85 a) Underlying distributions appear to be normal since the data fall along a straight line on the normal 


probability plots. The slopes appear to be similar, so it is reasonable to assume that O : =O : 


Normal Probability Plot for tip 1...tip2 


ML Estimates 





tip1 
tip2 


Percent 














40 45 50 
Data 


b) 1) The parameter of interest is the difference in mean volumes, |t, — U2 
2) Ho: Hy 7-9» = 9 org = po 
3) Hi: Wy Ho #0 or Wy pu» 


4) a — 0.05 
5) The test statistic 1s 


_ (Q3 7 X2)-0 
l l 
+ 


nj 1n» 


to 


6) Reject Ho if tg < —ty/2,y Or Zo > ta/2,v where ta/2,v = t0.025,18 = 2.101 


" = 1.252)* + 9(0.843)* 
73,2 7527] % = 155.6 sp = a = 1.07 


sı = 1.252 S2 = 0.843 
n; = 10 n = 10 
"m (752.7 — 755.6) — 0 


0 — —6.06 
| 1 1 
1.07, — + — 
10 10 


8) Since —6.06 < —2.101, reject Ho and conclude there is a significant difference between the two wineries 
with respect to the mean fill volumes. 

















10-89 a.) The data from both depths appear to be normally distributed, but the slopes are not equal. 
Therefore, it may not be assumed that O g =O s . 
Normal Probability Plot for surface. ..bottom 
ML Estimates 
a Surface 
bottom 
€ 
(ab) 
2 
oO 
A 
4 5 6 7 8 
Data 
b.) 


1) The parameter of interest is the difference in mean HCB concentration, U4 — U> , with Ao = 0 
2) Ho: Wy -H2 =0 org, = po 

3) Hi: pi - Ho #0 or u £p» 

4) at — 0.05 

5) The test statistic 1s 


to (X1 — X2) - Ag 
LEES 
nj fy 


6) Reject the null hypothesis if to < — £505, 15 where — 


£0 025,15 = —2.131 or to > C0 025,15 where C0 025,15 B 
2.131 since 











y zxl5 
(truncated) 
7) xX, =4.804 x» =5.839 sı =0.631 s, =1.014 
n, =10 n; = 10 
- (4804-5.839) a, 
(0.631) " (1.014)? 





10 10 


8) Since —2.74 < -2.131 reject the null hypothesis and conclude that the data support the claim that 
the mean HCB concentration is different at the two depths sampled at the 0.05 level of significance. 


b) P-value = 2P(t < -2.74), 2(0.005) < P-value < 2(0.01) 


0.001 < P-value < 0.02 


2 
ce) A2. 02005 m; 93,910 d =——=1 
2(1) 
From Chart VI (e) we find B = 0.20, and then calculate Power = 1- B = 0.80 


2 
d)A-2 a=0.05 co ee p =0.0 


50 +1 
From Chart VI (e) we find n*=50 and n = 





= 25.5 , SO n=26 


Chapter 11 Selected Problem Solutions 
Section 11-2 


11-1. a)y; =Bo +B) x) +£; 
S = 157.42 — 45 


14 
= 25348571 

Syy = 1697.80 - BC) 
= —59.057143 

a S — 

gj, — a =59.057143 oa 





S. 2534857 


By = y-Bix = 322 — (723298017) (8) = 48.013 
b)y- B, + Bx 
y = 48.012962 — 2.3298017(4.3) = 37.99 


o) $ = 48.012962 — 2.3298017(3.7) = 39.39 
de= y- $ 246.1-39.39 2 6.71 


11-5. a) 

Regression Analysis - Linear model: Y = a+bX 

Dependent variable: SalePrice Independent variable: Taxes 
Standard T Prob. 

Parameter Estimate Error Value Level 

Intercept 13.3202 2.57172 5.17948 .00003 

Slope 3.32437 0.390276 8.518 .00000 

Analysis of Variance 

Source Sum of Squares Df Mean Square F-Ratio Prob. Level 

Model 636.15569 1 636.15569 72.5563 .00000 

Residual 192.89056 22 8.76775 

Total (Corr.) 829.04625 23 

Correlation Coefficient = 0.875976 R-squared = 76.73 percent 

Stnd. Error of Est. = 2.96104 

A2 

O° = 8.76775 


If the calculations were to be done by hand use Equations (11-7) and (11-8). 
y = 13.3202 + 3.32437x 


b) $ 2133202 + 3.32437(7.5) = 38253 
c) ¥ = 13.3202 + 3.32437(5.8980) = 32.9273 


$ 232.9273 
e = y — 9 = 30.9 — 32.9273 = 2.0273 


d) All the points would lie along the 45% axis line. That is, the regression model would estimate the values exactly. At 
this point, the graph of observed vs. predicted indicates that the simple linear regression model provides a reasonable fit 
to the data. 


Plot of Observed values versus predicted 





ke. 
in 
2 
D 
= 
a 
25 30 35 40 45 50 
Observed 
11-9. a) Yes, a linear regression would seem appropriate, but one or two points appear to be outliers. 
9 
8 
7 
6 
5 
> 
4 
3 
2 
1 
0 
60 70 80 90 100 
X 
Predictor Coef SE Coef T P 
Constant -9.813 2.135 -4.60 0.000 
X 0.17148 0.02566 6.68 0.000 
S = 1.408 R-Sq = 71.3% R-Sq(adj) = 69.7% 


Analysis of Variance 


Source DF SS MS F P 
Regression 1 88.520 88.520 44.66 0.000 
Residual Error 18 35.680 1.982 

Total 19 124.200 


b) 6° =1.9818 and $ — —9.8131-- 0.171484x 
c) P = 4.76301 at x = 85 


11-11. a) Yes, alinear regression would seem appropriate. 





40 
30 
> 20 
10 
0 
00 02 04 06 08 10 12 14 16 «18 
X 
Predictor Coef SE Coef T P 
Constant 0.470 1.936 0.24 0.811 
x 20.567 2.142 9.60 0.000 
S = 3.716 R-Sq = 85.2% R-Sq(adj) = 84.3% 


Analysis of Variance 


Source DF SS MS F P 
Regression 1 1273.5 127345 92.22 0.000 
Residual Error 16 220.9 13.8 

Total 17 1494.5 


^ =13.81 


b) O 
y = 0.470467 + 20.5673x 


c) Y = 0.470467 + 20.5673(1) = 21.038 
d)y =10.1371 e=1.6629 


Section 11-4 


11-21. Refer to ANOVA of Exercise 11-5 
a) 1) The parameter of interest is the regressor variable coefficient, B4. 


2) Hy: p, 0 

3 H,: B, #0 

4) a= 0.05, using t-test 

BL 
se(B,) 


6) Reject Ho if to < —tw/2,n-2 where —to.025,22 = —2.074 or to > to.025,22 = 2.074 
7) Using the results from Exercise 11-5 


ae 


0 = = 
0.390276 
8) Since 8.518 > 2.074 reject Hg and conclude the model is useful o = 0.05. 


5) The test statistic is f, = 


b) 1) The parameter of interest is the slope, D, 
2)Hog:B, 20 
3)H;:B #0 
4) a. — 0.05 
MSk .J $SSg/l 
MS;  SSp /(n—2) 
6) Reject Ho if fo > fo,1,22 where fo.01,1,22 = 4.303 
7) Using the results from Exercise 10-5 


1 1 
fj = 3615560911 455563 
192.89056 / 22 


8) Since 72.5563 > 4.303, reject H ; and conclude the model is useful at a significance o = 0.05. 


The F-statistic is the square of the t-statistic. The F-test is a restricted to a two-sided test, whereas the 
t-test could be used for one-sided alternative hypotheses. 


A £ | 8.7675 
c) se(B,)= = a 
se( B, ) = E 8. 775| 49409 | a 5717 
E t5 sai 


d) 1) The l of interest is the intercept, Bo. 
2 H, : B, 20 


3) H,: p, #0 
4) a= 0.05, using t-test 





5) The test statistic is fọ = 





Bo 
se( D, ) 
6) Reject Ho if to < —tw2,n-2 where —to.025,22 = —2.074 or to > to.025,22 = 2.074 
7) Using the results from Exercise 11-5 
ji e 255774 
2.5717 
8) Since 5.2774 > 2.074 reject Hg and conclude the intercept is not zero at a = 0.05. 


5) The test statistic is f, = 


11-25. Refer to ANOVA of Exercise 11-9 
a) H ^s D, —0 


H,:p, z0 
a = 0.05 
fo = 44.6567 


P os118 = 4.416 
fo > Foss 


Therefore, reject Ho. P-value = 0.000003. 
b) se(B, ) = 0.0256613 

se( B, ) = 2.13526 
c) H, N Bo =0 

H : 8, #0 

œ= 0.05 


f, — —4.59573 
tonsig = 2.101 


| fo > [5/218 
Therefore, reject Ho. P-value = 0.00022. 


Sections 11-5 and 11-6 


11-31. tw/2n-2 = to.025,12 = 2.179 
a) 95% confidence interval on D, . 


Ê, t ACT) 
— 2.3298 t Í 025,12 (0.2697) 
— 2.3298 + 2.179 (0.2697 ) 
—2.9175.< B, € 1.7421. 
b) 95% confidence interval on Bo. 
B, t tos io Se (Bs) 
48.0130 + 2.179(0.5959) 
46.7145 € D, € 49.3114. 
c) 95% confidence interval on u when xp 225. 
By, = 48.0130 — 2.3298 (2.5) = 42.1885 


^ EY mx). 
Myx, Etos 0 Gt s) 


42.1885 + (2.179)4/1.844 (d, + CTD) 
42.1885 + 2.179(0.3943) 
41.3293 < fiy, € 43.0477 


d) 95% on prediction interval when x =2.5. 


Vo ots nedo se) 
(2.5-3.0714)? 
42.1885 + 2.1794/1.844(1-- ++ S) 


42.1885 + 2.179(1.1808) 
38.2489 € y, € 46.1281 


It is wider because it depends on both the error associated with the fitted model as well as that with the 
future observation. 


11-35. 99 percent confidence intervals for coefficient estimates 
Estimate Standard error Lower Limit Upper Limit 
CONSTANT -6.33550 1.66765 -11.6219 -1.05011 
Temperature 9.20836 0.03377 9.10130 9.93154 


a) 9.10130 < B, € 9.31543 
b) 211.6219 < By € -1.04911 


2 
c) 500.124 + (2228)43.774609 4. + Go 


500.124 + 1.4037586 
498.72024 € By, < 501.52776 


2 
d) 500.124 + (2.228),/3.774609(1 +b + 750 - 


11-41 


500.124 + 4.5505644 


495.57344 € y, € 504.67456 


It is wider because the prediction interval includes error for both the fitted model and from that associated 


with the future observation. 


a) — 43.1964 < B, < -30.7272 


b) 2530 


c) 1886 .154 + (2.101)4/9811 .21(4 


1886 
1823 


d) 1886 
1886 
1668 


Section 11-7 


11-43. 


.09 < B, € 2720 .68 


.154 + 62.370688 


7833 € p... € 1948 .5247 


.154 € (2.101)4/9811 .21(1 + 4+ 


.154 + 217.25275 


9013 < y, € 2103 .4067 


(20-13.3375)? 
+ T14 6618 -) 


(20—13.3375 )? ) 
1114 .6618 


Use the Results of exercise 11-5 to answer the following questions. 
Predicted 


a) SalePrice 


25.9 
297; 
Zi 
25x 
29. 
29. 
30. 
28. 
353 
od. 
31. 
30. 
30. 
36. 
41. 
40. 
43. 
S. 
34. 
44. 
LEE 
395 
36. 
45. 


OO XO XO UO Ul XO Ul XO UI O O OW O UI «XO WOO XO XO OO Wl UI 


b) Assumption of normality does not seem to be violated since the data appear to fall along a straight line. 


('00 0000 -12UI 'O-12000U01U1 OY U1U1 U1 U1 t0 U1 dH» d» WB 


29. 
.0111824 
.4224654 
.4703363 
.1405004 
.2553078 
.9273208 
SL. 
49952797 
.9403441 
1679762 
. 1307723 
.1082540 
. 7342742 
.5831610 
Siok 
2:367 1762 
22311655 
43932520 
.5583567 
.5426619 
41. 


30 
28 
28 
30 
26 
32 


32 
30 
34 
33 
30 
40 
35 


43 
33 
38 
42 
33 


40 
43 


6681073 


9496232 


1974174 


1142499 


.3805611 
. 7102513 


Residuals 


-3 


. 76810726 
.51118237 
.52246536 
.57033630 
.24050041 
.64469225 
.02732082 
.04962324 
.20472030 
.55965587 
.16797616 
.23077234 
.10825401 
. 83427422 
.31683901 
.30258260 
.53282376 
.26883165 
.49325200 
.94164328 
.35733807 
.21424985 
.48056112 
.08974865 


Normal Probability Plot 





99.9 


cumulative percent 














MN 
a 
" 
ra 
m 
m 
= 


LLL LN 


Residuals 


C) No serious departure from assumption of constant variance. This is evident by the random pattern of the 





residuals. 
Plot of Residuals versus Predicted Plot of Residuals versus Taxes 
c T 
8 SA el te g eN 
Creaa a reese Pee ee Pees 2 
26 29 32 35 38 41 44 3.8 4.8 5.8 6.8 7.8 8.8 9.8 
Predicted Values Taxes 


d) R? = 76.73% ; 


11-47. a) R° =71.27% 


b) No major departure from normality assumptions. 


Normal Probability Plot of the Residuals 


(response is y) 


2 -1 0 1 2 
Norma Scare 


c) Assumption of constant variance appears reasonable. 


Resduals Versus x 
(response is y) 


Residuals Versus the Fitted Values 


(response is y) 


Residual 
Residual 





x Fitted Vd ue 


11-49. a) R^ =85.22% 
b) Assumptions appear reasonable, but there is a suggestion that variability increases with yl. 


Residuals Versus x 


. Residuals Versusthe Fitted Values 
(response is y) 


(response is y) 





X Fitted Value 


c) Normality assumption may be questionable. There is some “ bending” away from a straight line in the 
tails of the normal probability plot. 


Normal Probability Plot of the Residuals 


(response is y) 


Residual 


Normal Score 


Section 11-10 


11-55. a) y 2 —0.0280411 + 0.990987 x 
b) H,:B, =0 
H,:p, #0 a = 0.05 
fy = 79.838 
Josias = 4-41 
fo >> fas 


Reject Hy, ; 


c) r = 0.816 = 0.903 


d H,:p-0 
H,:p #0 a = 0.05 
ie R4n-2 0.90334 v18 — peas 
vi-R?  4A-0.816 
toosig = 2-101 
fo > by i218 
Reject IH: 
e) H,:p-0.5 
H,:p 0.5 a = 0.05 
2,553,879 
Z5 = 1.96 
Zo > Ze) 
Reject Ho . 


f) tanh(arctanh 0.90334 - 7) < p < tanh(arctanh 0.90334+ ==) where Z 925 = 196 . 
0.7677 € p € 0.9615 . 


11-59 n-50 r=0.62 


a) H,:p=0 
H,:p #0 a= 0.01 
|. r4n-2. _ 0.62448  . 
fg Jar aeur 5.475 


t oos 48 = 2-683 


fo > £0 005 ,48 


Reject /7 o. P-value = 0 


b) tanh(arctanh 0.62 - 777) < p < tanh(arctanh 0.62 + 77) where Zoos = 2:515. 
0.3358 < p < 0.8007 . 


c) Yes. 


11-61. a)r-0.933203 


H,:pz0 a = 0.05 
EZ r4n-2 _ 0.933203 415 _ 10.06 
9 imc OT), 7 
Loos 15 = 2.131 
fo > Í54515 
Reject Ho. 


c) y = 0.72538 + 0.49808 1x 
AM st py =0 
H,: B, #0 a = 0.05 


fa = 101 .16 
Pos 15 = 4545 


fo 22 a 1,15 
Reject Ho. Conclude that the model is significant at œ = 0.05. This test and the one in part b are identical. 


d H,:B, =0 
H,:B, #0 a = 0.05 
t, = 0.468345 
los 15 = 2.131 
[o ? Lo 1245 


Do not reject Hy. We cannot conclude Bg is different from zero. 


e) No serious problems with model assumptions are noted. 


Residuals Versus x Residuals Versus the Fitted Values 


(response is y) (response is y) 


Residual 
Residual 





5 15 25 
x Fitted Value 


Normal Probability Plot of the Residuals 


(response is y) 


Residud 


Supplemental 
11-65. a) Y 2 93.34 4 15.64x 
b) di; iD 
H,: B, #0 a= 0.05 
f, =12.872 


Joosian = 4.60 
fo > Fo.0s.1.14 


Reject A, . Conclude that B, + 0 at a= 0.05. 
c) (7.961 < B, < 23.322) 
d) (74.758 < B, < 111.923) 
e) y 2 93.34 +15.64(2.5) = 132.44 


11-67 


11-71 


132.44 + 2.1454/136.27 L + E 


132.44 + 6.26 
126.18 € fly, 5; $138.70 


a) 





b) y =—0.8819 + 0.00385x 
c) Hy: pj 20 
H,:p, #0 a = 0.05 
fo =122.03 
fo > fous 
Reject Hg . Conclude that regression model is significant at ot = 0.05 


d) No, it seems the variance is not constant, there is a funnel shape. 


Residuals Versus the Fitted Values 


(response is y) 


Residual 





Fitted Val ue 


e) y = 0.5967 + 0.00097 x . Yes, the transformation stabilizes the variance. 


a) 





110 + 
100 — 
90 + . . 
80 — s . 

70 — 


days 


60 4 . ° 
50 4 
40 + 








30 — 





b)The regression equation is 
y =-1934+15.296x 


Predictor Coef 
Constant -193.0 
x 15.296 
S = 23.79 R-Sq - 15. 


Analysis of Variance 


Source DE 
Regression 1 

Error 14 
Total 15 


SE Coeff T P 
163.5 -1.18 0.258 
9.421 1.62 0.127 

8% R-Sq(adj) = 9.8% 

SS MS F P 
1492.6 1492.6 2.64 0.127 
7926.8 566.2 
9419.4 


Cannot reject H,; therefore we conclude that the model is not significant. Therefore the seasonal 
meteorological index (x) is not a reliable predictor of the number of days that the ozone level exceeds 0.20 


ppm (y). 
95% CI on B, 
B, X t, se (B) 
15.296 + tos 12 (9.421) 
15.296 + 2.145 (9.421) 
— 4.912 < f, € 35.504 


c) 


d) 


The normality plot of the residuals is satisfactory. However, the plot of residuals versus run order 


exhibits a strong downward trend. This could indicate that there is another variable should be included in the 


model, one that changes with time. 





Residual 














Residual 











| 
930 
Fitted Value 


11-75 a) 





940 — 


> 
930 — 


920 — 











b) y = 33.3 + 0.9636x 


c) Predictor Coef 
Constant 33.3 
xX 0.9639 
S = 4.805 R-Sq = 


Analysis of Variance 


Source DF 
Regression 1 
Residual Error 8 
Total 9 


940 


SE Coef T P 
171.7 0.19 04051 
0.1848 5.22 0.001 

.3$ R-Sq(adj) = 74.4% 

SS MS F P 
628.18 628.18 27.21 0.001 
184.72 23.09 
812.90 


Reject the hull hypothesis and conclude that the model is significant. 77.3% of the variability is explained 


by the model. 
d H,:p, =1 
H,:D, #1 a=.05 
$, —1 0.9639 — 
R. CAEL UD 
se( D,) 0.1848 
[2,2 = bos = 2.306 


Since fo > —1,,» ,_7» We cannot reject H, and we conclude that there is not enough evidence to reject the 


claim that the devices produce different temperature measurements. Therefore, we assume the devices 


produce equivalent measurements. 


e) The residual plots to not reveal any major problems. 





Normal Score 











T | 
-5 0 


Residual 


Residual 














Fitted Value 


Section 12-1 


Chapter 12 Selected Problem Solutions 


10 223 553 
12-1. a) X’X =| 223 5200.9 12352 
553 12352 31729 
1916.0 
X'y =| 43550.8 
104736.8 
171.054 
b B=] 3.713 |,so y 2171.054 4 3.714x, —1.126x, 
—1.126 
c) $ =171.054+3.714(18) —1.126(43) = 189.481 
12-5. Predictor Coef StDev T P 
Constant 33.449 1.576 21.22 0.000 
xl -0.054349 0.006329 -8.59 0.000 
x6 140782 0.6997 1.54 0.138 
S = 2.834 R-Sq = 82.9% R-Sq(adj) = 81.3% 
Analysis of Variance 
Source DF SS MS F P 
Regression 2 856.24 428.12 53.32 0.000 
Error 22 176.66 8.03 
Total 24 1032.90 
a)  ĵ=33.4491-0.05435x, +1.07822x, 
b) 6? =8.03 
c) p=33.4491-0.05435(300) + 1.07822(2) = 19.30 mpg. 
12-7. Predictor Coef SE Coef T P 
Constant 383.80 36.22 10.60 0.002 
X1 -3.6381 0.5665 -6.42 0.008 
X2 -0.11168 0.04338 -2.57 0.082 
S = 12.35 R-Sq = 98.5% R-Sq(adj) = 97.5% 
Analysis of Variance 
Source DF SS MS F P 
Regression 2 29787 14894 97.59 0.002 
Residual Error 3 458 153 
Total 5 30245 


a) $ =383.80—3.6381x, —0.1119x, 
b) 6? 2153.0, se(B,) = 36.22, se( B,) = 0.5665, and se(B,) =.04338 


c) y = 383.80 — 3.6381 (25) — 0.1119 (1000) 2180.95 


d) Predictor 
Constant 

X1 

X2 

X1*X2 O 
S ue T22 
Analysis of Va 
Source 
Regression 
Residual Error 
Total 


Coef SE Coef T P 
484.0 101.3 4.78 0.041 
-7.656 3.846 -1.99 0.185 
-0.2221 0.1129 -1.97 0.188 
.004087 0.003871 1.06 0.402 
R-Sq = 99.0% R-Sq(adj) = 97.6% 
riance 
DF SS MS F 
3 29951.4 9983.8 67.92 
2 294.0 147.0 
5 30245.3 


y = 484 .0 — 7.656 x, — 0.222 x, — 0.0041 x, 


0.015 


e) 6° =147.0, se(B,) 2101.3, se(B,) 23.846, se(B,) = 0.113 and se(B,,) = 0.0039 
f 9 = 484 .0 — 7.656 (25) — 0.222 (1000 ) — 0.0041 (25)(1000 ) = -31 .3 


The predicted value is smaller 


12-9. Predictor Coef SE Coef T P 
Constant 47.17 49.58 0.95 0.356 
xl -9.735 3:692 -2.64 0.018 
x2 0.4283 0.2239 1.91 0.074 
x3 18.237 1.312 13.90 0.000 
S - 3.480 R-Sq = 99.4% R-Sq(adj) = 99.3% 


Analysis of Variance 


Source DF SS MS F P 
Regression 3 30532 10177 840.55 0.000 
Residual Error 16 194 12 

Total 19 30725 


a) y-47174—9 7352x; 4 04283x» +182375x3 

b) G7 12 

c) se(f,) 49.5815, se(f,) 23.6916, se(B,) = 0.2239 , and se(B,) 21.312 
d) y-247174—9 735Q 14.5) - 04283(220) + 18.2375(5) = 9143 


Section 12-2 


12-13. n=10, k=2, p=3, a-0.05 
Ho D epe =0 


H,:B, #0 for at least one j 





2 
SS, = 371595.6- OL = 4490 
Y» 1030 
X'y =| X x,y, |=] 21310 
Nx! [44174 
1916 
â' X'y =[171.054 3.713 -1.126] 43550.8 | =371535.9 
104736.8 
2 
SS, 3715359 - 219 = 4430.38 


SS, = SS, — SS, = 4490 — 4430.38 = 59.62 
=" ^ 4430.38/2 








= + = ——— = 260.09 
^ c 10910277 
Joos,2,7 = 4.74 
Jo > fo.0s.2,7 


Reject Ho and conclude that the regression model is significant at œ = 0.05. 


eI Doe B, =0 


Hp. 40 B, #0 
EN j. 





0 — ^ [y = ^ 
se( p,) se( D, ) 
2313 1959 _ 71.126 1308 
0.1934 0.0861 
ta/2,7 710257 = 2.365 
Reject Ho Reject Ho 


Both regression coefficients are significant 


247. uw Ho pi =P. =0 
H, : atleast one D # 0 
J = 03-3162 
fuam = fosa = 3-44 
fo > fom 


Reject Hg and conclude regression model is significant at ox = 0.05 


bia P= 0 
H,: B, #9 
f, = —8.59 


[925.253 = 1925.22 = 2.074 

| lo > Ly 12,22» Reject Ho and conclude f is significant at œ = 0.05 
A, : B, =9 

AH, : B, #0 

a= 0.05 

f, — 1.5411 


| lo |» L/2.22 » Do not reject Ho, conclude that evidence is not significant to state D; is significant at ot = 0.05. 


No, only x; contributes significantly to the regression. 


12-21. a) Hy: P, =f, =f, 50 
H; at least one B, # 0) 


a= 0.05 
fy = 67.92 


Jass = f o532 — [9.16 
Jo ? DE 


Reject Ho 


b) Hy: By = 0 
H,: B,, #0 
a= 0.05 


12-23. 


SSR(B,, | B,, B.) = 29951.4 29787 = 164.4 
_ SSR 1644 _ 
" MS, 153 
dioses 801 
fo ? fao 


Do not reject Ho 


1.07 





o) 6^ «147.0 
ô? (no interaction term) = 153.0 


MSg (ô? ) was reduced in the interaction term model due to the addition of this term. 


a) Hy: B, = B, =p; =0 forallj 
H,:p,20 for at least one j 


fo = 840.55 
fos316 = 3.24 
fo > Taaie 
Reject Ho and conclude regression is significant at a = 0.05 

b) a = 0.05 ta/2,n-p = t.025,16 = 2.12 
H,:p, =0 p, = 0 pi = 0 
H,:8, z 0 D, #0 Ba #0 
f£, = —2.637 t£, =1.91 ty = 13.9 
Ito? ta/2,16 Ito| to» 16 Itol> ta/2,16 
Reject Ho Do not reject Ho Reject Ho 


Sections 12-3 and 12-4 


12-27. 


12-29. 


a) — 0.00657 < D, < —0.00122 


5 (62x, (X'X) 1x, — 0.497648 = se(fi,,.. ) 
o) fly, —7.63449 + 0.00398(2000) + 0.24777(60) — 0.00389(1800) = 8.19 
fy, E £ 025,24 se(ly. ) 
8.19 + (2.064)(0.497648) 
8.19+1.03 
7.16 < Uyy, S 9.22 


a) 95 % CI on coeficients 
B, Etain- (By) 
0.0972 < B, € 1.4174 
—1.9646 < D, €17.0026 
—1.7953 € D, € 6.7613 
—1.7941 < D, € 0.8319 


12-31 


12-35. 


b) fly. — 290.44 se(y.. ) — 7.61 
By. t Lars sel Ày ) 
290.44 + (2.365)(7.61) 
272.44 < uy, € 308.44 


c) Yo df dans Vô (+x, (X'X) xo) 


290.44 + 2.365(14.038) 
257.25 € y, € 323.64 


a)95% Confidence Interval on coefficients 
—0.595 < B, € 0.535 
0.229 < B, < 0.812 
-0.216 < B, € 0.013 
-7.982 < B, < 2.977 


b) fly. =8.99568 se(My,, ) 2 0.472445 
B. de Lions se(fly. ) 
8.99568 + (2.145)(0.472445) 
7.982 < fly, S 10.009 

c) yg = 8 99568 se(y,) 21.00121 
8.99568 + 2.145(1.00121) 
6.8481 € y, € 11.143 


a) 0.3882 < By, < 0.5998 
b) P ——5.767103 + 0.49650 1x, 


c) 0.4648 € B, < 0.5282 


fs; = 2.365 


toosig = 2.145 


d) The simple linear regression model has the shorter interval. Yes, the simple linear regression model in this case is 


preferable. 


Section 12-5 


12-37. 


a) r^ = 0.82897 


b) Normality assumption appears valid. 


Normal Probability Plot of the Resduals 


(response is y) 

5 
E 
5 0 
d 

5 

2 1 0 1 2 
Normal Score 
c) Assumption of constant variance appears reasonable. 
Res duals Versus x6 Res duals Versus xl 
(response is y) (response is y) 


Residual 





Residuals Versus the Fitted Values 
(response is y) 


Residual 





Fitted Value 


d) Yes, observations 7, 10, and 18 


12-39. a) r° =0.985 
b) r^ = 0.990 
r^ increases with addition of interaction term. No, adding additional regressor will always increase r? 


12-41 a) There is some indication of nonconstant variance since the residuals appear to “fan out" with increasing 
values of y. 


Residual Plot for y 








o0 

2 

3 

ke 

7) 

® 

x 

0 40 80 120 160 200 240 
Predicted 
b) 
Source Sum of Squares DF Mean Square F-Ratio P-value 
Model 30531.5 3 10177.2 840.546 .0000 
Error 193.725 16 12.1078 
Total (Corr.) 30725.2 19 
R-squared = 0.993695 Stnd. error of est. = 3.47963 
R-squared (Adj. for d.f.) = 0.992513 Durbin-Watson statistic = 1.77758 
R? = 0.9937 or 99.37 %; 
Jdem = (0.9925 or 99.25%; 
c) Model fitting results for: log(y) 

Independent variable coefficient std. error t-value sig.level 
CONSTANT 6.22489 1.124522 5.5356 0.0000 
x1 -0.16647 0.083727 -1.9882 0.0642 
x2 -0.000228 0.005079 -0.0448 0.9648 
x3 0.157312 0.029752 5.2875 0.0001 
R-SQ. (ADJ.) = 0.9574 SE= 0.078919 MAK= 0.053775. DurbWat= 2.031 
Previously: 0.0000 0.000000 0.000000 0.000 


20 observations fitted, forecast(s) computed for 0 missing val. of dep. var. 


y = 6.22489 — 0.16647x, — 0.000228x, + 0.157312x4 


d) 


Residual Plot for log(y) 


Residuals 





4 4.3 4.6 4.9 5.2 5.5 
Predicted 


Plot exhibits curvature 


There is curvature in the plot. The plot does not give much more information as to which model is 
preferable. 


Residual Plot for log(y) 


Residuals 





Plot exhibits curvature 
Variance does not appear constant. Curvature is evident. 


f) 
Model fitting results for: log(y) 

Independent variable coefficient std. error t-value Sig.level 
CONSTANT 6.222045 0.547157 11.3716 0.0000 
x1 -0.198597 0.034022 -5.8374 0.0000 
x2 0.009724 0.001864 5.2180 0.0001 
1/x3 -4.436229 0.351293 -12.6283 0.0000 
R-SQ. (ADJ.) = 0.9893 SE= 0.039499 MAE= 0.028896 DurbWat= 1.869 
Previously: 0.9574 0.078919 0.053775 2.031 
20 observations fitted, forecast (s) computed for 0 missing val. of dep. var. 

Analysis of Variance for the Full Regression 
Source Sum of Squares DF Mean Square F-Ratio P-value 
Model 2.75054 3 0.916847 587.649 .0000 
Error 0.0249631 16 0.00156020 
Total (Corr.) 2.77550 19 
R-squared = 0.991006 Stnd. error of est. = 0.0394993 


R-squared (Adj. for d.f.) = 0.98932 Durbin-Watson statistic = 1.86891 


Residual Plot for log(y) 


Residuals 





3.8 4.1 4.4 4.7 5 5.3 
Predicted 


Using 1/x3 


The residual plot indicates better conformance to assumptions. 


Curvature is removed when using 1/x; as the regressor instead of x; and the log of the response data. 


Section 12-6 


12-47. a) y 2 —1.63341.232x - 1.495x^ 


b) fo = 1858613, reject Ho 
c) to = —601.64, reject H 
d) Model is acceptable, observation number 10 has large leverage. 


Res Is V X . 
esiduals iion Residuals Versus the Fitted Values 
(response is y) i 
(response is y) 





Fitted Vdue 


Normal Probability Plot of the Resduals 


(response is y) 


Residual 


Narmal Score 


x—x 





12-49. | $—2759.395—90.783x'—47.166(x')? , where x'— 


a) Atx=285 x= A = —].016 
11.9336 


$ = 759.395 — 90.783(—1.106) — 47.166(—1.106)?° = 802.943 psi 
b) P= 759.395 — 90.7832225 )-— 47.1666725 J 


11.9336 


$ = 759.395 — 7.607 (x — 297.125) — 0.33 I(x — 297.125)” 
y = —26204.14 +189.09x — 0.33 1x" 


c) They are the same. 
d) y'= 0.385 — 0.847x'-0.440(x') 


X-X 
Sx 





and X'= 





where y' = 


y 
The "proportion" of total variability explained is the same for both standardized and un-standardized models. 
Therefore, R? is the same for both models. 


y'= By + Bix*BiG Y where y'== 








and x= y fe Bore] ny 
X 


y 
12-51 a) Predictor Coef SE Coef T P 
Constant -1.769 1.287 -1.37 0.188 
xl 0.4208 0.2942 1.43 0.172 
x2 0.2225 0.1307 1.70 0.108 
acd -0.12800 0.07025 -1.82 0.087 
x1x2 -0.01988 0.01204 -1.65 0.118 
x1x3 0.009151 0.007621 1.20 0.247 
X2x3 0.002576 0.007039 0.37 0.719 
x1^2 -0.01932 0.01680 -1.15 0.267 
x2^2 -0.00745 0.01205 -0.62 0.545 
x3^3 0.000824 0.001441 0.57 0.575 
S = 0.06092 R-Sq = 91.7% R-Sq(adj) = 87.0% 
Analysis of Variance 
Source DF SS MS F P 
Regression 9 0.655671 0.072852 19.63 0.000 
Residual Error 16 0.059386 0.003712 


Total 25 0.715057 


$ =-1.769 + 0.421x, + 0.222x, — 0.128x, —0.02x,x, + 0.009x,x, + 
0.003x,x, —0.019x,° —0.007x,^ + 0.001x," 
b) Hy: all B, = B, = B, =K = B,, =0 
Hy atleast! P, #0 
f, 19.628 
Yos: 52:54 
Joc? oae 


Reject Ho and conclude that the model is significant at œ = 0.05 
c) Model is acceptable. 


d Hy: Ba = Boy = Bs = Biz = By; = Ba =0 


Hı. at least one Bij # 0 


f. SS. (Dus Boo B33» Bi Bis> Bos | Bis Po» Ds Dor e — 1.612 
f MS, 0.003712 
J 05.6.16 = 2.74 


fo ? J 05.6.16 


Do not reject Ho 


SS Bis Dis Pas Dii. Doo. B33 | Bi» B23 B33 Bo) = SS x (Ds B2, B33 Pia Piss Boas Birs B223 B33 | Bo) - 


SS CD; D; D, | Bo) 


= 0.65567068 — 0.619763 
= 0.0359 





Reduced Model: y — Bo + p,x, + Dy. t pix, 


12-55. | a) The min. MS; equation is X1, Xo, X3, X4, X5, X6, X7, Xg 


MS, = 6.58 Cp = 5.88 
The min. C, xs, xs, X10 
C, =5.02 MS, =7.97 


b) $ —34.434 — 0.048x, 
MS,-881  C,-555 


c) Same as part b. 


d) $ —0.3414- 2.862x, +0.246x, —0.010x,, 
MS,=7.97  C,-502 


e) Minimum C, and backward elimination result in the same model. Stepwise and forward selection 
result in the same model. Because it is much smaller, the minimum C, model seems preferable. 


12-61. a)Min.C, 
y = —3.517 + 0.486x, — 0.156x, 


C, = -1.67 
b) Min MS; model is X1, x7, Xo, MS, = 1.67, C, — —0.77 
y = —5 964 + 0 495x, + 0 025x; — 0 163xo 


c) Max. adjusted R model is X1, X7, Xo, Adj. R? = 0.98448 Yes, same as Min. MS; model. 


Supplemental Exercises 


12-65. a)H, "D -pgn-p o0 
H,: B, #0 for at least one j 


a=0.01 
fy = 1323.62 


f01,3,36 = 4.38 
fo >> fa 5.36 


Reject Ho and conclude regression is significant. 
P-value < 0.00001 


b) a —0.01 Lo0s,36 = 2.72 


Ho:ß; = 0 Ho: By = 0 Ho:Bs = 0 
H: B # 0 H: B4, # 0 H,:B; z 0 

to = —1.32 to = 19.97 to = 2.48 
[tol ta/2,36 Itol> ta/2,36 Ito] * 52.36 
Do not reject Ho Reject Ho Do not reject Ho 


Only regressor x4 is significant 


c) Curvature is evident in the residuals vs. regressor plots from this model. 


12-67. a) 5 2 —0.908 + 5.482x, +1.126x, —3.920x; -1.143x, 


b) Hy:B, = Ba =P; =B, =0 
H,:B, #0 for at least one j 


o = 0.05 
fo = 109.02 


F05,4,19 = 2.90 


fo 22 a,4,19 
Reject Hg and conclude regression is significant at a = 0.05. 


a= 0.05 t 025,19 = 2.093 

H,: B, =0 H,: B, =0 H, : p, =0 H,: B, =0 
H,:B,#0 £4,:B, #0 H: B; #0 H,: B, #0 
£, 211.27 ft, 214.59 ty = —6.98 tọ = -8.11 

Itol> tay219 — [tol? ta/2,19 Itol> ta/2,19 Itol> ta/2,19 
Reject Ho Reject Ho Reject Hg Reject Ho 


c) The residual plots are more pleasing than those in Exercise 12-66. 


12-69. | a) $ =—3982.1+1.0964x, + 0.1843x, + 3.7456x, + 0.8343x, — 16.278 1x, 


MS (p) = 694.93 C, =5.62 
b) $——4280.2 +1.442x, + 0.209x, +0.6467x, —17.5103x, 
MS,(p) 2 71420 Ceb] 


c) Same as model b. 
d) Models from parts b. and c. are identical. Model in part a. is the same with x4 added in. 
MSyg model in part a. = 694.93 ou = 562 


MS; model in parts b.&c. = 714.20 C, =5.57 


12-71. a) VIF(B;) 251.86 
VIF(B,) -9.11 
VIF(B,) = 28.99 


* 
Yes, VIFs for X4 and X5 exceed 10. 
b) Model from Exercise 12-65: p —19.69—1 21x, + 0.005x, + 0.0004x; 


12-73. aR me 
SS, 
SS, = R2(SS,.) = 0.94(0.50) = 0.47 
$$, = SS, — SS, = 0.5—0.47 = 0.03 
Ho : B - p, =...= B, -0 


H, : p, #0 forat least one j. 





œ= 0.05 

f =—SSalk_ 047/6 og 
SS,/n—-p  0.03/7 

Joss. =3.87 

fo > fas 

Reject Hj. 


b)k-5 n-14 p-6  R^z0.92 
Dop = R*(SS,) = 0.92(0.50) = 0.46 
SS,,'= SS, —SS,'= 0.5 — 0.46 = 0.04 
SS; (B; , Douce | Bo) = SS, ( full) — SS, (reduced) 





= 0.47 — 0.46 
= 0.01 
J= SS p (D; | D asia eas, | B,)/r E 0.01/1 22 
i SS .'/(n — p) 0.04/8 
fosis 79.32 
Jo ? Jaiz 
Do not reject Hj and conclude that the evidence is insufficient to claim that the removed variable is significant at 
a = 0.05 
c) MS, (reduced) = Da 0.005 
n—p 8 


MS,,( full) = ES — 0.00429 


No, the MS; is larger for the reduced model, although not by much. Generally, if adding a variable to a model 
reduces the MS; it is an indication that the variable may be useful in explaining the response variable. Here the 
decrease in MS; is not very great because the added variable had no real explanatory power. 


Chapter 13 Selected Problem Solutions 


Section 13-2 


13-1. a) Analysis of Variance for STRENGTH 


Source DF SS MS F P 
COTTON 4 475.76 118.94 14.76 0.000 
Error 20 161.20 8.06 

Total 24 636.96 


Reject Hp and conclude that cotton percentage affects mean breaking strength. 
b) Tensile strength seems to increase to 30% cotton and declines at 35% cotton. 


, qu 


15 20 25 30 35 
COTTON 


STRENGTH 


c) The normal probability plot and the residual plots show that the model assumptions are reasonable. 


Residuals Versus the Fitted Values " : 
(response is STRENGTH) Normal Probability Plot of the Residuals 
(response is STRENGTH) 


Kobe © ^ OA NY WHR 0 o 








8 
g |; 
10 15 20 2 1 0 1 2 
Fitted Value Norma Score 
Res duals Versus COTTON 
(response is STRENGTH) 
8 
g 
13-3. a) Analysis of Variance for STRENGTH 
Source DF SS MS F P 
TECHNIQU 3 489740 163247 12.73 0.000 
Error 12 153908 12826 


Total 15 643648 


Residual 


13-5. 


Residual 





Reject Hy. Techniques affect the mean strength of the concrete. 
b) P-value = 0 


c) Residuals are acceptable 


Residuals Versus the Fitted Values 
(response is STRENGTH) 


Residuals Versus TECHNIQU 


(response is STRENGTH) 

















E 
g 
1 2 3 4 2650 2750 2850 2950 3050 3150 
TECHNIQU Fitted Value 
Normal Probability Plot of the Residuals 
(response is STRENGTH) 
2-4 
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D 
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Residual 
a) Analysis of Variance for CONDUCTIVITY 
Source DF SS MS F P 
COATINGTYPE 4 1060.5 265.1 16.35 0.000 
Error 15 243.3 16.2 
Total 19 1303.8 


Reject Hp ; P-value = 0. 


b) There is some indication that the variability of the response may be increasing as the mean response 
increases. There appears to be an outlier on the normal probability plot. 


Residuals Versus COATINGT 
Meeponse is CONDUCTI) Residuals Versus the Fitted Values 
(response is CONDUCTI) 





COATINGT Fitted Value 


Normal Score 


Normal Probability Plot of the Residuals 
(response is CONDUCTI) 














| | | | 
-10 -5 0 5 


Residual 


c) 95% Confidence interval on the mean of coating type 1. 


MS, 
Íoo2515 ?r <y, S yi t fooisis a 
145.00 — 2.13 i «145.00 +2.13 un 
4 4 
929 


140.71< u, < 





d.) 99% confidence interval on the difference between the means of coating types 1 and 4. 


"EM 2MS, NEN 2MS, 
Yı — Y4 T Íooosis : S I, =U, Sy, — V4 t fooosis| ——— 


(145.00—12925) — 2.947, 2 e Ll, — u, € (145.00 —12925) — 2.947, | a 


7.36€ u, — U, €24.14 





a) Analysis of Variance for STRENGTH 


Source DF SS MS F P 
AIRVOIDS 2 1230.3 615.1 8.30 0.002 
Error 21 1555.8 74.1 

Total 23 2786.0 

Reject Ho 


b) P-value = 0.002 


c) The residual plots show that the assumptions of equality of variance is reasonable. The normal probability 


plot has some curvature in the tails. 


Residuals Versus the Fitted Values 
(response is STRENGTH) 


Residual 





Residuals Versus AIRVOIDS 


(response is STRENGTH) 





Residual 











AIRVOIDS 


Normal Probability Plot of the Residuals 
(response is STRENGTH) 





Normal Score 
c 
| 











Residual 


d) 95% Confidence interval on the mean of retained strength where there is a high level of air voids 


- MS, O MS, 
Y3 7902521 , SUM; S y4 t fooiszi 


n 
8.229 — 2.080, = < U, <8.229 + 2.080, A 


69.17€ u, $81.83 


e) 95% confidence interval on the difference between the means of retained strength at the high level and the 
low levels of air voids. 


TEN 2M5; — 2MS, 
Yı Z Y3 7 Íoo521 ud Sp, = Hs Sy, T V3 t Íoosoi EU 
(92.875— 75.5) — 2.080 LM < u, — u, €(92.875— 75.5) — 2.080 — I) 


8.42 < L, — L, $26.38 


Section 13-3 


13-21 a) Analysis of Variance for OUTPUT 


Source DF SS MS F P 
LOOM 4 0.3416 0.0854 5.77 0.003 
Error 20 0.2960 0.0148 

Total 24 0.6376 


Reject Hp, and conclude that there are significant differences among the looms. 


„2 MS — MS .0854 — 0.014 
b) ô - Treatments E = 0.0854 — 0.0148 — 0.01412 
n 5 
>) 6? = MS, = 0.0148 
d) Residuals plots are acceptable 
Res duals Versus LOOM Residuals Versus the Fitted Values 


(response is OUTPUT) (response is OUTPUT) 


Residual 





LOOM Fitted Value 


Normal Probability Plot of the Residuals 
(response is OUTPUT) 





Normal Score 
eo 
| 











-0.2 -0.1 0.0 0.1 0.2 
Residual 


Section 13-4 


13-25. a) Analysis of Variance for SHAPE 


Source DF SS MS F P 
NOZZLE 4 0.102180 0.025545 8.92 0.000 
VELOCITY 5 0.062867 0.012573 4.39 0.007 
Error 20 0.057300 0.002865 

Total 29 0.222347 


Reject Ho, and conclude that nozzle type affects the mean shape measurement. 


145 1.15 
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b) Fisher's pairwise comparisons 


Family error rate = 0.268 
Individual error rate = 0.0500 
Critical value = 2.060 
Intervals for (column level mean) - (row level mean) 
1 2 3 4 
2 -0.15412 
0.01079 
3 -0.20246 -0.13079 
-0.03754 0.03412 
4 -0.24412 -0.17246 -0.12412 
-0.07921 -0.00754 0.04079 
5 -0.11412 -0.04246 0.00588 0.04754 
0.05079 0.12246 0.17079 0.21246 


There are significant differences between nozzle types 1 and 3, 1 and 4, 2 and 4, 3 and 5, 
and 4 and 5. 


c) The residual analysis shows that there is some inequality of variance. The normal probability plot is 
acceptable. 


Res duals Versus VELOCITY 
(response is SHAPE) 


Residuals Versus NOZZLE 
(response is SHAPE) 


0.1 


Residual 


0.0 





Residuals Versus the Fitted Values 
(response is SHAPE) 


Residual 


0.1 





Fitted Value 


Normal Probability Plot of the Residuals 
(response is SHAPE) 





Normal Score 
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Supplemental Exercises 


13-31. a)analysis of Variance for RESISTANCE 


Source DF SS MS F P 
ALLOY 2 10941.8 5470.9 76.09 0.000 
Error 27 1941.4 71.9 

Total 29 12883.2 


Reject Ho, the type of alloy has a significant effect on mean contact resistance. 


b) Fisher's pairwise comparisons 


Family error rate = 0.119 
Individual error rate = 0.0500 
Critical value = 2.052 
Intervals for (column level mean) - (row level mean) 
1 2 
2 -13.58 
1.98 
3 -50.88 -45.08 
-35.32 -29.52 


There are differences in the mean resistance for alloy types 1 and 3, and 2 and 3. 


c) 99% confidence interval on the mean contact resistance for alloy 3 


Y3 — 9005271 : SU; $ y4 t 1oops57 


1404 -2,771,]-22 < u, «1404-2771 2 
10 10 


13297 < u, €147.83 


d) Variability of the residuals increases with the response. The normal probability plot has some 
curvature in the tails, indicating a problem with the normality assumption. A transformation of the 


response should be conducted. 


Res duals Versus ALLOY Residuals Versus the Fitted Values 


(response is RESISTAN)) (response is RESISTAN) 


Residual 
e 





2 3 100 110 120 130 140 


ALLOY Fitted Value 


Normal Probability Plot of the Residuals 
(response is RESISTAN) 





Normal Score 
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Residual 


13-35. a)Analysis of Variance for VOLUME 


Source DF SS MS F P 
TEMPERATURE 2 16480 8240 7.84 0.007 
Error 12 12610 1051 

Total 14 29090 

Reject Ho. 


b) P-value = 0.007 


c)Fisher's pairwise comparisons 


Family error rate = 0.116 
Individual error rate = 0.0500 
Critical value = 2.179 
Intervals for (column level mean) - (row level mean) 
70 75 
75 -16.7 
72.7 
80 35.3 7.3 


124.7 96.7 


There are significant differences in the mean volume for temperature levels 70 and 80, and 75 and 80. The 
highest temperature (8096) results in the smallest mean volume. 


d)There are some relatively small differences in the variability at the different levels of temperature. The 
variability decreases with the fitted values. There is an unusual observation on the normal probability plot. 


Residuals Versus TEMPERAT Residuals Versus the Fitted Values 


(response is VOLUME) (response is VOLUME) 


50 50 





3 3 
g 8 
-50 -50 
70 75 80 1170 1180 1190 1200 1210 1220 1230 1240 1250 
TEMPERAT Fitted Vdue 
13-37. a) Analysis of Variance for PCTERROR 
Source DF SS MS F P 
ALGORITH 5 2825746 565149 6.23 0.000 
PROJECT 7 2710323 387189 4.27 0.002 
Error 35 3175290 90723 
Total 47 8711358 
Reject Ho , the algorithm is significant. 
b) The residuals look acceptable, except there is one unusual point. 
Residuals Versus PROJECT Res duals Versus ALGORITH 


(response is PCTERROR) 


(response is PCTERROR) 


1000 


500 


Residual 
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Resduals Versus the Fitted Values 
(response is PCTERROR) 
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Normal Probability Plot of the Residuals 
(response is PCTERROR) 
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c) The best choice is algorithm 5 because it has the smallest mean and a low variablity. 





13-39. aj} = |, 4420") -3 
c? 


From Chart VIII with numerator degrees of freedom = a - 1 = 4, denominator degrees of freedom = a(n - 1) 
= 15, B = 0.15, and the power = 1 - B = 0.85. 


b) 
n À a(n - 1) B Power=1-ß 
9 3.317 20 0.10 0.90 


The sample size should be approximately n = 5 


Section 14-3 


Chapter 14 Selected Problem Solutions 


14-1. a)Analysis of Variance for life 
source DF SS 
material 2 10683.7 
temperat 2 391118.7 
material*temperat 4 9613.8 
Error 27 18230.7 
Total 35 77647.0 


Main factors and interaction are all significant. 


b)The mean life for material 2 is the highest at temperature level 1, in the middle at temperature 
level 2 and the lowest at temperature level 3. This shows that there is an interaction. 
Interaction Plot - Means for life 
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c) There appears to be slightly more variability at temperature 1 and material 1. The normal 
probability plot shows that the assumption of normality 1s reasonable. 


Normal Probability Plot of the Residuals 


(response is life) 





Normal Score 
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Residuals Versus temperat Residuals Versus material 


(response is life) (response is life) 
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14-3 aH iTi =T, =0 
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b) Analysis of Variance for current 


Source DF SS MS F E 
glasstyp 1 14450.0 14450.0 273.79 0.000 
phostype 2 93342 466.7 8.84 0.004 
glasstyp*phostype 2 13343 66.7 1.26 0.318 
Error 12 633.3 52.8 

Total 17 16150.0 


Main effects are significant, the interaction is not significant. Glass type 1 and phosphor type 2 
lead to the high mean current (brightness). 


c) There appears to be slightly more variability at phosphor type 2 and glass type 2. The normal 
plot of the residuals shows that the assumption of normality is reasonable. 


Resduals Versus phostype Residuals Versus glasstyp 


(response is curent) 


(response is curent) 


Residual 
e 





Residuals Versus the Fitted Values 
(response is curent) 


Residual 





14-7 The ratio 


Normal Probability Plot of the Residuals 


(response is current) 





Normal Score 











Residual 


T = has a t distribution with ab(n-1) degrees of freedom 


42MS,. /n 


Therefore, the (1-œ)% confidence interval on the difference in two treatment means is 


MEN 2MS, MM 
Ya m Jo m Í /2,ab(n-l) j < U; m Hj = Xj. m X.j. + E 





For exercise 14-6, the mean warping at 80% copper concentration is 21.0 and the mean warping at 60% 
copper concentration is 18.88 a=4, b=4, n=2 and MS5-6.78. The degrees of freedom are (4)(4)(1)=16 


i x 
(21.0 18.88) — 2.120 — < u, — My < (21.0 18.88) + 2.120 —— 


—3.40& LL, — U, € 7.64 


Therefore, there is no significant differences between the mean warping values at 80% and 60% copper 
concentration. 


Section 14-4 


14-11 Parts a. and b. 
Analysis of Variance for strength 


Source DF SS MS F P 
hardwood 2 8-3750 4.1875 7.64 0.003 
cookingtime a 1743611 17.3611 31.66 0.000 
freeness 2 21-8517 10,9258 19.92 0.000 
hardwood*cookingtime 2 3.2039 1.6019 2.92 0.075 
hardwood*freeness 4 6.5133 1.6293 24.97 0.042 
cookingtime*freeness 2 1.0506 0.5253 0.96 0.399 
Error 22 12.0644 0.5484 

Total 35 70.4200 


All main factors are significant. The interaction of hardwood * freeness is also significant. 


c) The residual plots show no serious problems with normality or equality of variance 


Residuals Versus freeness 
(response is strength) 
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Residuals Versus hardwood 
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Residual 


Section 14-5 


14-13a) Analysis of Variance for life 


Source DF 
Speed 

hardness 

angle 
Speed*hardness 
Speed*angle 
hardness*angle 
Error 

Total 


TORPRPRPRERE 


Hæ 


b)Estimated Effects and Coefficients for life 


SS 
1332 
28392 
20592 
506 
56882 
2352 
24530 


134588 


* 
3 
* 
* 
$ 


Residuals Versus cookingt 
(response is strength) 


Residuals Versus the Fitted Values 


(coded units) 


MS 
1332 
28392 
20592 
506 
56882 
2352 
2726 


(response is strength) 


F P 
0.49 0.502 
10.42 0.010 
7.56 0.023 
0.19 0.677 
20.87 0.000 
0.86 0.377 


(coded units) 


* 


4 € +e 








50 


-50 


Term Effect 


Constant 

Speed 18 
hardness 84 
angle 2e 
Speed*hardness -11. 
Speed*angle eL. 
hardness*angle -24 
Speed*hardness*angle -34 


.25 
.25 


75 
25 
25 


<25 
25 


Coef 


413 


42 
35 
=5 


=59; 
.12 
-17. 


-12 


. 13 
D4 
.12 
.87 
20:9 


12 


62 


37 


SE Coef 


12 
12 
12 
12 
12 
12 
12 
12 


y =413.125 + 9.125x, + 45.12x2 + 35.87x3 —59.62x13 
c) Analysis of the residuals shows that all assumptions are reasonable. 


Normal Probability Plot of the Residuals 


(response is life) 
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Residual 


Residuals Versus angle 
(response is life) 


enge 


Residuals Versus speed 


(response is life) 


50 


Residuals Versus hardness 


.41 
.41 
.41 
.41 
.41 
.41 
.41 
.41 


(response is life) 


Residuals Versus the Fitted Values 
(response is life) 


hardness 


8.3 


T 


s20 
.74 
.40 
:59 
.45 
.81 
Xr 
.40 


OOOO OO 0 CO 


P 


.000 
.483 
+009 
.020 
.662 
.001 
.357 
.199 








14-19. 


a) Factors A, B, C, and the interaction AB appear to be significant from the normal probability plot of the 





effects. 
Normal Probability Plot of the Effects 
(response is yield, Alpha = .10) 
A: factor A 
B: factor B 
C: factor C 
D: factor D 
E: factor E 
: 
E 
2 
10 20 
Effect 
b) 
Analysis of Variance for yield (coded units) 
Term Effect Coef StDev Coef T P 
Constant 30.5312 0.2786 109.57 0.000 
factor A 11.8125 5.9063 0.2786 21.20 0.000 
factor B 9.6875 4.8437 0.2786 17.38 0.000 
factor D -0.8125 -0.4063 0.2786 -1.46 0.164 
factor E 0.4375 0.2187 0.2786 0.79 0.444 
factor A*factor B 7.9375 3.9687 0.2786 14.24 0.000 
factor A*factor C 0.4375 0.2187 0.2786 0.79 0.444 
factor A*factor D -0.0625 -0.0313 0.2786 -0.11 0.912 
factor A*factor E 0.9375 0.4687 0.2786 1.68 0.112 
factor B*factor C 0.0625 0.0313 0.2786 0.11 0.912 
factor B*factor D -0.6875 -0.3437 . 0.2786 -].23 0.235 
factor B*factor E 0.5625 0.2813 0.2786 1.01 0.328 
factor C*factor D 0.8125 0.4062 0.2786 1.46 0.164 
factor C*factor E 0.3125 0.1563 0.2786 0.56 0.583 
factor D*factor E -1.1875 -0.5938 | 0.2786 -2.13 0.049 
Analysis of Variance for yield 
Source DF Seq SS Adj SS Adj MS F P 
Main Effects 5 11087.9 111087.9 2217.58 892.61 0.000 
2-Way Interactions 10 536.3 536.3 53.63 21.59 0.000 
Residual Error 16 39.7 39.7 2.48 
Total 31 11664.0 


The analysis confirms our findings from part a) 


c) The normal probability plot of the residuals is satisfactory. However their variance appears to increase 
as the fitted value increases. 


Standardized Residual 


Normal Probability 





.999 
.99 
Pr .95 
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RESI1 
Average: 0.0000000 Anderson-Darling Normality Test 
StDev: 1.59479 A-Squared: 0.387 
N: 32 P-Value: 0.368 


Residuals Versus the Fitted Values 


(response is yield) 


Standardized Residual 





0 10 20 30 40 50 60 
Fitted Value 


d) All plots support the constant variance assumption , although there is a very slight indication that 
variability is greater at the high level of factor B. 


Residuals Versus A Residuals Versus B 


(response is yield) (response is yield) 


Standardized Residual 
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Standardized Residual 


Standardized Residual 





Residuals Versus C Residuals Versus D 


(response is yield) (response is yield) 


Standardized Residual 





Residuals Versus E 


(response is yield) 
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e) The AB interaction appears to be significant. The interaction plot from MINITAB indicates that a high 
level of A and of B increases the mean yield, while low levels of both factors would lead to a reduction in the 
mean yield. 


Interaction Plot for yield 


55 


45 





f.) To increase yield and therefor optimize the process, we would want to set A, B, and C at their high 
levels. 


g) It is evident from the cube plot that we should run the process with all factors set at their high level. 


Cube Plot - Means for yield 
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14-21 


Normal Probability Plot for the Main Effects 


ML Estimates 





Percent 














50 0 
Data 


50 


b) Based on the normal probability plot of the effects, factors A, B and AB are significant. 
The model would include these three factors. 


c) The estimated model is: y = 400 + 40.124x, —32.75x, + 26.625x,, 


Section 14-6 


Term 

Constant 
Block 

factor A 
factor B 
factor c 
factor D 


14-25 Model with four blocks 


factor A*factor B 
factor A*factor C 
factor A*factor D 
factor B*ractor C 
Factor B*ractor-D 
tractor O*ractor D 
factor A*factor B*'factor C 
factor A*factor B*factor D 
tactor A*ractor C*factor. D 
factor B*rtactor C*factor D 


Term 

Constant 
Block 

factor A 
factor B 
factor C 
factor p 


Effect 


-10.000 
-0.750 
-0.750 

5.000 


Analysis of Variance for var 1 


Source 
Blocks 


Main Effects 


Residual Error 


Total 


BLOCK A B C 
1 -1 -1 -1 
1 1 -1 1 
1 -1 1 -1 
1 1 1 1 
2 1 -1 -1 
2 -1 -1 1 
2 1 1 -1 
2 -1 1 1 
3 -1 1 -1 
3 1 1 1 
3 -1 -1 -1 
3 1 -1 1 
4 1 1 -1 
4 -1 1 1 
4 1 -1 -1 
4 -1 -1 1 
Effect 
1 
-10.000 
-0.750 
-0.750 
5.000 
4.500 
0.500 
-3.750 
-1.250 
-1.500 
1.500 
-6.000 
4.750 
-0.250 
-2.000 
Coef StDev Coef 
183.375 1.607 
-1.625 1.607 
-5.000 1.607 
-0.375 1.607 
-0.375 1.607 
2.500 1.607 
DF Seq SS 
1 42.25 
4 504.50 
10 413.00 
15 959.75 


Factor A is the only significant factor. 


Term 
Constant 
Block 1 


2 


Effect 


Coef 
56.37 
15.63 
-3.38 


14-29 a) Estimated Effects and Coefficients for y 


StDev Coef 
2.633 
4.560 
4.560 


D var 1 
-1 190 
-1 181 
1 187 
1 180 
-1 174 
-1 177 
1 185 
1 187 
-1 181 
-1 173 
1 198 
1 179 
-1 183 
-1 188 
1 172 
1 199 
Coef 
83.375 
-1.625 
-5.000 
-0.375 
-0.375 
2.500 
2.250 
0.250 
-1.875 
-0.625 
-0.750 
0.750 
-3.000 
2.375 
-0.125 
-1.000 
T P 
114.14 0.000 
-1.01 0.336 
-3.11 0.011 
-0.23 0.820 
-0.23 0.820 
1.56 0.151 
Adj SS Adj 
42.25 42 
504.50 126 
413.00 41 
P P 
21.41 0.000 
3.43 0.014 
-0.74 0.487 


Residual 


Residual 


3 -10.88 
A -45.25 -22.62 
B -1.50 -0.75 
C 14.50 7.25 
A*B 19.00 9.50 
A*C -14.50 -7.25 
B*C -9.25 -4.63 
Analysis of Variance for y 
Source DF Seq SS 
Blocks 3 1502.8 
Main Effects 3 9040.2 
2-Way Interactions 3 2627.2 
Residual Error 6 665.5 
Total 15 13835.7 


Factors A, C, AB, and AC are significant. 


b) Analysis of the residuals shows that the model is adequate. There is more variability on the response 


YNNN NN® 


.560 
.633 
.633 
.633 
.633 
.633 
.633 


.38 
.59 
.28 
.75 
.61 
.75 
. 76 
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Adj MS F 

500.9 4.52 

A 2717 

875.7 7.90 
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associated with the low setting of factor C, but that is the only problem. 


Residuals Versus A 


(response is y) 














Residuals Versus C 


(response is y) 














Residuals Versus the Fitted Values 


(response is y) 





Fitted Vdue 


Residual 


Residuals Versus B 


(response is y) 


P 
0.055 
0.001 
0.017 











Normal Probability Plot of the Residuals 


(response is y) 


Norma Score 





Normal Score 


c) Some of the information from the experiment is lost because the design is run in 4 blocks. This causes us to 
lose information on the ABC interaction even though we have replicated the experiment twice. If it 1s 
possible to run the experiment in only 2 blocks, there would be information on all interactions. 


d) To have data on all interactions, we could run the experiment so that each replicate is a block (therefore only 
2 blocks). 


Section 14-7 


14-31 a) Factors A, B and D are active factors. 


Normal Probability Plot of the Effects 


(response is color, Alpha = .10) 





Normal Score 














b) There are no serious problems with the residual plots. The normal probability plot has a little bit of 
curvature at the low end and there is a little more variability at the lower and higher ends of the fitted values. 


Normal Probability Plot of the Residuals 

















(response is var. 1) Residuds Versus the Fitted Values 
(response is var_1) 
2 — 
1.5- F 
1— ^ 1.04 
. 05- RO 
04 = le Se es att oe i eo A 
S 0.0 
a = e 
a D 40.54 : [] 
-1 — i 
Œ 10- ' 
-154 
e | | | | | | | | 
-2.5 -2.0 -1.5 -1.0 -0.5 0.0 0.5 1.0 1.5 204 . 
Residual 25 ü 











Fitted Value 


c) Part a. indicates that only A,B, and D are important. In these factors only, the design is a 2° with two 
replicates. 


Estimated Effects and Coefficients for var 1 

Term Effect Coef StDev Coeff T P 
Constant 2.7700 0.2762 10.03 0.000 
factor A 1.4350 0.7175 0.2762 2.60 0.032 


Normal Score 


























factor B -1.4650 -0.7325 0.2762 -2.65 0.029 
factor D 4.5450 2.2725 0.2762 8.23 0.000 
factor_A*factor B 1.1500 0.5750 0.2762 2.08 0.071 
factor_A*factor D -1.2300 -0.6150 0.2762 -2.23 0.057 
factor B*raccor D 0.1200 0.0600 0.2762 0.22 0.833 
factor_A*factor B*factor D -0.3650 -0.1825 0.2762 -0.66 0.527 
Analysis of Variance for var 1 
Source DF Seq SS Adj SS Adj MS F P 
Main Effects 3 99.450 99.4499 33.1500 27.15 0.000 
2-Way Interactions 3 11.399 11.3992 3.7997 3.11 0.088 
3-Way Interactions 1 0.533 0.5329 0.5329 0.44 0.527 
Residual Error 8 9.767 9.7668 1.2208 
Pure Error 8 9.767 9.7668 1.2208 
Total 15 121.149 
Factors A, B, D, AB and AD are significant. 
Normal Probability Plot of the Residuals Residuals Versus the Fitted Values 
(response is var. 1) (response is var_1) 
2 
1 
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[: 
2 
0-4 2 ls EM ccc MM ECT 
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a 
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The normal probability plot and the plot of the residuals versus fitted values are satisfactory. 


14-35 Since factors A, B, C, and E form a word in the complete defining relation, it can be verified that the 
resulting design is two replicates of a 2^! fractional factorial. This is different than the design that results 
when C and E are dropped from the 2°” in Table 14-28 which results in a full factorial because, the factors 
ABDF do not form a word in the complete defining relation 


14-37 Generators D=AB, E-AC for 2°”, Resolution III 


A B C D E var ] 
-1 -1 -1 l l 1900 
l -1 -1 -1 -1 900 
-1 l -1 -1 l 3500 
1 l -1 l -1 6100 
-1 -1 l l -1 800 
1 -1 l 1200 
-1 1 l -1 -1 3000 
l l l 6800 


Estimated Effects and Coefficients for var 1 


Term 

Constant 
factor A 
factor B 
Factor C 
factor D 
factor E 


Analysis of Variance for var 1 


Source 


Normal Score 


Main Effects 
Residual Error 


Total 


Normal Probability Plot of the Standardized Effects 


(response is var. 1, Alpha = .10) 














» factor B 
» factor D 
» factor A 
» factor E 
| | | 
0 10 20 
Standardized Effect 


Effect 


1450 
3650. 
-150 
1750 

650. 


<00 


00 


+00 
.00 


00 


Coef 


3025 


725. 
1825. 


-75 


875. 


325 


DF 


5 
2 
7 


Factors A, B and D are significant. 


Supplemental Exercises 


14-41 a Estimated Effects and Coefficients for var 1 


Term 
Constant 


factor A (PH) 
factor B (CC) 
factor A*factor B 


Analysis of Variance for var 1 


Source 
Main Effects 


2-Way Interactions 
Residual Error 


Pure Error 
Total 


Effect 


5 
-0 
11 


DF 


.875 
.125 
.625 


4 QU 
00 
00 
200 
00 
«00 


Seq 


SE Coef 
90.14 33 
90.14 8 
90.14 20 
90.14 -0 
90.14 9. 
90.14 3 


SS 


37865000 
130000 
37995000 


Coef 


191.563 


2.937 


-0.062 


5.812 


Seq SS 


138. 
540. 
zm s 
257. 
935. 


125 
562 
250 
250 
938 


(coded units) 


T 


.56 
.04 
.25 
. 83 
71 
.61 


Adj SS 
37865000 
130000 


SE Coef 
1.158 
1.158 
1.158 
1.158 


(coded units) 


Adj SS 


138 
540 
257 
257 


.125 
.562 
.250 
.250 


165 


540 


OOO 0 0 O 


(coded units) 


P 


. 001 
.015 
.002 
.493 
.010 
-0609 


Adj MS 


F 


7573000 116.51 


(coded units) 


T 


.49 
.54 
.05 
.02 


j MS 
.06 
.56 
.44 
.44 


65000 


OOO 0o 
O 
UI 
co 


P 
0.009 


The main effect of pH and the interaction of pH and Catalyst Concentration (CC) are significant at the 0.05 
level of significance. 


The model used is viscosity = 191.563 + 2.937x, — 0.062x, + 5.812x), 


b.) The interaction plot shows that there is a strong interaction. When Factor A is at its low level, the mean 
response is large at the low level of B and is small at the high level of B. However, when A is at its high 
level, the results are opposite. 


Interaction Plot (data means) for var 1 





200 — 


195 — 


Mean 


185 — 














factor B 


c.) The plots of the residuals show that the equality of variance assumption is reasonable. 
However, there is a large gap in the middle of the normal probability plot. Sometimes, this can 
indicate that there is another variable that has an effect on the response but which is not included 
in the experiment. For example, in this experiment, note that the replicates in each cell have two 
pairs of values that are very similar, but there is a rather large difference in the mean values of the 
two pairs. (Cell 1 has 189 and 192 as one pair and 198 and 199 as the other.) 


Residuals Versus factor B 


: (response is var. 1) 
Residuals Versus factor A 
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2 [a 
c 
5— 54 
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factor_A factor B 
Residuals Versus the Fitted Values 
(response is var, 1) 
Normal Probability Plot of the Residuals 5— = 
(respo is var 1) 
24 
E 
T= e] i EEEN PONE DEUM S E NOAA EIA AANA RUNE EIAN AINI AINNE AAI AAAI T AANE ER 
g v 
B id 
ao 
T 9-3 
E 
o 
Zz 
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5A 
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T T T 
-5 0 5 Fitted Value 
Residual 
14-47 a) Term Effect 


V -15.75 


F 8.75 


P 10.75 
G -25.00 
V* F 3.00 
V*P -8.00 
V*G cabe Ju 
E*P -6.00 
F*G S515 
P*G -19.25 
V*F*P -1425 
V*F*G 0.50 
V*P*G -1.50 
F*P*G -12.50 
V*F*P*G -4.25 


b) 


Normal Probability Plot of the Effects 


(response is Roughnes, Alpha = .10) 





Normal Score 
eo 
| 














Effect 


According to the probability plot, factors V, P, and G and, PG are possibly significant. 


Estimated Effects and Coefficients for roughnes (coded units) 


Term Effect Coef SE Coef T P 
Constant 102.75 2.986 34.41 0.000 
V -15.75 -7.87 2.986 -2.64 0.046 
F 8.75 4.37 2.986 1.46 0.203 
P 10.75 bao? 2.986 1.80 0.132 
G -25.00 -12.50 2.986 -4.19 0.009 
V*F 2200 1.50 2.986 0.50 0.637 
V*P -8.00 -4.00 2.986 -1.34 0.238 
V*G -2.75 -1.38 2.986 -0.46 0.665 
F*P -6.00 -3.00 2.986 -1.00 0.361 
F*G 375 1.88 2.986 0.63 0.558 
P*G -19.25 -9.63 2.986 -3.22 0.023 
Analysis of Variance for roughnes (coded units) 

Analysis of Variance for Roughnes (coded units) 

Source DF Seq SS Adj SS Adj MS 


Main Effects 4 4260.7 4260.7 1065.2 


2-Way Interactions 6 2004.7 2004.7 334.1 2.34 0.184 
Residual Error 5 T1335 713.5 142.7 
Total 15 po TEC 


y 2102.75 — 7.87x, - 5.37 x, —12.50x, —9.63x,, 


c) From the analysis, we see that water jet pressure (P), abrasive grain size (G), and jet traverse speed (V) are 
significant along with the interaction of water jet pressure and abrasive grain size 


d) The residual plots appear to indicate the assumption of constant variance may not be met. The 
assumption of normality appears reasonable. 


14-49 The design uses G-VPF as the generator. 


Alias Structure 


Normal Probability Plot of the Residuals 







































































(response is Roughnes) Residuals Versus the Fitted Values 
PAE (response is Roughnes) 
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V*P 4 F*G 
V*F 4 P*G 
V*G 4 P*F 


Estimated Effects and Coefficients for C9 (coded units) 


Term Effect Coef SE Coer T P 

Constant 102.63 6.365 16.12 0.004 

V -14.75 S Sy 6.365 -1.16 0.366 

P -28.25 -14.12 6.365 Z222 0z L57 

F -1.25 -0.62 6.365 -0.10 0.931 

G -14.75 -7.38 6.365 -1.16 0.366 

PAG 17.75 8.88 6.365 1.39 0.298 

Analysis of Variance for C9 (coded units) 

Source DF Seq SS Adj SS Adj MS F P 
Main Effects 4 2469.5 2469.5 617.4 1.90 0.373 
2-Way Interactions 1 630.1 630.1 630.1 1.94 0.298 
Residual Error 2 648.3 648.3 324.1 

Total 7 3747.9 


The results do not show any significant factors. A lot of the information is lost due to the half- 
fraction of the design. 


14-51 Design Generators: D = AB E= AC 


Alias Structure 


I + ABD + ACE + BCDE 

A + BD + CE + ABCDE 

B + AD + CDE + ABCE 

C + AE + BDE + ABCD 

D + AB + BCE + ACDE 

E + AC + BCD + ABDE 

BC + DE + ABE + ACD 

BE + CD + ABC + ADE 

Design 
StdOrder A B C D È 

1 -1 -1 -1 1 1 
2 1 -1 -1 -1 - 
3 -1 1 -1 -1 
4 1 1 -1 1 -1 
5 -1 -1 1 1 -1 
6 -1 1 -1 1 
T - 1 1 -1 - 
8 1 1 1 1 


Chapter 15 Selected Problem Solutions 
Section 15-2 


15-1. 1. The parameter of interest is median of pH. 
2.H:u-T.0 
3H,:uz7.0 
4. o=0.05 
5. The test statistic is the observed number of plus differences or r` = 8. 
6. We reject H, if the P-value corresponding to r' = 8 is less than or equal to 00.05. 
7. Using the binomial distribution with n=10 and p=0.5, P-value = 2P(R 28|p-0.5)-0.109 
8. Conclusion: we cannot reject Hy. There is not enough evidence to reject the manufacturer's claim that 
the median of the pH is 7.0 


15-5 a. 1. The parameter of interest is the median compressive strength 
2.H,: = 2250 
3.H,:u » 2250 
4. 00.05 
5. The test statistic is the observed number of plus differences or r` = 7. 
6. We reject H, if the P-value corresponding to r' = 7 is less than or equal to 60.05. 
7. Using the binomial distribution with n=12 and p=0.5, P-value = P(R 27|p-0.5)7.3872 
8.Conclusion: cannot reject Hp. The median compressive strength is not more than 2250. 


b. 1. The parameter of interest is the median compressive strength 


2H. > = 2250 
3. H, : > 2250 
4. a=0.05 


r' —0.5n 
0.5Vn 
6. We reject Hy if the |Zo| > Zo 925 = 1.96 
_ 7 —0.5(02) 20.577 
0.5412 


8. Conclusion: cannot reject Hp. The median compressive strength is not more than 2250. 


5. Test statistic is Bom 


7. Computation: 7 


The P-value = 1-@(0.58) = 1-.7190 = 0.281 


15-7. 1. The parameter of interest 1s the median titanium content 
2.H,: yp =8.5 
3. eS 
4. a=0.05 
r' —0.5n 


5. Test statistic is z, = ———————— 
" — Qn 
6. We reject Hy if the |Zo| > Zo 925 = 1.96 
E 7 — 0.5(20) 2134 
0.5420 
8. Conclusion: cannot reject Hp. The median titanium content is 8.5. 
The P-value = 2*P(|Z|>1.34) = 0.1802. 


7. Computation: 7 


15-9. 1. The parameters of interest are the median hardness readings for the two tips 
2T usce 
3.H,: Uy #0 
4. a=0.05 


15-11. 


15-17. 


5. The test statistic is r = min( r`, r`). 
6. Since o=0.05 and n=8, Appendix,- Table VII gives the critical value of P o —2. We will reject 


Hy in favor of Hj, ifr <1. 
7.r' =6 andr“ =2 and so r=min(6,2) = 2 
8. Conclusion: cannot reject Hy. There is not significant difference in the tips. 


1. The parameters of interest are the median drying times for the two formulations. 
2. H H0 
S HUS IE AO 
4. o=0.05 
r` —0.5n 


0.5 


6. We reject Hy if the |Zo| > Zo 025 = 1.96 
_ 15-0.5(20) | 


——————— = 2.24 
0.520 
8. Conclusion: reject Hp. There is a difference in the median drying times between the two formulations. 


5. Test statistic is Ze 


7. Computation: 7 


The P-value = 2*P(\Z|>2.24) = 0.025. 

a) œ = P(Z>1.96) = 0.025 

b gap -roo ues |= PU ESSI 
p en | ( ) 


c) The sign test that rejects if R` € 1 has œ = 0.011 based on the binomial distribution. 
d) p E (R7 -1 | uU = 1) = 0.1587. Therefore, R^ has a binomial distribution with p=0.1587 


and n = 10 when u = 1. Then B = 0.487. The value of D is greater for the sign test than for the normal 
test because the Z-test was designed for the normal distribution. 





Section 15-3 


15-21 


15-23 


15-27. 


1. The parameter of interest is the mean titanium content 


2H uU=85 
3.H,:ux8.5 
4. 00.05 


5. The test statistic is w = min( w *, w^). 
6. We will reject Ho if w < Wp os = 52, since o=0.05 and n=20, the value in Appendix A, Table VIII. 
7. w' =71 and w; = 102 and so w=min(71,102) = 71 


8. Conclusion: Since 71252, we cannot reject Hp. 


1. The parameter of interest is the mean titanium content 


2S I e2:5 
S. pm 
4. a=0.05 


5. The test statistic w = min( w ' ,w ). 
6. We will reject Ho if w < Wp os = 65, since o=0.05 and n=22 the value in Appendix A, Table VIII. 
7. w` 225 and w` = 8 and so w=min(225, 8) = 8 


8.Conclusion: Since 8 < 65, we reject Ho. 


1. The parameters of interest are the mean blood cholesterol levels. 


2H Vi =O 
3. H,: up z0 
4. a=0.05 


5. The test statistic is w = min( w *, w^). 


6. We will reject Ho is w < Wp os = 25, since o=0.05 and n=15, the value in Appendix A, Table VIII. 


7. w' = 118 and w= 1 and so w=min(118, 1) - 1 Since 1 « 25 
8. Conclusion: Since 1 < 25, we reject Hj. 


Section 15-4 


15-31. 1. The parameters of interest are the mean image brightness’. 
2. Hy : 44 =U, 
3. H, : Ly > Uy 
4. a=0.025 
5. The test statistic is ae W, — u,,. 
o 


Wi 


6. We will reject Ho if Zo > Z0.025 = 1.96 
7.w,=40, Ll, =85.5 and O, =128.25 


_ 54 — 85.5 


Z. = 
° Hb 
Since Zo < 1.96, cannot reject Ho 
8. Conclusion: do not reject Hp. 


= —2.78 


P-value = 0.9973 


15-35. 1. The parameters of interest are the mean etch rates 
2. Ho: Uy = 4, 
3. H Ly * Uy 
4. 00.025 
5. The test statistic is | _ Wi — uw, 
0 o, 
6. We will reject Ho if Zo > Z9.025 = 1.96 
7. wi 7 55, [L,, =105 and O,, =175 
55 —105 
A = m 
13.23 


Since |Zo| < 1.96, reject Ho 
8. Conclusion: reject Hp. 


3 


P-value = 0.0001 


Section 15-5 


15-37. Kruskal-Wallis Test on strength 


mixingte N Median Ave Rank Z 
1 4 2945 9.6 0.55 
2 4 3075 12.9 2.12 
3 4 2942 9.0 0.24 
4 4 2650 2:5 -2.91 
Overall 16 8.5 


H = 10.00 DE = 3. P= 0.019 
H = 10.03 DF = 3 P = 0.018 (adjusted for ties) 
* NOTE * One or more small samples 


Reject Ho 


Supplemental 


15-43. 1. The parameter of interest is median of surface finish. 


2. Hy: -10.0 
3 H,: pt 10.0 
4. a=0.05 


5. The test statistic is the observed number of plus differences or r` = 5. 

6. We reject H, if the P-value corresponding to r' = 5 is less than or equal to 60.05. 

7. Using the binomial distribution with n=10 and p=0.5, P-value = 2P(R 25|p-0.5)-1.0 
8. Conclusion: we cannot reject Hj. We cannot reject the claim that the median is 10 win. 


15-45. The parameter of interest is the median fluoride emissions 
H,:uU=6 
H :u<6 
&=0.05 
Using Minitab (Sign Rank Test) 
Sign test of median = 6.000 versus < 6.000 


N Below Equal Above P Median 
y 15 9 2 4 0.1334 4.000 


Do not reject Ho 


15-47. 1. The parameters of interest are the median impurity levels. 


2. H5 c= 0 
3.H,: Up #0 
4. 00.01 


5. The test statistic is r= min( r`, r`). 
6. Since o=0.01 and n=8, Appendix,- Table VII gives the critical value of nR =0. We will reject 


H in favor of Hj ifr € 10. 
7.r —landr'-7 and so r=min(1,7) = 1 
8. Conclusion: cannot reject Ho. There is no significant difference in the impurity levels. 


15-49. The parameter of interest is the median fluoride emissions 
Hy:u-6 
H,:u«6ó 
05005 


Using Minitab Wilcoxon signed-rank t test 
Test of median = 6.000 versus median « 6.000 


N for Wilcoxon Estimated 
N Test Statistic P Median 
y 15 13 19.0 0.035 5.000 


Reject Ho 
The Wilcoxon signed-rank test applies to symmetric continuous distributions. The test to applies to the 
mean of the distribution. 


15-51. 1. The parameters of interest are the mean volumes 
2. Hy : My =U, 
3. HV: Ly * Uy 
4. o=0.01 
5. The test statistic is ,, — (qutnmXn-tnt-l)) | " 
2 2 1 
6. We will reject Ho if w < Wo; = 71, since o=0.01 and n;710 and n;710, the value in Appendix A, 


Table IX. 
7. w; = 42 and w, = 78 and so since 42 is less than 78, we reject Ho 


15-57. 


8. Conclusion: reject Ho 


Kruskal-Wallis Test on VOLUME 


TEMPERAT N Median Ave Rank 

70 5 1245 12.4 

75 5 1220 J9 

80 5 1170 Sue 
Overall 15 8.0 

H = 9.46 DF =2 P = 0.009 

H = 9.57 DF =2 P= 0.008 (adjusted 


Reject Hp, P-value=0.0009 


ZLUP 
s 


Chapter 16 Selected Problem Solutions 


Section 16-5 


16-3 a) X-bar and Range - Initial Study 
Charting Problem 16-3 

X-bar | Range 

ap | PN do 

UCL: + 3.0 sigma = 17.4 | UCL: + 3.0 sigma = 5.792 
Centerline = 15.09 | Centerline = 2.26 
LCL: - 3.0 sigma = 12.79 | LCL: - 3.0 sigma = 0 

| 


Test Results: X-bar One point more than 3.00 sigmas from center line. 
Test Failed at points: 4 6 7 10 12 15 16 20 


Test Results for R Chart:One point more than 3.00 sigmas from center line. 
Test Failed at points: 19 


Xbar/R Chart for x1-x3 










































































20 — 4 
q ] UCL=17.40 
S =17. 
a os = Mean=15.09 
ps: LCL=12.79 
= 
P 10 4 
Subgroup 0 10 20 
ao ] UCL=5.792 
O 9 — 
G 4 — 
aa 
(€b) 3 7] = 
2 2 \ R=2.25 
m. du 
(05) 
0 LCL=0 
b. Removed points 4, 6, 7, 10, 12, 15, 16, 19, and 20 and revised the control limits 
The control limits are not as wide after being revised X-bar UCL=17.96, CL=15.78 
LCL=13.62 and R UCL = 5.453, R-bar=2.118, LCL=0. 
The X-bar control moved down. 
Xbar/R Chart for x1-x3 
Revised Control Limits 
18 — UCL-17.95 
S 17 4 
= 
oo Mean=15.78 
E 15 — 
(39) 
S E LCL=13.62 
13 —- T T 
Subgroup 0 5 10 
6 — 
PONE UCL=5.453 
D) 
Pa 
a ^ | R-2.118 
E = 
a4 = 
” 
0 LCL=0 














C) X-bar and StDev - Initial Study 
Charting Problem 16-3 


X-bar | StDev 

UCL: + 3.0 sigma = 17.42 | UCL: + 3.0 sigma = 3.051 
Centerline = 15.09 | Centerline = 1.188 
LCL: - 3.0 sigma = 12.77 | LCL: - 3.0 sigma = 0 


Test Results: X-bar One point more than 3.00 sigmas from center line. 
Test Failed at points: 4 6 7 10 12 15 16 20 


Test Results for S Chart:One point more than 3.00 sigmas from center line. 
Test Failed at points: 19 


Xbar/S Chart for x1-x3 



































20 A 
c 
ç UCL=17.42 
= 15 Mean=15.09 
a LCL=12.77 
= 
a 10 
Subgroup 0 10 20 
> 3 UCL=3.051 
je 
A 
H 2 
[^ 
E 1 S-1.188 
(99) 
o) 
0 LCL-0 
Removed points 4, 6, 7, 10, 12, 15, 16, 19, and 20 and revised the control limits 
The control limits are not as wide after being revised X-bar UCL=17.95, CL=15.78 
LCL=13.62 and S UCL = 2.848, S-bar-1.109, LCL=0. 
The X-bar control moved down. 
Xbar/S Chart for x1-x3 
Revised Control Limits 
18 — UCL-17.95 
S 17 = 
E 16 — Mean=15.78 
E 15 — 
o we LCL-13.62 
13 q | 
Subgroup O0 5 10 
ten UCL=2.848 
© 
Q 2 
Ù 
[^ 
Q 4 _ S=1.109 
E 
(99) 
0) 
0 LCL-0 











x chart 


UCL = CL + A,r = 223 + 0.577(34.286) = 242.78 
CLh= 223 
LCL = CL — A,r = 223 — 0.577(34.286) = 203.22 


R chart 
UCL = D,r = 2.115(34.286) = 72.51 


CL = 34.286 
LCL = D, = 0(34.286) = 0 





b) 
=x =223 
~ r 34.286 
Oo =—= = 14.74 
d, 2.326 
a) X-bar and Range - Initial Study 
Charting Problem 16-7 
X-bar | Range 
RE | monet tice 
UCL: + 3.0 sigma = 0.0634706 | UCL: + 3.0 sigma = 1.95367E-3 
Centerline = 0.0629376 | Centerline = 9.24E-4 
LCL: - 3.0 sigma = 0.0624046 | LCL: - 3.0 sigma = 0 
| 
out of limits = 5 | out of limits - 2 
Chart: Both Normalize: No 
25 subgroups, size 5 O0 subgroups excluded 
Estimated 
Xbar/R Chart for x 
0.064 1 
= 1 
[o UCL=0.06347 
= 
2 0.063 Mean=0.06294 
= 
© LCL=0.06240 
o) 
0.062 
Subgroup O0 5 10 15 20 25 
0.003 
o 
e 
œ 0.002 UCL=0.001954 
ad 
c 
e 0.001 R-0.000924 
© 
o) 
0.000 LCL=0 


b) 


Xbar/S Chart for x 

0.064 1 
S 1 
Ü UCL=0.06346 
> 
2 0.063 Mean=0.06294 
E 
(C LCL=0.06241 
” 1 

0.062 
Subgroup O0 5 10 15 20 25 

0.0010 
D 
Q UCL=7.66E-04 
U) 
© 0.0005 - 
Q S=3.67E-04 
= 
[qv] 
N 

0.0000 LCL=0 


c) There are several points out of control. The control limits need to be revised. 
The points are 1, 5, 14,17, 20,21, and 22; or outside the control limits of the R chart: 6 and 15 


Section 16-6 


16-0. a) 
Individuals and MR(2) - Initial Study 
Charting Problem 15-8 

Ind. | MR(2) 
UCL: + 3.0 sigma = 60.8887 | UCL: + 3.0 sigma = 9.63382 
Centerline = 53.05 | Centerline - 2.94737 
LCL: - 3.0 sigma - 45.2113 | LCL: - 3.0 sigma = 0 
out of limits = 0 | out of limits = 0 
Chart: Both Normalize: No 

20 subgroups, size 1 O0 subgroups excluded 
Estimated 
process mean = 53.05 
process sigma = 2.61292 


mean MR = 2.94737 


There are no points beyond the control limits. The process appears to be in control. 


| and MR Chart for hardness 


————————————————————————— — — — — | UCL 760.89 

E 60 
© 

^ 55 
T Mean=53.05 
© 

> 50 

O 
C 

^ 45 LCL=45.21 

Subgroup 0 10 20 

10 UCL=9.630 
[o 
O 
Cc 
c 
D 5 
€ R=2.947 
Ó 
= 
0 LCL=0 
b) 

(=x = 53.05 
. mr 2.94737 

Ó = — = 2.613 

d, 1.128 
Section 16-7 
, —O—- TE ~ r 0344 
16-15. a) Assuming a normal distribution with u — 0.14.510 and © = P = CETT = 0.148 
2 . 


P(X < LSL) = Az < E 

14.00 —14.51 
0.148 | 

= P(Z «-345) 

-]- P(Z«345) 

= 1-0.99972 

= 0.00028 


-z< 


P(X > USL) = fz > M 
O 


0.148 
= P(Z > 3.31) 

-1- P(Z <3.31) 
= 1— 0.99953 

= 0.00047 


Therefore, the proportion nonconforming is given by 
P(X«LSL) + P(X>USL) =0.00028 + 0.00047 = 0.00075 


15.00—14.51 








b) 
PCR = USL we - 15.00—14.00 A 
6(6) 6(0.148) 
PCR, - min De x | X 2 
30 30 


9 


3(0.148) 3(0.148) 
= min[.104,1.15] 
=1.104 


Since PCR exceeds unity, the natural tolerance limits lie inside the specification limits and very few 
defective units will be produced. 


fe —14.51 14.51— € 


PCRy = PCR the process appears to be centered. 





16-19 a) Assuming a normal distribution with i — 223 and ô = = = 14. 


^ 


e 


P(X <LSL)= Az < ux 


14.74 
= P(Z < 3.60) 
= 0.00016 


P(X > USL) = Az > | 
O 


_pzs 270 — 223 
14.75 


= P(Z » 3.18) 
-1- P(Z «3.18) 
= | — 0.99926 
= 0.00074 
Probability of producing a part outside the specification limits is 0.00016+0.00074 = 0.0009 
b 
USL- LSL 270—220 | 
66) 60475) ` 
USL-¥ X= =| 


PCR = 





PCR, = minl 2 0 
O O 

3(14.75) ` 3(14.75) 

= minÍ1.06,1.19] 

=1.06 


Since PCR exceeds unity, the natural tolerance limits lie inside the specification limits and 
very few defective units will be produced. The estimated proportion nonconforming is given by 
P(X<LSL) + P(X>USL) =0.00016 + 0.00074=0.0009 


E M 
1n| — ——— .,———— 


16-23. Assuming a normal distribution with ù = 500.6 and 6 = 17.17 
 USL-LSL 525-475 |, 


PCR = - = = 
6(0 ) 6(17.17) 
PCR, =min eae | X 2 
30 30 


? 


307.17) ' 307.17) 
= min|0.474,0.50| 
= 0.474 


ES —500.6 500.6— m: 


Since the process capability ratios are less than unity, the process capability appears to be poor. 


Section 16-8 


























16-25. 
U Chart for defects 
5 1 
1 
= 3.0SL=3.811 
c 
2 
63 
© = 
E 2 U=1.942 
© 
N 
1 
04>... | 3.08L=0.07217 
0 5 10 15 20 25 
Sample Number 
samples 5 and 24 have out-of-control points. The limits need to be revised. 
b) 
U Chart for defects _ 
4 — 
UCL-3.463 
3 — 
t 
2 
O 
O ul i 
2 U=1.709 
E 
a1 
0 LCL=0 
| | | 
0 10 20 
Sample Number 
The control limits are calculated without the out-of-control points. There are no points out of control for 
the revised limits. 
16-27. 
C Chart for defects 
1 
" 20 3.0SL=19.06 
c 
Q 
Ó 
® " 
E 10 C-9.708 
[ev] 
o 
0 T —— | -3.08L-0.3609 
0 5 10 15 20 25 


Sample Number 


There are two points beyond the control limits. They are samples 5 and 24. 
The U chart and the C chart both detected out-of-control points at samples 5 and 24. 


Section 16-9 





ô 2.4664 
16-31. a) ô- = —— = —=— -1.103,u-36 
7^ dn 45 
P(30.78 « X < 37.404) 
-p 30.78 — 36 p En < 37.404 — 36 
1.103 Ô- 1.103 
= P(-4.73« Z < 1.27) = P(Z «1.27) - P(Z < —4.73) 
= 0.8980 — 0 = 0.8980 
The probability that this shift will be detected on the next sample is p = 1—0.8980 = 0.1020. 
b) Ahi TE 
p 0.102 
~ R 6. ) 3 
16-33. a) Ô EO R ô- = RR E E 
d, 2.059 Jn 44 


P(5.795 « X «15.63) 
ap 5.795 —13 , X-u : 15.63 —13 
1.64 O- 1.64 
= P(-4.39 < Z < 1.60) = P(Z «1.60) — P(Z « —4.39) 
— 0.9452 — 0 = 0.9452 





The probability that this shift will be detected on the next sample is p = 1—0.9452 = 0.0548. 


o ccs 
p 0.0548 





b) ARL — 


Section 16-10 


16-39. a) Ó - 0.1695 
b) The process appears to be out of control at the specified target level. 


CUSUM Chart for diameter 


BEEN 
0.678191 





c 0.5 
2 
0» 
Q 
a 
© 0.0 
2 
= 
5 
O 
-0.5 
-6.8E-01 
0 5 10 15 20 25 
Subgroup Number 
Supplemental 
16-43. a) 
X-bar and Range - Initial Study 
X-bar | Range 
"em | "HE 
UCL: + 3.0 sigma = 64.0181 | UCL: + 3.0 sigma = 0.0453972 
Centerline = 64 | Centerline = 0.01764 
LCL: - 3.0 sigma = 63.982 | LCL: - 3.0 sigma = 0 
| 
gut of limits = 9 | ut of limits s: 
Chart: Both Normalize: No 
Estimated 
process mean = 64 
process sigma = 0.0104194 
mean Range = 0.01764 
Xbar/R Chart for diameter 
One _sT— UI CL=64.02 
© 
œ 64.01 
= 
© 64.00 Mean=64.00 
Q 
S 63.99 
> 
63.98 —TTATLTLTLATANTNANTTA NN | LCL =63.98 
Subgroup 0 5 10 15 20 25 


Sample Range 


0.05 

do — — | UCL-0.04438 
0.04 
0.03 
one R=0.01724 
0.01 
0.00 ————————————— | LOLA 


The process is in control. 





b)ü-x-64 gies 2 = 0.0104 
d, 1.693 
>) PCR = USL > - 64.02 — 63.98 per 
60 6(0.0104) 


The process does not meet the minimum capability level of PCR = 1.33. 


d) 








A 


36 ^ 36 
= min|0.641,0.641]= 0.641 


e) In order to make this process a “six-sigma process”, the variance o^ would have to be decreased such that 
x-LSL 
3o 


USL —X Pu | bun md 
= MIN| — ———— , ———— 


PCR, — min , 
3(0.0104) ° 3(0.0104) 


PCR, = 2.0. The value of the variance is found by solving PCR, = = 2.0 for o: 


64-61 _ 
30 
60 = 64.- 61 
64.— 61 
O = 
6 


Therefore, the process variance would have to be decreased to o^ = (0.50y = 0.025. 


2.0 





= 0.50 





f) ô- -0.0104 
P(63.98 « X « 64.02) 


E ETE .X-u | E g 





0.0104 O 0.0104 


= P(-2.88 < Z < 0.96) = P(Z < 0.96) - P(Z < -2.88) 
= 0.8315 — 0.0020 = 0.8295 


The probability that this shift will be detected on the next sample is p = 1—0.8295 = 0.1705 
l l 





ARL =— = E 
p 0.705 
16-45. a) 
P Chart - Initial Study 
P Chart 
UCL: + 3.0 sigma = 0.203867 
Centerline = 0.11 
LCL: - 3.0 sigma = 0.0161331 
out of limits = 0 
Estimated 
mean P = 0.11 


sigma = 0.031289 


P Chart for def 























02 — UCL=0.2039 
c 
Oo 
5 oa = P=0.11 
8 0. 
Oo. 
LCL=0.01613 
00: ! ! 
0 10 20 


Sample Number 


There are no points beyond the control limits. The process is in control. 
b) 
P Chart - Initial Study 
Sample Size, n - 200 


























P Chart 
UCL: + 3.0 sigma = 0.176374 
Centerline = 0.11 
LCL: - 3.0 sigma = 0.0436261 
out of limits = 1 
Estimated 
mean P = 0.11 
Sigma = 00221246 
P Chart for def2 
0.19 — 1 
UCL=0.1764 
0.14 — 
c 
Ke) 
E [à : 
B P=0.11 
2 
o 0.09 — 
004 — LCL=0.04363 
| | | 
0 10 20 


Sample Number 


There is one point beyond the upper control limit. The process is out of control. 


The revised limits are: 
P Chart - Revised Limits 
Sample Size, n = 200 


P Chart 

UCL: + 3.0 sigma = 0.171704 
Centerline = 0.106316 
LCL: - 3.0 sigma = 0.0409279 
out of limits = 0 

Estimated 


mean P = 0.106316 


16-49. 


Sigma = 0.021796 


There are no points beyond the control limits. The process is now in control. 


P Chart for def2 
0.19 
UCL=0.1717 

0.14 
Cc 
[s 
Lc _ 
9 P-0.1063 
o 
& 0.09 

0.04 sO LCL =0,.04093 

0 10 20 


Sample Number 


c) A larger sample size with the same number of defective items will result in more narrow control limits. 
The control limits corresponding to the larger sample size are more sensitive. 


ARL = 1/p where p is the probability a point falls outside the control limits. 
a) U —-ug-*o andn=1 


p= P(X >UCL)+ P(X < LCL) 


O 
Ho t—--—H,-9 Ho — — Uz0 
pp. pe 0 Ra 0 0 NE 0 
= > —————— |H PZ < —  .— 
on o / 4n 


= P(Z »3-4n)* P(Z <-3-Vn) 

= P(Z >2)+P(Z < —4) when n-l 

-]-P(Z«2)-[I- P(Z « 4) -1-0.97725 - [1-1] = 0.02275 
Therefore, ARL = 1/p = 1/0.02275 = 43.9. 


b) U = uo +20 


P(X > UCL) * P(X < LCL) 
30 30 
Ho ra - 20 

= P| Z > E 


o/An o/An 


= P(Z » 3-24n)* P(Z < -3-24In) 

= P(Z >1)+ P(Z < —5) when n-l 
=1-P(Z <1)+[l-P(Z « 5) 

-1— 0.84134 4 [1—1] 

= 0.15866 


Therefore, ARL = 1/p = 1/0.15866 = 6.30. 
c) U = Ho +30 
P(X > UCL) + P(X < LCL) 
it ty, etia Hu 39 01-36 
grec 37 S Qo crea c EOS - 
yn FIZ < vn 


Epp A ——— —————— a 
o /4n o /4n 

= P(Z » 3-34 n) P(Z <-3-3Vn) 

= P(Z » 0) 4 P(Z « —6) when n=1 

-]-P(Z«0)-«[I- P(Z «6)| =1-—0.50+[1-—1] 2 0.50 

Therefore, ARL = 1/p = 1/0.50 = 2.00. 


d) The ARL is decreasing as the magnitude of the shift increases from o to 20 to 3o. The ARL will 
decrease as the magnitude of the shift increases since a larger shift is more likely to be detected earlier 
than a smaller shift. 


16-51. a) 
X-bar and Range - Initial Study 
Charting xbar 
X-bar | Range 
"em | — 
UCL: + 3.0 “sigma: e 240); 168 | UCL: + 3.0 sigma = 2.48437 
Centerline = 139.49 | Centerline sq 175 
Che. = 3.0 sigma = 138.812 | LCL: - 3.0 sigma = 0 
out of limits = 9 | OuEOF Tannese: 
Estimated 
process mean = 139.49 
process sigma - 0.505159 
mean Range = dl 5 


Problem 16-51 


140.168 
139.49 
138.812 


^j 2.48437 


1.175 





0 E 8 12 16 20 
subgroup 


There are points beyond the control limits. The process is out of control. The points are 4, 8, 10, 13, 15, 
16, and 19. 


b) Revised control limits are given in the table below: 
X-bar and Range - Initial Study 
Charting Xbar 


X-bar | Range 
MN ] ceed 
UCL: + 3.0 Sigma = 140.518 | UCL: + 3,0 Sigma, = 2.60229 
Centerline = 139.808 | Centerline = 1.23077 
LOL; = 3-0 sigma. = 139.098 | LCL: - 3.0 sigma = 0 
out of limits = 0 | out of limits = 0 
Estimated 
process mean = 139.808 
process sigma - 0.529136 
mean Range = 1.23077 
Z 
There are no points beyond the control limits the process is now in control. 
- — . R 123077 
The process standard deviation estimate is given by 6 = — = ———— = 0.529 
d5 2.326 
USL-LSL 142-138 
c) PCR = —— ——— = ————— = 1. 
66 6(0.529) 


USL-x x-LSL 
36 ` 36 


PCR, = min P= 


? 


3(0.529) 3(0.529) 
= min[138,1.14 | 
- 114 


Since PCR exceeds unity, the natural tolerance limits lie inside the specification limits and very few 
defective units will be produced. 
PCR is slightly larger than PCR, indicating that the process is somewhat off center. 

d) In order to make this process a *six-sigma process”, the variance o^ would have to be decreased such that 


— | x — LSL 
PCR, = 2.0. The value of the variance is found by solving PCR, = A — 2.0 for o: 
o 


n 142—139.808 139.808 — 38) 


139.808—138 — 
3o j 
60 = 139.808 — 138 
139.808 — 138 
(entes LUN 
6 
o = 03013 
Therefore, the process variance would have to be decreased to o^ = (0.3013)? = 0.091. 
e) o, — 0.529 
p = P(139.098 < X «140.518| u 2139.7) 


ax m X-u a - 
—pP|——————— « E LM 


2.0 





0.529 O 0.529 


X 


= P(-1.14 < Z «1.55) = P(Z «1.55) - P(Z < -1.14) 
= P(Z «1.55)- [1 - P(Z «1.14)] = 0.93943 -[1— 0.87285] = 0.8123 


The probability that this shift will be detected on the next sample is 1—p = 1—0.8123 = 0.1877. 
ARL = zc 5. 
1l-p 01877 





